Lead Architect - SRE & Observability
Who We Are
Applied Materials is a global leader in materials engineering solutions used to produce virtually every new chip and advanced display in the world. We design, build and service cutting-edge equipment that helps our customers manufacture display and semiconductor chips - the brains of devices we use every day. As the foundation of the global electronics industry, Applied enables the exciting technologies that literally connect our world - like AI and IoT. If you want to push the boundaries of materials science and engineering to create next generation technology, join us to deliver material innovation that changes the world.
What We Offer
Location:
Bangalore,IND
You'll benefit from a supportive work culture that encourages you to learn, develop, and grow your career as you take on challenges and drive innovative solutions for our customers. We empower our team to push the boundaries of what is possible-while learning every day in a supportive leading global company. Visit our Careers website to learn more.
At Applied Materials, we care about the health and wellbeing of our employees. We're committed to providing programs and support that encourage personal and professional growth and care for you at work, at home, or wherever you may go. Learn more about our benefits.
About the GIS CAMO & SRE Team
The Cybersecurity Asset Management and Observability (CAMO) and Site Reliability Engineering (SRE) teams within GIS are at the forefront of ensuring operational excellence, resilience, and visibility across hybrid cloud and datacenter infrastructures. The CAMO team is responsible for enterprise-wide observability, IT asset visibility, event correlation, compliance monitoring, and tooling strategy. The SRE team ensures uptime, availability, and automation across mission-critical services through innovative engineering and DevOps practices.
Role Summary:
As a Lead Architect - SRE & Observability, you will play a key leadership role in designing, scaling, and governing monitoring and observability platforms, while ensuring the reliability of infrastructure and application services. You will lead cross-functional initiatives, establish technical standards, and drive automation, telemetry, and incident response maturity across the enterprise.
Key Responsibilities:
- Monitoring & Observability (CAMO Focus)
- Architect and lead end-to-end observability strategies (logs, metrics, traces) across on-premises, private, and public cloud environments.
- Manage and mature enterprise observability solutions across complex architectures.
- Define standards for telemetry data collection, correlation, and alerting for distributed systems.
- Collaborate with application and infrastructure teams to ensure instrumentation coverage and SLO/SLI definition.
- Lead the migration and consolidation of legacy monitoring platforms to modern observability stacks.
- Enable proactive problem detection, root cause analysis, and capacity forecasting using analytics and AI/ML insights.
- Site Reliability Engineering (SRE Focus)
- Define and implement SRE principles (SLIs/SLOs, error budgets, chaos testing, postmortems, etc.) across supported services.
- Design and manage infrastructure automation, CI/CD pipelines, AI/ML solutions, runbooks, and self-healing systems.
- Lead incident response coordination during major outages and drive post-incident analysis and systemic fixes.
- Collaborate with DevOps, Cloud, and Security teams to enforce resiliency, observability, and reliability as core design principles.
- Mentor junior SREs and CAMO engineers to grow technical and operational expertise.
Technical Skills:
- Expertise in designing and implementing observability frameworks including logs, metrics, and traces across hybrid environments (on-premises, private cloud, public cloud).
- Strong understanding of distributed systems, microservices architecture, and telemetry pipelines.
- Proficiency in infrastructure automation and configuration management using tools like Terraform, Ansible, and scripting languages (Python, Shell, etc.).
- Experience with CI/CD pipelines, incident response automation, and self-healing systems.
- Familiarity with container orchestration platforms (e.g., Kubernetes) and virtualization technologies.
Want more jobs like this?
Get jobs in Bangalore, India delivered to your inbox every week.

Functional Knowledge:
- Experience in implementing cyber asset management and security observability principles.
- Familiarity with AIOPS, ITSM, CAASM tools and configuration management databases.
- Exposure to compliance and governance frameworks such as CIS, NIST for cyber resilience, observability and alerting.
- Relevant certifications in observability, cloud platforms, SRE, or security domains.
Qualifications:
- Bachelor's or Master's degree in computer science, Engineering, or related field.10-15 years of experience in IT Operations, SRE, DevOps, or Monitoring Engineering roles.
- Strong expertise in modern observability platforms and telemetry pipelines.
- Experience with hybrid environments including virtualization, container orchestration, and cloud platforms.
- Proven track record in automation, telemetry governance, and infrastructure as code.
- Excellent incident management, communication, and stakeholder engagement skills.
Interpersonal Skills
- Communicates difficult concepts and negotiates with others to adopt a different point of view
Additional Information
Time Type:
Full time
Employee Type:
Assignee / Regular
Travel:
Yes, 10% of the Time
Relocation Eligible:
Yes
Applied Materials is an Equal Opportunity Employer. Qualified applicants will receive consideration for employment without regard to race, color, national origin, citizenship, ancestry, religion, creed, sex, sexual orientation, gender identity, age, disability, veteran or military status, or any other basis prohibited by law.
Perks and Benefits
Health and Wellness
- Health Insurance
- Dental Insurance
- Vision Insurance
- Life Insurance
- Short-Term Disability
- Long-Term Disability
- FSA With Employer Contribution
- HSA With Employer Contribution
- Health Reimbursement Account
- FSA
- HSA
- On-Site Gym
- Pet Insurance
- Mental Health Benefits
Parental Benefits
- Birth Parent or Maternity Leave
- Fertility Benefits
- Adoption Leave
- Non-Birth Parent or Paternity Leave
- Adoption Assistance Program
- Family Support Resources
- On-site/Nearby Childcare
Work Flexibility
- Flexible Work Hours
- Work-From-Home Stipend
Office Life and Perks
- On-Site Cafeteria
- Commuter Benefits Program
- Casual Dress
- Holiday Events
Vacation and Time Off
- Paid Vacation
- Paid Holidays
- Personal/Sick Days
- Unlimited Paid Time Off
- Leave of Absence
Financial and Retirement
- 401(K) With Company Matching
- Stock Purchase Program
- 401(K)
- Performance Bonus
- Relocation Assistance
- Financial Counseling
Professional Development
- Tuition Reimbursement
- Access to Online Courses
- Internship Program
- Work Visa Sponsorship
- Associate or Rotational Training Program
- Promote From Within
- Mentor Program
- Shadowing Opportunities
Diversity and Inclusion