Software Engineer III-SRE
There's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.
As a Site Reliability Engineer III at JPMorgan Chase within the Employee Platform, you will solve complex and broad business problems with simple and straightforward solutions. Through code and cloud infrastructure, you will configure, maintain, monitor, and optimize applications and their associated infrastructure to independently decompose and iteratively improve on existing solutions. You are a significant contributor to your team by sharing your knowledge of end-to-end operations, availability, reliability, and scalability of your application or platform.
Job responsibilities
- Collaborate with software engineers and partner teams to design, develop, test, and implement highly available, reliable, and scalable application solutions
- Implement infrastructure, configuration, and network as code for applications and platforms
- Design and implement deployment approaches using automated CI/CD pipelines
- Resolve complex technical problems by engaging technical experts, stakeholders, and team members
- Define, measure, and review service level indicators and use service level objectives to proactively prevent customer impact
- Support and advocate for adoption of site reliability engineering best practices within the team
- Automate operational processes, runbooks, and recovery procedures to reduce toil and improve reliability
- Establish observability across services, including metrics, logs, and traces, to enable rapid detection and diagnosis
- Perform incident response, post‑incident reviews, and root‑cause analysis to drive corrective actions
- Conduct capacity planning, performance tuning, and resilience testing to meet growth and reliability goals
- Document architectures, operational procedures, and deployment configurations to ensure repeatability and compliance
Required qualifications, capabilities, and skills
Want more jobs like this?
Get jobs in Bangalore, India delivered to your inbox every week.

- Formal training or certification on software engineering concepts and 3+ years applied experience
- Hands on experience, as a software engineer and/or site reliability engineer
- Program proficiently in Python and/or Java for large‑scale data handling and migration
- Operate platforms and applications on public, private, or hybrid cloud infrastructures
- Hold formal training or certification in site reliability engineering (SRE) and apply at least 3 years of SRE experience
- Implement observability using white‑ and black‑box monitoring, SLO‑based alerting, and telemetry with Grafana, Dynatrace, Prometheus, Datadog, and Splunk
- Apply SRE culture and principles in real‑world applications or platforms
- Apply knowledge of software applications and technical processes across Cloud, Artificial Intelligence, and Machine Learning
- Build and maintain CI/CD pipelines with tools such as Jenkins, GitLab, and Terraform
- Troubleshoot networking technologies and issues; collaborate effectively in large teams, communicate clearly, remove roadblocks proactively, and innovate while staying current with emerging technologies
Preferred qualifications, capabilities, and skills - Solve complex, mission‑critical problems across one or more technology domains
- Develop automated tools, systems, and services spanning multiple technology domains
- Apply working knowledge of infrastructure components such as routers, load balancers, cloud products, containers, compute, storage, and networks
- Debug and troubleshoot systems; implement service‑level changes and manage operations with monitoring and log analysis tools
Perks and Benefits
Health and Wellness
Parental Benefits
Work Flexibility
Office Life and Perks
Vacation and Time Off
Financial and Retirement
Professional Development
Diversity and Inclusion