Sr Lead Infrastructure Engineer - Infrastructure Monitoring
We have an opportunity to impact your career and provide an adventure where you can push the limits of what's possible.
As a Sr Lead Infrastructure Engineer - Infrastructure Monitoring at JPMorgan Chase within the Corporate Technology Enterprise Observability Platforms team, you will lead the modernization of infrastructure monitoring into a strategic, secure, scalable, and automation-enabled observability platform, strengthening firmwide resilience and delivering trusted operational insights.
You will be a hands-on technical contributor who drives adoption and partners across infrastructure, application, and SRE teams to improve telemetry collection and signal quality, modernize event-to-incident workflows, and enable AIOps-driven reliability improvements aligned to business objectives.
Job responsibilities
- Lead the modernization of the infrastructure monitoring platform, defining target-state architecture and roadmap while balancing near-term delivery with long-term resiliency, scalability, security, and usability goals
- Engineer, operate, and continuously improve enterprise monitoring platforms to meet availability, performance, scale, and security requirements
- Own platform design and architecture for telemetry collection and integration across metrics, logs, events, and traces, including OpenTelemetry patterns where applicable
- Drive large-scale enterprise onboarding across Linux, Windows, and complex network estates, including lifecycle management, versioning/upgrade strategies, and governance controls
- Standardize onboarding patterns (agents/collectors, configuration baselines, dashboards, alerting, metadata, and runbooks) to enable safe, repeatable adoption
- Improve signal quality and actionability through baselining, threshold strategy, noise reduction, enrichment, and topology/context alignment to reduce MTTR and operational overhead
- Develop and maintain production-grade automation, services, and configuration-as-code; establish engineering standards and conduct rigorous reviews for reliability, security, and maintainability
- Reduce operational toil through automation and CI/CD-driven configuration management, including infrastructure-as-code patterns (e.g., Terraform)
- Lead production health and operational excellence for the monitoring platform, including incident triage, root-cause analysis, and corrective/preventative actions
- Partner with infrastructure, application, and SRE teams to align platform capabilities to SLIs/SLOs, operational readiness, and continuous improvement objectives
- Advance AIOps capabilities (e.g., correlation, anomaly detection, guided remediation) through experimentation, proofs of concept, and governed rollouts, while mentoring junior engineers and fostering a strong engineering culture
Required qualifications, capabilities, and skills
- Formal training or certification on infrastructure engineering concepts and 5+ years applied experience
- Demonstrated experience owning/operating enterprise-scale monitoring/observability platforms in production, and designing & delivering monitoring solutions across large Linux and Windows estates.
- Strong expertise with enterprise-grade operating systems (Windows Server and/or Enterprise Linux), including secure configuration, patching, and vulnerability remediation in regulated environments.
- Strong understanding of telemetry concepts (metrics, logs, traces, events) and practical OpenTelemetry collection and integration patterns.
- Strong infrastructure knowledge across compute, networking, storage, databases, integration patterns, scaling, resiliency, and performance.
- Advanced proficiency in automation and scripting (Python, Ansible, PowerShell, Bash) with strong use of CI/CD for controlled change and safe rollout.
- Hands-on experience with infrastructure-as-code for repeatable, governed provisioning and deployments (e.g., Terraform).
- Extensive experience operating in hybrid infrastructure environments, including enterprise on-prem platforms and public/private cloud, including migration enablement and cloud operational patterns.
- Hands-on experience with data stores such as MS SQL Server, Oracle, and Cassandra and/or Cloud Native Databases.
- Strong collaboration skills, with the ability to partner effectively across infrastructure, application, and SRE teams to align observability capabilities.
Preferred qualifications, capabilities, and skills
- Experience operating large-scale enterprise monitoring platforms (e.g., Tivoli, SMARTS, IBM Instana, DX NetOps, ITNM, Netcool Suite) with deep operational ownership.
- Experience with modern observability ecosystems including Splunk, Dynatrace, Grafana, Prometheus, and multi-tool interoperability patterns.
- Experience with Kubernetes (e.g., EKS) for container orchestration and production operations.
- Experience implementing AIOps workflows such as noise reduction, anomaly detection, probable root-cause analysis, and guided remediation with appropriate governance.
- Experience with topology-driven monitoring and event correlation in large, distributed infrastructure environments.
- Experience defining and operationalizing SLOs, error budgets, and reliability metrics across platform services.
- Experience with network monitoring.
ABOUT US
JPMorganChase, one of the oldest financial institutions, offers innovative financial solutions to millions of consumers, small businesses and many of the world's most prominent corporate, institutional and government clients under the J.P. Morgan and Chase brands. Our history spans over 200 years and today we are a leader in investment banking, consumer and small business banking, commercial banking, financial transaction processing and asset management.
We offer a competitive total rewards package including base salary determined based on the role, experience, skill set and location. Those in eligible roles may receive commission-based pay and/or discretionary incentive compensation, paid in the form of cash and/or forfeitable equity, awarded in recognition of individual achievements and contributions. We also offer a range of benefits and programs to meet employee needs, based on eligibility. These benefits include comprehensive health care coverage, on-site health and wellness centers, a retirement savings plan, backup childcare, tuition reimbursement, mental health support, financial coaching and more. Additional details about total compensation and benefits will be provided during the hiring process.
We recognize that our people are our strength and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company. We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants' and employees' religious practices and beliefs, as well as mental health or physical disability needs. Visit our FAQs for more information about requesting an accommodation.
JPMorgan Chase & Co. is an Equal Opportunity Employer, including Disability/Veterans
ABOUT THE TEAM
Our professionals in our Corporate Functions cover a diverse range of areas from finance and risk to human resources and marketing. Our corporate teams are an essential part of our company, ensuring that we're setting our businesses, clients, customers and employees up for success.