Principal Associate SRE

3 days ago• Mexico City, Mexico

WeWork Reforma Latino (97001), Mexico, Ciudad de Mexico, Ciudad de Mexico

Principal Associate SRE

We're building a Site Reliability Engineering center in Mexico City and hiring Principal Associate SREs to join one of our founding teams. You'll work on payment-critical systems across the Discover Network, Diners Club International, and PULSE - contributing to settlement reliability, alert quality, observability, and automation that directly impacts millions of transactions daily.

This is a ground-floor opportunity. You'll be part of the first cohort of engineers in CDMX, working alongside experienced SRE leaders to build the operational muscle that allows Mexico City to own reliability outcomes independently. Depending on team placement, you'll focus on one of the following areas:

Settlement - ensuring batch settlement cycles complete accurately, on time, and in compliance with regulatory requirements across domestic credit/debit and international cross-border networks
Alert Signal & Observability - reducing alert noise, building automated severity classification, and creating customer impact dashboards that make incident response faster and more decisive
Reliability Automation & Platform Convergence - building automated runbooks, driving Capital One platform adoption, and developing AI-powered remediation workflows

Want more jobs like this?

Get Software Engineering jobs in Mexico City, Mexico delivered to your inbox every week.

By signing up, you agree to our Terms of Service & Privacy Policy.

What You'll Do

Build and maintain reliability tooling - observability dashboards, automated alerts, runbooks, and remediation scripts that reduce toil and improve mean time to recovery
Develop automation solutions - using Python, Java, and shell scripting to eliminate manual operational processes, from certificate rotation to compliance artifact generation
Troubleshoot and debug complex production issues - diagnose failures across distributed systems spanning on-prem data centers and AWS, identify root causes, and implement durable fixes
Contribute to observability - configure and tune monitoring in Datadog and Observe, build dashboards that surface actionable signals, and reduce unactionable alert volume
Support incident response - participate in on-call rotations, respond to production incidents, drive diagnosis, and contribute to blameless postmortems
Leverage AI tools to accelerate engineering - use agentic AI automation (Claude Code and others) to develop solutions, generate runbook drafts, and build automation agents
Manage secrets and certificates - automate rotation and provisioning, ensuring security posture without manual toil
Deliver through CI/CD pipelines - build, test, and deploy automation via continuous integration and API automation frameworks

What Success Looks Like

Independently troubleshooting and resolving production issues within your domain without escalation
At least one operational process fully automated and running in production
Contributing measurably to team OKRs - whether that's alert noise reduction, MTTR improvement, or settlement cycle reliability
Producing or improving runbooks and dashboards that your teammates and partner teams actively use

The Environment

You'll work across hybrid on-prem and cloud infrastructure supporting real-time and batch financial transaction systems at global scale. The tech stack includes Python, Java, shell scripting, AWS, Kubernetes, OpenShift, CI/CD pipelines, and API automation frameworks. Observability runs on Datadog and Observe with extensive dashboard configuration. Secret management uses HashiCorp Vault. You'll use agentic AI tools (Claude Code and others) to develop automation solutions and accelerate your engineering output. The systems span three on-prem data centers and AWS, with both modern cloud-native services and legacy payment platforms. Strong troubleshooting and debugging skills are essential.

Basic Qualifications

Professional English fluency
Bachelor's degree
Background in SRE, production operations, or reliability engineering
At least 4 years of experience in DevOps Engineering (internship experience does not apply)
4+ years of experience in at least one of the following: Java, Python, Go
At least 2 years of experience with Cloud Native technologies (Amazon Web Services, Microsoft Azure, Google Cloud Platform)
2+ years of experience with container orchestration services including Docker or Kubernetes
Experience with Shell or Bash scripting
At least 2 years of Unix or Linux system administration experience

Preferred Qualifications

Experience developing automation solutions using agentic AI tools (Claude Code, Copilot CLI)
Troubleshooting and debugging skills across distributed systems
Familiarity with payments, financial services, or other regulated high-availability domains
Knowledge or experience of Networking concepts (TCP/DNS/TLS)

At Capital One, we respect individual differences in culture, religion, and ethnicity. Likewise, we promote equal opportunities and development for all personnel. In the hiring process, we seek to provide equal employment opportunities to candidates, regardless of race, color, religion, gender, sexual orientation, marital or civil status, national origin, disability, or any other situation protected by federal, state, or local laws.

For technical support or questions about Capital One's recruiting process, please send an email to Careers@capitalone.com

Capital One does not provide, endorse nor guarantee and is not liable for third-party products, services, educational tools or other information available through this site.

Capital One Financial is made up of several different entities. Please note that any position posted in Canada is for Capital One Canada, any position posted in the United Kingdom is for Capital One Europe, any position posted in the Philippines is for Capital One Service Corp (COPSSC), and any position posted in Mexico is for Capital One Technology Labs Mexico.

Client-provided location(s): Mexico City, Mexico

Job ID: capital-R245530

Employment Type: FULL_TIME

Posted: 2026-06-28T19:45:44

Perks and Benefits

Health and Wellness
- Health Insurance
- Health Reimbursement Account
- Dental Insurance
- Vision Insurance
- Life Insurance
- Short-Term Disability
- Long-Term Disability
- FSA
- FSA With Employer Contribution
- HSA
- HSA With Employer Contribution
- On-Site Gym
- Pet Insurance
- Mental Health Benefits
- Virtual Fitness Classes
Parental Benefits
- Fertility Benefits
- Adoption Assistance Program
- Family Support Resources
- Birth Parent or Maternity Leave
- Non-Birth Parent or Paternity Leave
- Adoption Leave
Work Flexibility
- Flexible Work Hours
- Remote Work Opportunities
- Hybrid Work Opportunities
Office Life and Perks
- Commuter Benefits Program
- Casual Dress
- Happy Hours
- Snacks
- Company Outings
- On-Site Cafeteria
- Holiday Events
Vacation and Time Off
- Paid Vacation
- Paid Holidays
- Personal/Sick Days
- Leave of Absence
- Volunteer Time Off
Financial and Retirement
- 401(K) With Company Matching
- Stock Purchase Program
- Performance Bonus
- Relocation Assistance
- Financial Counseling
- Profit Sharing
Professional Development
- Tuition Reimbursement
- Promote From Within
- Mentor Program
- Shadowing Opportunities
- Access to Online Courses
- Lunch and Learns
- Internship Program
- Work Visa Sponsorship
- Leadership Training Program
- Associate or Rotational Training Program
Diversity and Inclusion
- Diversity, Equity, and Inclusion Program
- Employee Resource Groups (ERG)
- Founder led

Company Videos

Hear directly from employees about what it is like to work at Capital One.

Want more jobs like this?

Perks and Benefits

Health and Wellness

Parental Benefits

Work Flexibility

Office Life and Perks

Vacation and Time Off

Financial and Retirement

Professional Development

Diversity and Inclusion

Company Videos