Skip to main contentA logo with &quat;the muse&quat; in dark blue text.

Site Reliability Engineer (SRE)

2 days ago Santiago, Chile

Job Description - Site Reliability Engineer (SRE)
Role Purpose
The Site Reliability Engineer (SRE) is responsible for ensuring the reliability, availability, and performance of digital services in production, balancing service stability with the ability to deliver change at speed.
The role focuses on strengthening operational resilience through engineering, automation, and proactive reliability practices, working closely with application and platform teams.
Scope of the Role
The locally applied SRE role covers:

  • Production digital services (applications, platforms, data products)
  • Associated infrastructure (cloud, CI/CD pipelines, integrations)
  • Continuous operations (24/7 reliability mindset, not necessarily shift-based)
  • Production changes (deployments, configurations)
  • Incidents, problems, and service degradations
  • Continuous improvement of stability and operational efficiency
Key Responsibilities
Service Reliability & Availability
  • Define, implement, and maintain SLIs and SLOs
    (availability, latency, error rates)
  • Continuously monitor service health and anticipate degradations
  • Ensure services operate within business-agreed reliability thresholds
  • Manage reliability trade-offs between speed and stability
Incident & Problem Management
  • Lead or coordinate response to relevant incidents (L2/L3)
  • Ensure:
    • Rapid and structured diagnosis
    • Safe service restoration
    • Clear and effective communication
  • Facilitate blameless postmortems
  • Convert recurring incidents into engineering improvement backlog
  • Drive long-term remediation rather than reactive firefighting
Automation & Operational Excellence
  • Identify repetitive and manual operational tasks
  • Design and implement automation for:
    • Deployments
    • Monitoring and alerting
    • Health checks
    • Basic recovery and self-healing (where applicable)
  • Reduce toil and increase system resilience through engineering solutions
Change Governance & Production Readiness
  • Support vendor and internal team change tracking
  • Ensure changes:
    • Are traceable
    • Have defined rollback strategies
    • Minimize operational risk
  • Validate operational readiness before production
  • Participate early in solution and architecture design from a reliability perspective (early involvement)
Metrics, Observability & Continuous Improvement
  • Define and maintain near real-time operational KPIs ("service pulse")
  • Ensure every deviation has:
    • Clear ownership
    • Defined corrective actions
  • Prevent reactive operations by driving data-driven decision making
  • Support identification, prioritization, and planning of technical debt remediation
What This Role Is Not
The SRE role will not be:
  • A dedicated incident operator only
  • An advanced Service Desk
  • The sole owner of service stability (reliability is shared)
  • A gatekeeper blocking changes without technical justification
  • The owner of contractual MOPs
  • A commercial or account management role
  • The customer-side account or delivery lead
Experience & Profile (Indicative)
  • Proven experience as SRE, Production Engineer, or similar role
  • Strong background in production systems and reliability engineering
  • Experience working with:
    • Cloud platforms
    • CI/CD pipelines
    • Monitoring and observability tools
  • Comfortable operating in product-oriented or POD-based team models
  • Strong problem-solving, communication, and collaboration skills
Operating Model Alignment
  • Works embedded or as an enabling function with PODs
  • Focused on enablement and reliability patterns, not centralized control
  • Promotes shared ownership of reliability

Want more jobs like this?

Get Science and Engineering jobs in Santiago, Chile delivered to your inbox every week.

Job alert subscription
Client-provided location(s): Santiago, Chile
Job ID: Infosys-147300BR
Employment Type: OTHER
Posted: 2026-04-23T18:51:57

Perks and Benefits

  • Health and Wellness

    • Health Insurance
    • Life Insurance
    • HSA
    • Short-Term Disability
  • Parental Benefits

    • Birth Parent or Maternity Leave
    • Non-Birth Parent or Paternity Leave
    • On-site/Nearby Childcare
  • Work Flexibility

    • Office Life and Perks

      • Commuter Benefits Program
    • Vacation and Time Off

      • Paid Vacation
      • Paid Holidays
      • Personal/Sick Days
      • Sabbatical
    • Financial and Retirement

      • 401(K)
      • Relocation Assistance
    • Professional Development

      • Learning and Development Stipend
    • Diversity and Inclusion

      • Employee Resource Groups (ERG)