Skip to main contentA logo with &quat;the muse&quat; in dark blue text.

IND - Staff Engineer, Reliability

Yesterday Hyderabad, India

IND - Staff Engineer, Reliability - GCC070
We're determined to make a difference and are proud to be an insurance company that goes well beyond coverages and policies. Working here means having every opportunity to achieve your goals - and to help others accomplish theirs, too. Join our team as we help shape the future.

Key Responsibilities

  • Partner with Enterprise governors to ascertain key reliability, security, and resilience requirements set by The Hartford and bring those requirements into the Platform team for implementation
  • Patternize resilience capabilities into useful tools, services, and products to be used by customers to ensure users of the Platform build fault-tolerant systems
  • Develop governing frameworks to ensure each release is compliant with the standards we expect
  • Liaise with key business and technical customers to understand predictive applications and their infrastructure. Through this consultation, you would be working with them to build resilience and reliability capabilities into their application.
  • Drive IM, cloud ops , and RE efforts across the platform by applying industry best practices and maturing existing the problem management lifecycle by building standards and contributing to runbooks, standard operating procedures, and incident management lifecycle
  • Performance engineering of deployed analytics and AI solutions across the portfolio to ascertain enhancement opportunities


Required Skills & Experience :

  • 4 + years of experience programming in Python to build automation tools, operational scripts, and platform support capabilities, including infrastructure and reliability automation.
  • 4+ years of experience using Infrastructure as Code to provision and manage cloud environments, including Terraform and/or CloudFormation, with a focus on repeatability, security, and scalability.
  • 2-3 years of experience deploying and operating systems on public cloud platforms such as AWS and/or Google Cloud Platform, including familiarity with serverless architectures, multi-region deployments, and recoverability strategies.
  • 4+ years of experience designing and operationalizing resilience, reliability, and disaster recovery capabilities for distributed systems and ML/AI platforms, including performance engineering and fault-tolerant system design.
  • 2+ years of experience building and maintaining CI/CD pipelines using tools such as GitHub and Jenkins, including embedding security checks, compliance gates, and automated validation into deployment workflows.
  • 4+ years of experience applying core reliability engineering concepts, including authoring runbooks, operational guides, and automation to support resilient platform operations.
  • 3+ years of experience designing observability solutions, including logging, monitoring, and alerting using tools such as Splunk, with dashboards and metrics that surface service health, SLO/SLA adherence, and early-warning signals for ML and data workloads.
  • 4+ years of hands-on experience with incident and problem management practices, including ITIL-based processes, postmortems, and blameless root cause analysis, as well as disaster recovery planning, failover testing, and resilience frameworks such as FMEA.
  • Foundational knowledge of networking fundamentals and operations architecture to support IT service management (ITSM) automation and distributed system reliability.

Want more jobs like this?

Get Science and Engineering jobs in Hyderabad, India delivered to your inbox every week.

Job alert subscription
  • Familiarity with relational databases such as Snowflake or other RDBMS platforms, with an understanding of data reliability, availability, and consistency requirements in analytics and ML environments.

Client-provided location(s): Hyderabad, India
Job ID: Hartford_Fire_Insurance_Company_FGB-3999
Employment Type: FULL_TIME
Posted: 2026-04-17T18:55:14

Perks and Benefits

  • Health and Wellness

    • Health Insurance
    • Health Reimbursement Account
    • Dental Insurance
    • Vision Insurance
    • Life Insurance
    • Short-Term Disability
    • Long-Term Disability
    • On-Site Gym
    • Mental Health Benefits
    • Virtual Fitness Classes
    • Fitness Subsidies
    • FSA
    • HSA
  • Parental Benefits

    • Birth Parent or Maternity Leave
    • Non-Birth Parent or Paternity Leave
    • Fertility Benefits
    • Adoption Assistance Program
    • Family Support Resources
    • Adoption Leave
  • Work Flexibility

    • Hybrid Work Opportunities
    • Remote Work Opportunities
    • Flexible Work Hours
  • Office Life and Perks

    • Commuter Benefits Program
    • Casual Dress
    • On-Site Cafeteria
    • Company Outings
    • Holiday Events
  • Vacation and Time Off

    • Paid Vacation
    • Paid Holidays
    • Volunteer Time Off
    • Personal/Sick Days
  • Financial and Retirement

    • 401(K) With Company Matching
    • Stock Purchase Program
    • Performance Bonus
    • Relocation Assistance
    • Financial Counseling
    • Profit Sharing
  • Professional Development

    • Internship Program
    • Leadership Training Program
    • Associate or Rotational Training Program
    • Tuition Reimbursement
    • Promote From Within
    • Mentor Program
    • Shadowing Opportunities
    • Access to Online Courses
    • Lunch and Learns
    • Learning and Development Stipend
  • Diversity and Inclusion

    • Employee Resource Groups (ERG)
    • Diversity, Equity, and Inclusion Program

Company Videos

Hear directly from employees about what it is like to work at The Hartford.