Reliability Engineer - Production Support
Reliability Engineer - IE08GE
We're determined to make a difference and are proud to be an insurance company that goes well beyond coverages and policies. Working here means having every opportunity to achieve your goals - and to help others accomplish theirs, too. Join our team as we help shape the future.
This role is within the Reliability Engineering team that combines software engineering and systems engineering to build and run Enterprise Contact Center platforms, systems, and integrated applications - both our internally critical and externally visible systems - are resilient, have reliability, availability, and innovation appropriate to our customer's needs. The Engineer must bring "Everything as Code" mindset to their jobs.
Much of the work in the Team shall focus on optimizing existing software systems, building infrastructure, and reducing toil through automation. On the RE Team, you will have the opportunity to manage the complex challenges of scale which are unique to the Enterprise Contact Center domain.
This role will have a Hybrid work schedule, with the expectation of working in an office (Columbus, OH, Chicago, IL, Hartford, CT or Charlotte, NC) 3 days a week (Tuesday through Thursday).
Responsibilities:
Operations Focus:
- Assists with instrumenting code/application technology stack to enable the generation of relevant metrics on overall technology health - availability, performance, quality, currency, and resiliency. (10%)
- Contributes to the architecture and software engineering teams to influence the technical strategy for the organization, keeping in mind its cross-functional impacts, integration across the organization, and architecture rationalization. (10%)
- DevSecOps Solution Responsibilities: (10%)
Want more jobs like this?
Get jobs in Hartford, CT delivered to your inbox every week.

- Build the necessary tooling, alerts, and response mechanisms to identify and address reliability risks leveraging automation to support problem prevention, detection, mitigation, and resolution.
- Enhance the delivery flow by building the appropriate solutions to increase delivery speed while adhering to technology standards for sustained reliability.
- Progressively implement preventative controls and build increased automation and self-healing capabilities. Continue to improve cost efficiency baselines.
- Promote and implement innovative solutions
- IT / Data Engineering Responsibilities: (35%)
- Participate in the elimination of toil by creating automation or engineering autonomous solutions requiring minimal manual effort (e.g., covering OS patching to CICD to infrastructure configuration mgmt.)
- Ability to build reliable and performant data systems to support data delivery
- Ability to build scalable SDLC environments using COTS, SaaS, PaaS products to support Data Pipeline needs
- IT Ops Responsibilities: (35%)
- Promote operational excellence. Participate in the triaging and service restoration of all high impact incidents in order to minimize the mean time to service restoration and impact to the business. Demonstrate end-to-end ownership.
- Partner with infrastructure Product teams to design and implement intelligent incident routing, enhanced monitoring/alerting capabilities and automated service restoration processes. Take proactive measures to prevent high impactful incidents.
- Achieve and maintain the continuity of Hartford and third-party assets that support a business function. Accountable for keeping the IT application and infrastructure metadata repositories current.
Application Focus:
- Promote the reliability (such as availability, capacity, performance) of the solution. Participate in on-call activities to mitigate incidents as quickly as possible. (10%)
- Participate in the development of effective tooling, alerts, and response to both identify and address reliability risks including automatic problem detection and mitigation. (55%)
- Build and operate reliable and performant data systems and services that enable the business to make data-driven decisions
- Engage with the service consumers to define functional and non-functional requirements for the solutions. (10%)
- Participate with training, best-practices, and sample code to enable consumers to take advantage of the solution to the best degree possible. (10%)
- Partner with the RE and Software Engineering teams to collect ongoing feedback and improvement backlog items for the Infrastructure Product teams. (10%)
- Leverage analyst reviews, vendor offerings, client success stories to evolve the portfolio. Participate in relevant vendor / community / industry conferences. (5%)
- Build and maintain Governance policies especially to Data masking (PII management), data lifecycle management needs
Qualifications:
- DevOps Mindset
- Enjoy solving difficult engineering problems and don't mind getting your hands dirty
- Maintains personal responsibility and commitment to respond to and address incidents quickly
- Good Software engineering skills ideally with experience in Java, Python, .Net and/or Go.
- Understanding of Linux system internals, are familiar with the TCP/IP stack, network routing and load balancing
- Approach troubleshooting systematically and have a deep sense of ownership for whatever you work on
- Ability to root cause sources of instability in a high-traffic, distributed system
- Experience with configuration and troubleshooting of Linux, Java/Scala, Docker / Kubernetes systems
- Understanding of large-scale complex systems from a reliability perspective
- Passion for resolving reliability issues and identifying strategies to mitigate going forward
- Knowledge of Performance and Observability tools such as Dynatrace, SumoLogic, TrueSight, CloudWatch, CloudTrail, AWS X-Ray, Splunk, and related tools.
- Willingness to work in an ever-changing environment
- Passioned about automation and innovations that improve productivity
- Experience with IAC tools such as Terraform, Cloud Formation etc.
- Degree in Computer Science or related discipline with a minimum of 3-5 years of work experience in IT systems operations and/or application development.
- Some experience in an RE role. Experience with building, supporting Enterprise Contact Center platforms, systems, omni channel applications (voice, chat, SMS, email, social, etc.)
- Certifications/Licenses/Badges (as applicable): AWS Certified Cloud Practitioner, AWS Certified Developer, Microsoft Certified Azure Fundamentals, Microsoft Certified Azure Developer, AWS Amazon Connect Fundamentals, AWS Amazon Connect Communications Specialist, AWS Amazon Connect Developer,
Candidate must be authorized to work in the US without company sponsorship. The company will not support the STEM OPT I-983 Training Plan endorsement for this position.
Compensation
The listed annualized base pay range is primarily based on analysis of similar positions in the external market. Actual base pay could vary and may be above or below the listed range based on factors including but not limited to performance, proficiency and demonstration of competencies required for the role. The base pay is just one component of The Hartford's total compensation package for employees. Other rewards may include short-term or annual bonuses, long-term incentives, and on-the-spot recognition. The annualized base pay range for this role is:
$90,320 - $135,480
Equal Opportunity Employer/Sex/Race/Color/Veterans/Disability/Sexual Orientation/Gender Identity or Expression/Religion/Age
About Us | Our Culture | What It's Like to Work Here | Perks & Benefits
Perks and Benefits
Health and Wellness
- Health Insurance
- Health Reimbursement Account
- Dental Insurance
- Vision Insurance
- Life Insurance
- Short-Term Disability
- Long-Term Disability
- On-Site Gym
- Mental Health Benefits
- Virtual Fitness Classes
- Fitness Subsidies
- FSA
- HSA
Parental Benefits
- Birth Parent or Maternity Leave
- Non-Birth Parent or Paternity Leave
- Fertility Benefits
- Adoption Assistance Program
- Family Support Resources
- Adoption Leave
Work Flexibility
- Hybrid Work Opportunities
- Remote Work Opportunities
- Flexible Work Hours
Office Life and Perks
- Commuter Benefits Program
- Casual Dress
- On-Site Cafeteria
- Company Outings
- Holiday Events
Vacation and Time Off
- Paid Vacation
- Paid Holidays
- Volunteer Time Off
- Personal/Sick Days
Financial and Retirement
- 401(K) With Company Matching
- Stock Purchase Program
- Performance Bonus
- Relocation Assistance
- Financial Counseling
- Profit Sharing
Professional Development
- Internship Program
- Leadership Training Program
- Associate or Rotational Training Program
- Tuition Reimbursement
- Promote From Within
- Mentor Program
- Shadowing Opportunities
- Access to Online Courses
- Lunch and Learns
- Learning and Development Stipend
Diversity and Inclusion
- Employee Resource Groups (ERG)
- Diversity, Equity, and Inclusion Program
Company Videos
Hear directly from employees about what it is like to work at The Hartford.