Skip to main contentA logo with &quat;the muse&quat; in dark blue text.

Manager of Site Reliability Engineering

1 week ago Alpharetta, GA

Why UKG:

At UKG, the work you do matters. The code you ship, the decisions you make, and the care you show a customer all add up to real impact. Today, tens of millions of workers start and end their days with our workforce operating platform. Helping people get paid, grow in their careers, and shape the future of their industries. That's what we do.

We never stop learning. We never stop challenging the norm. We push for better, and we celebrate the wins along the way. Here, you'll get flexibility that's real, benefits you can count on, and a team that succeeds together. Because at UKG, your work matters-and so do you.

Role Description

Site Reliability Managers at UKG have a breadth of knowledge encompassing all aspects of service delivery and management. This SRE role is primarily responsible for application reliability, performance, and operability as software runs on the underlying platform. The team focuses on how applications behave in production - including scalability, stability, resource usage, and failure recovery - rather than feature development.

They lead and grow teams that develop solutions to increase resiliency and support our Cloud Engineering and Infrastructure. This can include building and managing CI/CD deployment pipelines, automated testing, capacity planning, performance analysis, monitoring, alerting, chaos engineering, and automation.

Site Reliability Managers are passionate about learning and evolving with current technology trends and enabling their teams to do the same. They strive to innovate and are relentless in pursuing a flawless customer experience. They have an "automate everything" mindset, helping us bring value to our customers by leading their teams. Deploy services with incredible speed, consistency, and availability.

Job Responsibilities:
• Be a Technology Leader by driving the roadmap execution and running the project(s) while planning new ones
• Help drive change across the company, working towards a common methodology based around Site Reliability Engineering and Solid System Engineering practices
• Lead the team in driving further adoption of Site Reliability practices such as Chaos engineering, SLOs, Error Budgets, release safety, load testing, and disaster recovery strategies
• Build teams through hiring and people growth while balancing your ownership workload through delegation and define and review individual and team goals (OKRs)
• Responsible for guiding and encouraging the personal and technical development, engagement, and growth of your direct reports
• Own application performance, scalability, and availability in production environments
• Diagnose and resolve systemic reliability issues across application, OS, and infrastructure layers

Want more jobs like this?

Get jobs in Alpharetta, GA delivered to your inbox every week.

Job alert subscription

• Lead major incident response and act as the escalation point for platform-related reliability issues
• Ensure post-incident reviews result in measurable improvements to platform stability and application performance
• Partner with application teams to influence design decisions that impact runtime reliability
• Collaborate cross organization to successfully complete successful delivery with the wider functions, including but not limited to Security, Architecture, Operations and Product Managers

Qualifications

Basic Qualifications:
• Engineering degree, or a related technical discipline, or equivalent work experience
• Knowledge of Public Cloud based applications & Containerization Technologies
• Demonstrated understanding of best practices in metric generation and collection, log aggregation pipelines, time-series databases, and distributed tracing
• Experience transforming teams and successfully leading them through change
• 5+ year of people management experience leading a technical team
• Deep understanding of Windows Server internals (memory management, threading, I/O, services)
• Experience with .NET runtime behavior (GC, memory leaks, thread pools, IIS)
• Performance tuning of monolithic .NET applications in production environments

Preferred Qualifications:
• Experience working in a GCP Cloud environment
• Experience with hiring SRE, DevOps, or similar engineering team

Company Overview:

UKG is the Workforce Operating Platform that puts workforce understanding to work. With the world's largest collection of workforce insights, and people-first AI, our ability to reveal unseen ways to build trust, amplify productivity, and empower talent, is unmatched. It's this expertise that equips our customers with the intelligence to solve any challenge in any industry - because great organizations know their workforce is their competitive edge. Learn more at ukg.com.

Equal Opportunity Employer

UKG is an equal opportunity employer. We evaluate qualified applicants without regard to race, color, disability, religion, sex, age, national origin, veteran status, genetic information, and other legally protected categories. View The EEO Know Your Rights poster UKG participates in E-Verify. View the E-Verify posters here.

It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability.

Disability Accommodation in the Application and Interview Process

For individuals with disabilities that need additional assistance at any point in the application and interview process, please email UKGCareers@ukg.com.

The pay range for this position is $129,500 to $190,000. The actual base pay offered may vary depending on skills, experience, job-related knowledge and work location. In addition to base pay, employees may be eligible to participate in a performance-based bonus plan and to receive restricted stock unit awards as part of total compensation. Learn more about UKG's benefits and rewards at https://www.ukg.com/about-us/careers/benefits

Client-provided location(s): Alpharetta, GA
Job ID: ukg-MGRSI017341
Employment Type: OTHER
Posted: 2026-02-06T18:42:54

Perks and Benefits

  • Health and Wellness

    • Health Insurance
    • Health Reimbursement Account
    • Dental Insurance
    • Vision Insurance
    • Life Insurance
    • Short-Term Disability
    • Long-Term Disability
    • FSA
    • FSA With Employer Contribution
    • HSA
    • HSA With Employer Contribution
    • Fitness Subsidies
    • On-Site Gym
    • Virtual Fitness Classes
  • Parental Benefits

    • Birth Parent or Maternity Leave
    • Non-Birth Parent or Paternity Leave
    • Adoption Assistance Program
    • Family Support Resources
    • Adoption Leave
  • Work Flexibility

    • Flexible Work Hours
    • Remote Work Opportunities
    • Hybrid Work Opportunities
  • Office Life and Perks

    • Casual Dress
    • Happy Hours
    • Company Outings
    • Holiday Events
  • Vacation and Time Off

    • Paid Vacation
    • Unlimited Paid Time Off
    • Paid Holidays
    • Personal/Sick Days
    • Volunteer Time Off
  • Financial and Retirement

    • 401(K) With Company Matching
    • Company Equity
    • Performance Bonus
    • Profit Sharing
  • Professional Development

    • Tuition Reimbursement
    • Mentor Program
    • Shadowing Opportunities
    • Access to Online Courses
    • Internship Program
  • Diversity and Inclusion