Job Requisition ID: 25WD89935
Job Title: Senior Manager, Site Reliability Engineering (SRE)
Job Summary: We are seeking a highly skilled and experienced Senior Manager, Site Reliability Engineering (SRE) to lead our SRE team. The ideal candidate will have a strong background in software engineering, systems architecture, and operations, with a focus on reliability, scalability, and performance. As Senior Manager, SRE, you will lead a team of talented engineers that drives the reliability and availability of our services, and you will influence improvements in our infrastructure.
Key Responsibilities:
- Lead and manage the SRE team, providing guidance, mentorship, and support.
- Establish and drive best practices for the availability, performance, capacity, and incident response of production platforms, ensuring alignment with organizational goals and industry standards.
- Collaborate with engineering and operations teams to design and implement robust systems and processes.
- Monitor and analyze system performance, identifying and addressing issues proactively.
- Implement and maintain automation tools and processes to streamline operations.
- Develop and maintain documentation, including runbooks, best practices, and standard operating procedures.
- Foster a culture of continuous improvement, encouraging innovation and experimentation.
- Stay up to date with industry trends and best practices, incorporating them into our SRE practices.
Qualifications:
- Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent work experience.
- 10+ years of experience in site reliability engineering, DevOps, or engineering roles, with 5+ years of hands-on technical leadership and 2+ years of experience managing development teams.
- Previous experience managing engineering teams with ownership of high-scale production systems.
- Strong knowledge of software engineering, systems architecture, and operations.
- Experience with cloud platforms (e.g., AWS, Azure, Google Cloud) and containerization technologies (e.g., Docker, Kubernetes).
- Big-picture thinking, with a willingness to learn, dive into unfamiliar systems, and resolve complex and ambiguous issues from both a business and a technical perspective.
- Proficiency in programming languages such as Python, Go, or Java.
- Excellent problem-solving skills and the ability to work under pressure.
- Strong communication and leadership skills, with the ability to collaborate effectively across teams.
- Experience creating service telemetry using Splunk, New Relic, Datadog, or their equivalents.
- Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack).
- Understanding of security best practices and compliance requirements.
The Ideal Candidate:
- Understands what it takes to deliver scalable and secure solutions.
- Can define and drive technical vision and strategic direction for complex platforms.
- Has a strong understanding of cloud security principles and best practices.
- Has experience working in fast-paced, agile development environments with a focus on iterative improvement and flexibility.
- Possesses excellent communication and collaboration skills to work effectively with cross-functional teams and stakeholders.
- Demonstrates proficiency in systems thinking to understand and address the interdependencies within complex systems.
- Has experience with chaos engineering practices to proactively test and improve system resilience.
- Has strong skills in resiliency engineering to design and implement systems that can withstand and recover from failures.
Why Join Us:
- Opportunity to work with cutting-edge technologies and innovative projects.
- Collaborative and inclusive work environment.
- Competitive salary and benefits package.
- Career growth and development opportunities.
About Autodesk
Welcome to Autodesk! Amazing things are created every day with our software - from the greenest buildings and cleanest cars to the smartest factories and biggest hit movies. We help innovators turn their ideas into reality, transforming not only how things are made, but what can be made.
We take great pride in our culture here at Autodesk - it's at the core of everything we do. Our culture guides the way we work and treat each other, informs how we connect with customers and partners, and defines how we show up in the world.
When you're an Autodesker, you can do meaningful work that helps build a better world designed and made for all. Ready to shape the world and your future? Join us!
Salary transparency
Salary is one part of Autodesk's competitive compensation package. Offers are based on the candidate's experience and geographic location. In addition to base salaries, our compensation package may include annual cash bonuses, commissions for sales roles, stock grants, and a comprehensive benefits package.
Diversity & Belonging
We take pride in cultivating a culture of belonging where everyone can thrive. Learn more here: https://www.autodesk.com/company/diversity-and-belonging
Are you an existing contractor or consultant with Autodesk?
Please search for open jobs and apply internally (not on this external site).