Skip to main contentA logo with &quat;the muse&quat; in dark blue text.

Sr Sofware Developer / Site Reliability Engineer - ASE iCloud Content

Yesterday Seattle, WA

People at Apple don't just build products - they craft the kind of experience that have revolutionized entire industries. The diverse collection of our people and their ideas inspire innovation in everything we do. Imagine what you could do here! Join Apple, and help us leave the world better than we found it. The Apple Services Engineering (ASE) team builds and provides platforms, services and infrastructure that fuel Apple's services (such as iCloud, iTunes, Siri, and Maps). We are the foundation on which Apple's software developers build the products that our customers love. We are looking for passionate and talented engineers to continue our focus in providing our customers the highest quality Apple Services experience. Our services have to scale globally, stay highly available, and "just work." If you love designing, engineering and running products and platforms that will help millions of customers, then this is the place for you!

Description

Apple Services' scale is BIG. Operating at our scale, across multiple geographies and servicing hundreds of millions of users presents unique challenges. As a Software Developer in SRE at Apple, you'll need to solve these problems using data, teamwork, and your own expertise. ASE Products Site Reliability teams are responsible for the reliability and performance of the server software stack that powers products like iCloud Photos, Mail, Drive, Backup and many more. We do that by focusing on reliability best practices from service inception to production, collaborating deeply with product development teams to deliver a superlative product and shared vision while leveraging data and automation as first principles. We run a mix of open source, vendor licensed, and internally developed tools to manage the end to end SDLC of our products. You'll learn these tools and have opportunities to improve them.

We think critically and strive to balance the best solution with the need to get things done for each engineering challenge we face. Good ideas are heard and results are rewarded.

","responsibilities":"As an SRE at Apple, you will:

Operate, monitor, and triage all aspects of our production and non-production environments.

Pioneer and implement the next-generation telemetry system.

Prepare alert handling procedures, runbooks, and collaborate with the off-shore SRE teams.

Automate deployment and orchestration of services into the cloud environment as well as other routine processes.

Actively participate in capacity planning, scale testing, and disaster recovery exercises.

Interact with and support partner teams, including engineering, QA, and program management.

Cultivate and maintain relationships with internal and external third-party vendors.

Preferred Qualifications

Fast learner who is generous with their knowledge

Experience with disaster recovery, capacity planning and chaos testing

Being curious about how systems work and, more importantly, how they fail

An acute drive to build bots that automate away repetitive tasks

Working knowledge of microservices architecture and container orchestration with Kubernetes or similar technologies, preferably in a large-scale production environment

Experience with managing large numbers of diverse systems with configuration management and software delivery platforms (such as Spinnaker, Terraform, Puppet, Chef or Ansible) in a public, private, or hybrid cloud environment

Experience with Linux/Unix, Networking, Systems Management, Systems Security

Experience using modern object storage systems like S3, GCS.

Familiarity with large-scale observability systems like Prometheus, Grafana, Splunk

A track record of partnering with peers to foster solid engineering principles

Strong belief in acquiring and spreading knowledge via mentorship

Minimum Qualifications

5+ years of software development or production operations experience in a large-scale environment

BS or MS in Computer Science or related field

Experience in managing and scaling large distributed systems in a public, private, or hybrid cloud environment

An inherent bias for action, strong sense of ownership and integrity demonstrated through clear communication and collaboration

Experience with deploying and supporting new and existing services, platforms, and application stacks

Familiarity with cloud infrastructure concepts (zones, regions, VPCs, etc)

Excellent troubleshooting and problem solving skills

Skills and experience in monitoring, alerting, fault analysis, and automation

The ability to design, author and release code in languages like Java, Go, or Python

Want more jobs like this?

Get jobs in Seattle, WA delivered to your inbox every week.

Job alert subscription


Ability to participate in on call service support

Lead incident response and root cause analysis of production systems

Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant .

Client-provided location(s): Seattle, WA
Job ID: apple-200627818-3337_rxr-658
Employment Type: OTHER
Posted: 2025-11-10T19:05:27

Perks and Benefits

  • Health and Wellness

    • Parental Benefits

      • Work Flexibility

        • Office Life and Perks

          • Vacation and Time Off

            • Financial and Retirement

              • Professional Development

                • Diversity and Inclusion

                  Company Videos

                  Hear directly from employees about what it is like to work at Apple.