Sr AI Observability Engineer

Yesterday Cupertino, CA

Do you want to build the future of AI-enabled observability at Apple? We're looking for an experienced AI observability engineer to design and build the AI observability solutions behind Apple Intelligence, Search, and the AI infrastructure that powers Apple's intelligent products. We're at the forefront of building AI-first observability services, blending AI, cloud-first engineering, and industry standards to deliver smart, scalable solutions. Your work will directly impact the experience of billions of users on their favorite Apple devices. If you are a seasoned principal or senior software engineer with a proven track record of building AI-enabled observability solutions and a deep passion for observability, AI, cloud-native technologies, and large-scale distributed systems, we want to talk with you.

Description

We're pioneering the next generation of AI-powered observability solutions. While we innovate to build new solutions, we also leverage industry-standard open-source technologies. In this role, you will collaborate with a team of engineers to lead the design and development of user-facing observability features for AI/ML products and infrastructure. You will also be responsible for providing technical guidance, sharing observability best practices and know-how, leveraging AI pipelines, and mentoring the team to develop and deliver best-in-class features and a delightful user experience for all users.

Preferred Qualifications

Knowledge of current Gen AI research and techniques in the following areas: MCPs, RAG systems, Agentic AI (multi-agent orchestration, tool calling)

Hands-on experience with agentic AI frameworks (e.g. LangGraph, AutoGen, CrewAI) for building multi-step reasoning and tool-using agents

Demonstrated experience in building observability systems for metrics, distributed tracing, logs, profiling and in building observability data collection using OpenTelemetry

Demonstrated proficiency in AWS services such as EKS and native Kubernetes, storage such as S3, networking, database and observability services

Experience with large scale observability visualization systems with knowledge of popular visualization tools like Grafana, DataDog, and ELK

Proficiency using cloud-native software development tools including coding, CI/CD and testing frameworks

Building large-scale incident management, alert management and notification systems

Active open source project contributions are a plus

Minimum Qualifications

7+ years of experience in building ML pipelines, portable workflows and in model tuning to deploy ML and LLM models in production for customer-facing features

7+ years software engineering experience and strong background in computer science: distributed systems, algorithms and data structures, APIs and highly-scalable, reliable systems and micro-services

Demonstrated experience using LLM and ML models for AIOps and model observability

Demonstrated experience using LLMs, ML frameworks such as TensorFlow and PyTorch, and libraries like Scikit-learn, NumPy, LangChain, MLflow, and Kubeflow

Demonstrated experience in delivering well-architected, reliable, highly-scalable cloud-native distributed systems for data management, observability or analytics services

Strong software engineering experience in design, development and testing in cloud-native environments

Strong coding skills in Python, Go, JavaScript, Java

Demonstrated experience in building large-scale micro-services using public cloud infrastructure and/or "private cloud" environments

Experience developing intelligent detection and resolution features for incident management, automated remediation and root cause analysis

Excellent verbal and written communication skills and strong problem-solving skills

Excellent interpersonal skills for collaborating across teams, stakeholders, and open source collaborators

Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.

Client-provided location(s): Cupertino, CA
Job ID: apple-200635605-0836_rxr-659
Employment Type: OTHER
Posted: 2025-12-12T19:32:15

Perks and Benefits

  • Health and Wellness
  • Parental Benefits
  • Work Flexibility
  • Office Life and Perks
  • Vacation and Time Off
  • Financial and Retirement
  • Professional Development
  • Diversity and Inclusion
