Skip to main contentA logo with &quat;the muse&quat; in dark blue text.

AI Evaluation Engineer, Siri Core Modeling

Yesterday Cupertino, CA

We are seeking talented engineers to join our team and push the boundaries of evaluations for Siri AI Agents. Evaluation lies at the heart of our model development strategy-it shapes architectural choices, guides launch decisions, and ultimately ensures a world-class user experience.

Our team is highly innovative and fast-moving, leveraging auto-evaluators and LLM-based judges to measure, validate, and continuously improve the core Siri AI engine. If you're excited by the challenge of building trusted evaluation systems that directly impact the quality of a groundbreaking AI product used by millions worldwide, this role is for you.

Description

As an AI Evaluation Engineer, you will:

- Design, build, and maintain auto-evaluators that measure the quality of Siri's core AI engine.

- Identify and triage issues and implement changes to improve auto-evaluator trustworthiness.

- Work with both simulators and real devices to ensure high-fidelity evaluation and a superior user experience.

- Collaborate with scientists and engineers across software and ML teams, contributing to products shipped across our portfolio of devices.

Preferred Qualifications

Experience with large-scale ML model evaluation, testing pipelines, and triage.

Knowledge of data generation, training workflows, or context engineering.

Familiarity with real-world deployment challenges for AI/ML products

Knowledge of latest methodologies in LLM evaluations

Minimum Qualifications

M.S. degree in Computer Science, Machine Learning, or a related technical field, or equivalent practical experience.

Strong skills in data analysis and statistical methods utilized in a problem solving environment

Passion for debugging, testing, and triaging issues in complex AI + software systems.

Want more jobs like this?

Get jobs in Cupertino, CA delivered to your inbox every week.

Job alert subscription


Proficiency in Python and experience developing production-quality code.

Understanding of large language models (LLMs) and awareness of their strengths and limitations.

Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant .

Client-provided location(s): Cupertino, CA
Job ID: apple-200620244-0836_rxr-658
Employment Type: OTHER
Posted: 2025-11-10T19:06:40

Perks and Benefits

  • Health and Wellness

    • Parental Benefits

      • Work Flexibility

        • Office Life and Perks

          • Vacation and Time Off

            • Financial and Retirement

              • Professional Development

                • Diversity and Inclusion

                  Company Videos

                  Hear directly from employees about what it is like to work at Apple.