
AIML - Machine Learning Engineer, Model Evaluations

Apple

Cupertino, CA

Join Us in Shaping the Future of Generative AI at Apple

Are you passionate about ensuring AI systems are safe, inclusive, and globally representative? We are seeking a seasoned Model Evaluations Machine Learning Engineer to oversee safety evaluations of Apple's generative AI features in international markets. Your work will directly shape how we assess and improve the safety and cultural alignment of large language and multimodal models across languages and regions.

In this role, you will lead the definition, design, and execution of evaluation strategies in global markets for Apple's next-generation generative AI models, spanning text, vision, and multimodal applications. As part of the Responsible AI group within Apple's Human-Centered Machine Intelligence (HCMI) organization, you'll partner closely with cross-functional teams to identify risks, measure impact, and drive mitigations tailored to regions around the globe. You'll also play a critical role in long-term research initiatives centered on fairness, robustness, explainability, and safety, ensuring that Apple's AI systems meet the highest standards across all the regions we serve.


Description

Apple Intelligence is driven by intentional data design, spanning careful sampling, creation, and curation of high-quality datasets, enriched with precise annotations. Our data powers our ability to evaluate and mitigate safety risks in new generative AI features. This role sits at the intersection of applied data science, empirical analysis, cultural and linguistic expertise, and stakeholder communication. It requires strong scientific judgment, cross-functional collaboration, and the ability to translate evaluation findings into actionable insights.

  • Develop metrics for evaluation of safety and fairness risks inherent to generative models and Gen-AI features
  • Design datasets, identify data needs, and work on creative solutions, scaling and expanding data coverage through human and synthetic generation methods
  • Collaborate with cross-functional partners, including engineering, product, and research teams, to ensure evaluations align with feature goals and deployment plans
  • Partner with policy teams to translate regional safety and inclusivity requirements into measurable evaluation criteria
  • Build expertise in machine translation and data synthesis techniques to generate localized and culturally aligned evaluation datasets at scale
  • Develop ML-based enhancements to red teaming, model evaluation, and other processes to improve the quality of Apple Intelligence's user-facing products
  • Work with highly sensitive content, with exposure to offensive and controversial content

Minimum Qualifications

  • MS or PhD in Computer Science, Linguistics, Cognitive Science, HCI, Psychology, Mathematics, Physics, or a similar science or technology field with a strong basis in scientific data collection and analysis, plus at least 4 years of relevant work experience; or BA/BS with 8+ years of relevant work experience
  • Experience collecting and analyzing language data, image data, and/or multi-modal data
  • Strong experience designing human annotation projects, writing guidelines, and dealing with highly multi-labeled, nuanced, and often conflicting data
  • Proficiency in data science, machine learning, analytics, and programming with Python & Pandas; strong experience with one or more plotting & visualization libraries
  • Excellent interpersonal skills, with a proven ability to synthesize complex findings and present evaluation outcomes to senior leadership and executives
  • Strong skills for rigorous model quality metrics development; interpretation of experiments and evaluations; and presentation to executives

Preferred Qualifications

  • Deep cultural awareness and understanding of regional norms, values, and sensitivities, with the ability to translate this knowledge into actionable evaluation strategies
  • Experience in localization, internationalization, or building/evaluating machine learning systems for global markets, with a focus on linguistic and cultural adaptation
  • Curiosity about fairness and bias in generative AI systems, and a strong desire to help make the technology more equitable

Pay & Benefits

At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $175,800 and $312,200, and your base pay will depend on your skills, qualifications, experience, and location.

Apple employees also have the opportunity to become an Apple shareholder through participation in Apple's discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple's Employee Stock Purchase Plan. You'll also receive benefits including comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and, for formal education related to advancing your career at Apple, reimbursement for certain educational expenses, including tuition. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation. Learn more about Apple Benefits.

Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.

Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.

Client-provided location(s): Cupertino, CA, USA
Job ID: apple-200605752
Employment Type: Other
