Skip to main contentA logo with &quat;the muse&quat; in dark blue text.

AIML Engineer - Human Perception

Today Sunnyvale, CA

The Video Computer Vision organization is working on breakthrough technologies for future Apple products. Our team delivers cutting-edge AI, machine learning, computer vision and graphics algorithms that power technologies including human understanding, perception, digital humans, AI agents, and health applications. In this role, you will collaborate with world-class experts in AI, ML, Software, and Hardware to tackle fundamental challenges in human-centric solutions that will impact millions of users across Apple's ecosystem.

Description

We are looking for an AIML Engineer with a strong background in developing foundation models for generative AI and multimodal systems that integrate various types of real-time sensor data such as video and audio with other modalities like text. You will not only work on cutting-edge projects to advance our AI capabilities, but also contribute to practical features in Apple products and bring impact to millions of users. You will collaborate with others to drive data requirements, validation strategies, and key performance indicators, and conduct algorithm research and development that serves product needs. A successful candidate will stay up-to-date with the latest advancements in AI, machine learning, and computer vision, applying this knowledge to drive innovation, but also take a practical approach to problem solving and software engineering to deliver clean, modular, testable code.

Preferred Qualifications

MS or PhD in computer vision, computer graphics, machine learning, computer science, computer engineering or related fields.

Want more jobs like this?

Get jobs in Sunnyvale, CA delivered to your inbox every week.

Job alert subscription


Experience in developing, training/tuning foundation models and multimodal LLMs.

Experience with training and troubleshooting generative architectures such as diffusion, reinforcement learning, flow matching or normalizing flow at scale.

Experience applying reinforcement learning to help train foundation models a plus.

Excellent communication and experience working with multi-functional teams.

Self-motivated with proven track record to optimally prioritize and deliver tasks on schedule.

Minimum Qualifications

Experience building models for multimodal perception system.

Experience working with LLMs and VLMs.

Software engineering skills and proficiency in Python and PyTorch.

Curiosity and willingness to learn new things in order to improve the quality of their solutions.

BS and a minimum of 3 years relevant industry experience.

Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant .

Client-provided location(s): Sunnyvale, CA
Job ID: apple-200615499-3956_rxr-658
Employment Type: OTHER
Posted: 2025-11-10T19:05:25

Perks and Benefits

  • Health and Wellness

    • Parental Benefits

      • Work Flexibility

        • Office Life and Perks

          • Vacation and Time Off

            • Financial and Retirement

              • Professional Development

                • Diversity and Inclusion

                  Company Videos

                  Hear directly from employees about what it is like to work at Apple.