AIML Engineer - Human Perception
The Video Computer Vision organization is working on breakthrough technologies for future Apple products. Our team delivers cutting-edge AI, machine learning, computer vision and graphics algorithms that power technologies including human understanding, perception, digital humans, AI agents, and health applications. In this role, you will collaborate with world-class experts in AI, ML, Software, and Hardware to tackle fundamental challenges in human-centric solutions that will impact millions of users across Apple's ecosystem.
Description
We are looking for an AIML Engineer with a strong background in developing foundation models for generative AI and multimodal systems that integrate various types of real-time sensor data such as video and audio with other modalities like text. You will not only work on cutting-edge projects to advance our AI capabilities, but also contribute to practical features in Apple products and bring impact to millions of users. You will collaborate with others to drive data requirements, validation strategies, and key performance indicators, and conduct algorithm research and development that serves product needs. A successful candidate will stay up-to-date with the latest advancements in AI, machine learning, and computer vision, applying this knowledge to drive innovation, but also take a practical approach to problem solving and software engineering to deliver clean, modular, testable code.
Preferred Qualifications
MS or PhD in computer vision, computer graphics, machine learning, computer science, computer engineering or related fields.
Want more jobs like this?
Get jobs in Sunnyvale, CA delivered to your inbox every week.

Experience in developing, training/tuning foundation models and multimodal LLMs.
Experience with training and troubleshooting generative architectures such as diffusion, reinforcement learning, flow matching or normalizing flow at scale.
Experience applying reinforcement learning to help train foundation models a plus.
Excellent communication and experience working with multi-functional teams.
Self-motivated with proven track record to optimally prioritize and deliver tasks on schedule.
Minimum Qualifications
Experience building models for multimodal perception system.
Experience working with LLMs and VLMs.
Software engineering skills and proficiency in Python and PyTorch.
Curiosity and willingness to learn new things in order to improve the quality of their solutions.
BS and a minimum of 3 years relevant industry experience.
Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant .
Perks and Benefits
Health and Wellness
Parental Benefits
Work Flexibility
Office Life and Perks
Vacation and Time Off
Financial and Retirement
Professional Development
Diversity and Inclusion
Company Videos
Hear directly from employees about what it is like to work at Apple.