Sensei Platform Data Engineer
- San Jose, CA
Machine Learning is critical part of Adobe's Cloud offering. Adobe Clouds enable customers to create and manage digital content. In Creative Cloud, creative professionals and novice users alike need to manage the lifecycle of their digital assets, libraries, and documents, from brushes to colors, images, photos, videos, 3D assets and beyond. In Experience Cloud, it is all about optimizing the digital experience and digital transformations for enterprises. Adobe Cloud also includes the Adobe Stock image marketplace and the Behance community that leverage deep machine learning to enable content quality, search, discovery, organization, contributor moderation, and more.
We are building a new machine learning platform, called Adobe Sensei, that powers machine learning and AI across our Adobe Cloud product lines. This platform will span thousands of applied researchers, millions of users, and billions of digital assets. Become part of this growing team at Adobe and make a phenomenal impact in the area of computer vision, user understanding, language understanding, and digital experience optimization. The objective is to make machine learning offerings a world-class, leading-edge, differentiating technology in the Adobe Cloud ecosystem.
How can you participate? We're looking for a Data Engineer, who is passionate about large dataset management for ML applications. The ideal candidate will be a hands-on person who has strong technical and communication skills and will provide innovative technical solutions promptly. Someone with a proven record of building large-scale infrastructure needed to source, validate, clean, and process the data. This is an opportunity to make a huge impact in a fast-paced, startup-like environment in great company. Join us!
Define and develop processes for dataset lifecycle management, modeling, and production.
Architect, build and maintain scalable automated data pipelines from the ground up. Be an expert in combining and calibrating data across various data sources.
Work with Adobe's data ingestion, data platform, and product teams to understand and validate instrumentation and data flow.
Integrate new data management technologies and software engineering tools into existing structures.
Collaborate with architects, product management, and engineering teams to define and establish product requirements.
Explore and research new and emerging ML technologies and bring them to the Adobe Sensei platform.
Write and review technical documents, including requirements and design documents for existing and continuously evolving features of the Adobe Sensei platform.
What you need to succeed
MS/PhD in Computer Science or related field
5+ years of industry experience in building and maintaining big data pipelines and/or building and maintaining analytical or reporting systems at scale.
Expertise with data pipeline and workflow management tools: Airflow, Azkaban, Luigi, etc.,
Experience working with Apache Hadoop and related technology like Pig, Hive, Oozie etc.,
Hands on experience in Python, Java and/or C++.
Experience with Docker, Containerization, AWS.
Experience in micro-services, and REST APIs.
Experience in RDBMS and NoSQL databases.
Proficient interpersonal and communication skills.
At Adobe, you will be immersed in an exceptional work environment that is recognized throughout the world on Best Companies lists. You will also be surrounded by colleagues who are committed to helping each other grow through our unique Check-In approach where ongoing feedback flows freely.
If you're looking to make an impact, Adobe's the place for you. Discover what our employees are saying about their career experiences on the Adobe Life blog and explore the meaningful benefits we offer.
Adobe is an equal opportunity employer. We welcome and encourage diversity in the workplace regardless of race, gender, religion, age, sexual orientation, gender identity, disability or veteran status.
Back to top