Data Engineer, Analytics (Instagram Well Being)
- Menlo Park, CA
Facebook's mission is to give people the power to build community and bring the world closer together. Through our family of apps and services, we're building a different kind of company that connects billions of people around the world, gives them ways to share what matters most to them, and helps bring people closer together. Whether we're creating new products or helping a small business expand its reach, people at Facebook are builders at heart. Our global teams are constantly iterating, solving problems, and working together to empower people around the world to build community and connect in meaningful ways. Together, we can help people build stronger communities - we're just getting started.
Our more experienced Data Engineers are clearly characterized by in-depth technical experience and proven progression in leadership responsibility. If you have an interest in being responsible for the dynamics of a fast-paced environment, this is the right role for you. You will be working on many projects at a time, but also focused on the details while finding creative ways to pursue big picture challenges. You will leverage not just technical skills, but strong emphasis on program management, technical leadership, and communication. In this role, you will be responsible for the data for Instagram Trust & Privacy within the Well Being team. You will work closely with the product team including counterparts in data science, engineering, product, and others to support delivering comprehensive, accurate, and data artifacts specifically related to Privacy & product security. You will be responsible for thinking holistically about product & data privacy on Instagram and implementing a roadmap defining the data architecture, ownership model, pipelines and visualization to drive understanding.
- Define and own the data engineering roadmap for Instagram Trust Pillar
- Build product-focused datasets and scalable, fault-tolerant pipelines
- Build data anomaly detection, data quality checks, and optimize pipelines for ideal compute and storage
- Collaborate with Software Engineers and Data Scientists to design technical specification for logging and add logging to production code to generate metrics both online as well as offline
- Work with different cross functional partners partners - Data Scientists, Infra Engineering, Logging Framework Infra Teams, Product Managers, Privacy
- Build visualizations to provide insights into the data & metrics generated
- Work with data infrastructure teams to suggest improvements and influence their roadmap
- Able to immerse yourself in all aspects of the product, understand the problems, and tie them back to data engineering solutions
- Recommend improvements and modifications to existing data and ETL pipelines
- Communicate and influence strategies and processes around data modeling and architecture to multi-functional groups and leadership
- Drive internal process improvements and automating manual processes for data quality and SLA management
- Provide ongoing proactive communication and collaboration throughout the organization
- Actively mentor team members in their careers
- 4+ years' experience in the data warehouse space
- 4+ years' experience working with either a MapReduce or an MPP system
- 7+ years' experience in writing complex SQL and ETL processes
- 4+ years' experience with object-oriented programming languages
- 7+ years' experience with schema design and dimensional data modeling
- BS/BA in Technical Field, Computer Science or Mathematics
- Knowledge in Python or Java
- Experience analyzing data to identify deliverables, gaps, and inconsistencies
- Experience effectively collaborating and communicating complex technical concepts to a broad variety of audiences the data architecture, pipelines, visualization and anomaly detection to drive understanding
Back to top