Skip to main contentA logo with &quat;the muse&quat; in dark blue text.
Pandora

Engineer, Data

New York, NY

Engineer, Data - (employer: Sirius XM Radio Inc.; job location: New York, NY) - Build and deploy streaming and batch data pipelines capable of processing and storing petabytes of data quickly and reliably.  Collaborate with product teams, data analysts, and data scientists.  Gather and process raw, structured, semi-structured, and unstructured data.  Integrate with a variety of data providers (marketing, web analytics, and consumer devices including IoT and Telematics).  Build and maintain dimensional data warehouses to support business intelligence tools. Develop data catalogs and data validations. Design, code, test, correct and document programs and scripts using agreed upon standards and tools. Derive an overall strategy of data management. Plan effective data storage, security, sharing and publishing within the organization. Ensure data quality, and implement tools and frameworks for automating the identification of data quality issues. Collaborate with internal and external data providers on data validation, providing feedback and making customized changes to data feeds and data mappings. Provide ongoing support, monitoring, and maintenance of deployed products. Requirements: Master’s Degree in Computer Science or Engineering (Computer/Electronics/Mechanical) plus three years of experience in the offered position or in a data engineering position or Bachelor’s Degree in Computer Science or Engineering (Computer/Electronics/Mechanical) plus five years of post-Bachelor’s progressive experience in the offered position or in a data engineering position. All required experience must have included deploying and running AWS-based data solutions using tools like Cloud Formation, IAM, Athena and Kinesis; engineering big data solutions using technologies like EMR, S3, and Spark, with data partitioning and sharding techniques; loading and querying on premise and cloud-hosted databases like Teradata and Redshift; building streaming data pipelines using Kafka, Spark, or Flint, with binary data serialization formats such as Parquet, Avro and Thrift; deploying data notebook and analytic environments like Jupyter and Databricks; building and deploying ML pipelines including training models, feature development and regression testing; performing data modeling and data profiling; building graph-based data workflows using Apache Airflow; writing distributed, high-volume services in Python, Java or Scala; conducting metadata management, data lineage, and employing principles of data governance; and working with agile software processes, data-driven development, data storage techniques, high volume heterogeneous data (distributed systems), and Python data ecosystem using pandas and numpy. Apply online at www.siriusxm.com/careers.  

Want more jobs like this?

Get jobs in New York, NY delivered to your inbox every week.

By signing up, you agree to our Terms of Service & Privacy Policy.
Job ID: ostkifwh
Employment Type: Other

This job is no longer available.

Search all jobs