Distributed Systems Engineer - Data

    • San Francisco, CA

About the department

Cloudflare’s Engineering Team builds and runs the software that handles incredible amounts of requests on the Internet today. We also build and run the internal tools that builds and runs our software. The Engineering Team is split into two groups: one handles product development and the other handles operations. Product development covers both new features and functionality and scaling our existing software to meet the challenges of a massively growing customer base. The operations team handles one of the world’s largest networks with data centers in 102 cities worldwide.

What you'll do

You will be responsible for designing, building, and scaling one of the biggest global data pipelines to overcome network delays and partitions. The pipeline uses Go, Kafka, ClickHouse, Flink and PostgreSQL to store and analyze in excess of 10 million events per second (and growing fast!).

Examples of desirable skills, knowledge and experience

  • Experience designing, integrating, and optimizing distributed data processing pipelines
  • Experience writing high-quality data processing code in Go, Java, Python, or other high-performance languages
  • Experience building and administering alerting and monitoring systems around data processing pipelines
  • Strong systems-level debugging

Bonus Points

  • Experience building REST APIs for analytics services
  • Experience with cluster and configuration management systems such as Docker, Mesos, Marathon, Salt
  • Familiarity writing and optimizing advanced SQL queries
  • Good Linux/UNIX systems knowledge
  • Experience productionizing Machine Learning models

 


Back to top