Data Reliability Engineer

Data Reliability Engineers at Twitch are software engineers responsible for the performance and reliability of data stores that support Twitch’s services. The Data Reliability and Performance team designs and builds tools that automate using data stores when building your own service or microservice, and it provides guidance and best practices for using those tools and databases in general.  As a Data Reliability Engineer you will have impact shaping Twitch’s engineering culture around data availability and performance.

Responsibilities

  • Work closely with engineering teams to build tools, applications, and services that improve our ability to ship reliable and high performance services quickly.
  • Bring ideas to life to help make the lives of engineers better.
  • Measure everything, predict outcomes, and proactively work to mitigate failures and subpar performance.
  • Understand complex systems, and identify bottlenecks and performance issues in existing architecture.
  • Advocate and implement reliable design patterns (high availability, graceful degradation, correct alerting, SPoF mitigation).

Relevant skills and experience

  • 5+ experience writing maintainable, production-ready code in a modern language, and enthusiasm for learning the best tool for the job. We typically use Go or Python.
  • You know how engineering works in the real world and have experience shipping and maintaining what you ship.
  • Hands on experience using or scaling RDBMS data stores. You understand the pros and cons of using ACID databases and other data stores.
  • Experience working in a service-oriented or microservices architecture.
  • Experience building large, fault-tolerant, scalable systems.
  • Knowledge of AWS services.
  • We write and text chat a lot. Good writing skills are important, as is the ability to line up the impact of your communication with your intention.
  • You work with integrity, earning the respect of both engineers and non-technical contributors.
  • A sense of humor always helps.
  • We abide by the “no asshole” rule.

Bonus Points

  • Open source contributions to large data scaling projects such as Kafka, Cassandra, PostgreSQL.
  • Fundamentals of distributed programming (CAP, Ordering, Consensus).
  • Experience with TDD and Agile methodology.

Twitch is the world’s leading video platform and community for gamers, with more than 100 million visitors per month. We connect gamers from around the world by allowing them to broadcast, watch, and chat with each other. Twitch’s live and on-demand video platform forms the backbone of a distribution network for video game broadcasters including pro players, tournaments, leagues, developers and gaming media organizations. Twitch is leading a revolution in gaming culture, turning gameplay into an immersive video experience. Learn more at:http://twitch.tv

We are an equal opportunity employer and value diversity at Twitch. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

#LI-MJ1

Meet Some of Twitch's Employees

Daniel H.

Data Scientist

Daniel discovers techniques that influence every aspect of product planning and market prediction, from consumer need to company cost and ultimate value, all through data science research.

Jenny Q.

Director Of Business Operations

Jenny and her team use data-driven insights to tackle the toughest business problems at Twitch to help improve company performance.


Back to top