Yelp

Software Engineer - Distributed Systems

Do you fret over consistency versus availability guarantees? Do you love making complex systems easy to use, resilient and scalable? That’s what we do! You might have heard of one of our open source projects like data-pipeline or Tron. Or maybe you’ve read our blog post on how we monitor Cassandra at scale or watched our talk at MesosCon about our distributed task runner.

The Distributed Systems group at Yelp designs, builds and operates core infrastructure and critical services that all of Yelp’s feature teams rely on. Our work ranges from scaling our NoSQL datastore platform to store petabytes of data, to building systems that transport this data reliably in real-time streams, to transforming these streams into materialized views, to providing the computational infrastructure that runs the batches and services that make up the website.

We also focus on building interfaces to these systems, making them easy to use whilst keeping them highly available and performant. Our systems power many critical applications, including ads, search, transactions and revenue, spam detection, and machine learning.

If this sounds like you, join the team that connects Yelp’s feature teams to their underlying software infrastructure, whether in the data centers, up in the cloud, or anywhere in between.

What You Will Do:

  • Design, build and deploy software systems that run 24/7 at scale.
  • Develop custom tools and automate processes to find answers to challenging questions and reveal the unknown.
  • Write code, document, participate in code reviews, and mentor other engineers.

We Are Looking For:

  • Strong knowledge of systems and application design, with an understanding of operational and reliability trade-offs.
  • Solid foundation in data structures, algorithms and complexity analysis.
  • Ability to work closely with information retrieval/machine learning experts on big-data problems.
  • A hunger for tracking down root causes, no matter how deep the search takes you, and fixing them in systematic ways.
  • Deep understanding of operating systems, file systems, distributed queues and networking.
  • Deep understanding of web technology.
  • Fluency in Python, C++, Java, Scala, Go, or a similar language.
  • Ability to collaborate with and influence people with different roles at all levels of the organization.
  • Excellent written and interpersonal communication skills.
  • If you don't have at least one year of experience in a similar role, please take a look at our College Engineering roles instead!

Pluses:

  • Experience building large-scale distributed systems.
  • Experience with Mesos, Lucene, Elasticsearch, Kafka, Spark, Flink, or RESTful web services.
  • Experience at a large-scale consumer internet site.
  • Experience with configuration management tools (Puppet, Chef, Ansible, Salt, CFEngine), monitoring tools (Nagios, Sensu, Monit), and metrics tools (Graphite, StatsD).
  • Experience with AWS.
  • Experience with web frameworks.
  • Interest in IR/ML techniques, and supporting systems implementing these techniques.
  • Experience working with and contributing to open source projects.

Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.
Job ID: 53198a94-0808-491b-83cd-18c6eb6dc27a
Employment Type: Other

This job is no longer available.
