DevOps Engineer, Monitoring & Observability Team

DevOps Engineer, Monitoring & Observability Team

Outbrain’s recommendation system is powered by a large scale infrastructure, based both on Cloud Infrastructure as well as bare metal infrastructure managed in our own Data Centers.
Outbrain servers fleet consists of over 6000 physical servers, and produces over 300 Billion personalised recommendations on a monthly basis, reaching over 800M unique users every month.

Outbrain’s service is built from cutting edge technologies and with a microservice design, working in a highly scalable and full self-serve CI/CD environment. In order to support that, it is crucial we will have a cutting edge and easy to use observability (monitoring, visibility & alerting) infrastructure.
Our current environment is based on Prometheus, ELK Stack, Grafana and other self-developed tools, which digests 65 billion metric samples and over 300 billion events/logs per day.

We’re looking for an experienced DevOps Engineer to join our Monitoring & Observability team and take part in designing, building, implementing and maintaining our super scalable & automated Observability infrastructure.

If you are:

  • An experienced engineer who wants to design, develop and implement modern monitoring & observability tools to provide a reliable and holistic view of Outbrain services.
  • Passionate about helping & supporting engineering teams in order to make their life easier by providing new initiatives and top notch solutions for their wishes on all observability aspects.
  • Feel really excited about highly scalable systems and love troubleshooting and tune these super complex distributed systems while automating about everything we can.
  • A Data-driven person and like all about dashboards, logging & tracing in cloud-native environments.

You probably will feel at home as part of monitoring & observability team @ Outbrain.
What are we looking for?
  • Experience in writing software in one or more languages, such as Python, Java, Go, Ruby or similar.
  • Strong understanding and knowledge of Linux.
  • Experience working with deployment and orchestration technologies (such as Docker, Kubernetes, Chef, Terraform and Jenkins).
  • Expertise in designing, analyzing and troubleshooting large-scale distributed systems.
  • Experience with Prometheus and/or ELK Stack - an advantage
  • Familiar with Public cloud such as GCP, AWS - an advantage

Back to top