Duolingo

Senior Site Reliability Engineer

Seattle, WA

Duolingo is the most popular language learning application in the world, with over 300 million users. We are passionate about education, fact-based decision making, and elegant solutions to cross-functional problems. If that sounds like you, then come join us as we build the next-generation learning company!

As a Software Engineer, Distributed Systems, you will work closely with cross-functional engineering teams to ensure Duolingo’s complex distributed systems and products are built and maintained with world-class quality, and operated in measurable and scalable ways.

You will...


- Collaborate with internal teams to identify sources of instability in distributed systems and drive operational excellence

- Own core infrastructure (i.e., manage, diagnose, and debug large-scale distributed systems in production)

- Provide system design consulting, develop software platforms/frameworks, and conduct launch reviews and root cause analysis

- Maintain and document sustainable postmortem/incident response practices

- Understand and resolve potential threats to performance or security

- Monitor and measure latency, availability, and overall system health once systems are live

- Advocate for and implement changes that improve reliability, scalability, and velocity

- Monitor and stress test systems to collect metrics for tuning and capacity planning

- Reduce the burden of toil with iterative development of tooling and automation

- Collaborate with engineering teams to release new features and become an authority on our services

- Participate in on-call rotation


You have...


- Bachelor’s Degree in Computer Science

- 3+ years of experience in site reliability engineering/DevOps for a product with millions of users

- Experience analyzing and troubleshooting large-scale distributed systems

- Proven knowledge of C, C++, Java, Kotlin, Python or Go

- Fluency in networking protocols, such as TCP/IP, HTTP, SSL, DNS, etc.

- An understanding of containerization toolsets and container orchestration technologies (Docker, Mesos, Kubernetes, Nomad, etc.)

- Effective communication skills and understanding of best practices around tools/methodologies for Infrastructure, Automation, Capacity Planning, etc.

- Ability to be on-call for critical incident responses

________________________________________________________________________________________________________

We aim to return to the office, and as such are requiring all employees to be fully vaccinated against COVID-19 and to have received any booster doses recommended by the Centers for Disease Control and Prevention.

Take a peek at how we care for our employees' holistic well-being with our benefits here.

We will do everything we can within reason to make sure that your interview takes place in an environment that fairly and accurately assesses your skills. If you need assistance or accommodation, please contact your recruiter.

 

Job ID: 5027449002
Employment Type: Other

Perks and Benefits

  • Health and Wellness

    • Health Insurance
    • Dental Insurance
    • Vision Insurance
    • FSA
    • Short-Term Disability
    • On-Site Gym
  • Work Flexibility

    • Flexible Work Hours
  • Office Life and Perks

    • Commuter Benefits Program
    • Casual Dress
    • Pet-friendly Office
    • Happy Hours
    • Snacks
    • Some Meals Provided
    • Company Outings
  • Vacation and Time Off

    • Paid Vacation
    • Paid Holidays
    • Personal/Sick Days
  • Financial and Retirement

    • 401(K) With Company Matching
    • Company Equity
    • Performance Bonus
    • Relocation Assistance
  • Professional Development

    • Learning and Development Stipend
    • Promote From Within
    • Mentor Program
    • Access to Online Courses
  • Diversity and Inclusion

    • Diversity, Equity, and Inclusion Program
