Site Reliability Engineer (Kubernetes and Golang)
Sofia, Bulgaria
As a Site Reliability Engineer, you'll play a pivotal role in maintaining and enhancing our critical developer-facing tools at one of the biggest companies in the world. We're seeking a candidate with expertise in Kubernetes, Go, and operations/observability technologies.
 
Responsibilities
- Develop, monitor, and maintain observability tooling on Kubernetes (e.g., Prometheus, Jaeger, Grafana/Plutono)
- Develop in Golang and collaborate closely with other development teams, including onsite engagements
- Provide occasional third-level support for internal tools
- Utilize and create Grafana/Plutono/Prometheus dashboards and queries
- Administer and leverage log aggregation tooling
- Operate and monitor Kubernetes workloads, adhering to best practices
- Implement and manage end-user monitoring tools
- Update workflows on GitHub Actions
- Use and update Terraform modules
- Enhance operational efficiency and productivity for 50k engineers
Requirements
- 3 years of experience in a similar role and knowledge of Kubernetes and Golang
- Hands-on experience with operations and observability tooling
- Knowledge of creating and managing dashboards and queries in Grafana/Plutono/Prometheus
- Experience with log aggregation tools like Splunk, OpenTelemetry, Fluent Bit, and the ELK Stack
- Proficiency in administering and operating Kubernetes workloads
- Experience with end-user monitoring tools (e.g., Dynatrace RUM)
- Familiarity with Sentry (sentry.io) for error management
- Expertise in developing Helm charts and Helm chart libraries
- Experience updating workflows on GitHub Actions
- Experience using and updating Terraform modules
- Very good proficiency in English (written and spoken)
- Willingness to work in a hybrid setup (home office and office in Sofia)
We offer
- Opportunity to Engineer your Future and to drive the world's digital transformation with top industry clients
- Personal development program that will allow you to be valued for your strengths
- Wide range of professional trainings and workshops
- Being part of a collaborative, fast-growing, and innovative design team
- Established and accelerated growth toward different career paths, competencies, and roles
- Broad variety of projects and possible mobility between projects over time
- Collaboration in a multicultural environment and exchange of best practices with colleagues around the world
- Varied social benefits, including sports, transportation, and health programs
- Work-life balance and flexible schedule, team-building events, and sports opportunities
- Modern office/collaboration spaces (incl. new Infinity Tower business center, Sofia)
- Hybrid by Design: we provide the best productivity options from both worlds. Meet, socialize, and enjoy face-to-face time with your colleagues while working from EPAM's modern office a few days per week, and use EPAM's virtual working environment to stay productive while working remotely for the rest of the week

Client-provided location(s): Sofia, Bulgaria
Job ID: EPAM-epamgdo_blt2888d2b6308e5b58_en-us_Sofia_Bulgaria
Employment Type: OTHER
Posted: 2024-10-30T00:44:23
Perks and Benefits
- Health and Wellness
- Parental Benefits
- Work Flexibility
- Office Life and Perks
- Vacation and Time Off
- Financial and Retirement
- Professional Development
- Diversity and Inclusion