About The Position
We're looking for an experienced, highly motivated SRE engineer, to utilize SRE methodologies and technologies in order to implement highly scalable and available production environments.
As a SRE engineer at Gloat, you will be part of a growing DevOps team. You will have the freedom to explore and implement the newest technologies. You will be responsible for implementing monitoring and alerting infrastructure and to define the right measurements for highly available production environment. You will learn new things every minute of every day and constantly be challenged. There will not be a single boring moment of work but the opposite; exciting, motivating and stimulating. The DevOps team has the honour and responsibility to support some of the biggest enterprise clients in the world.
Want more jobs like this?
Get Software Engineering jobs in Dimona, Israel delivered to your inbox every week.
Responsibilities
- Design and implement reliable, highly available and scalable production infrastructure.
- Seek for new technologies, from POC through implementation.
- Ensure high uptime and reliability of the production environment.
- Perform root cause analysis for complex failures and offer modern solutions and tools.
- Analyze performance and stability issues.
- Work closely with DevOps, R&D, product, and support to define cross organizational processes.
- Design, develop, and drive troubleshooting & mitigation tools as part of driving self-healing agenda.
- Educate engineers on how to approach and debug production issues across services and levels of the stack.
Must:
- 3 + years SRE experience
- Solid knowledge of Kubernetes
- Proven Monitoring and alerting experience (ELK, Grafana, Prometheus, etc.)
- Experience implementing services in one of the big clouds (AWS, Azure, GCP, etc.)
- Experience with a complete programming language (Python, Java, Go, Ruby, etc.)
- Scripting and automation skills (Bash, Python, etc.) .
- Strong background knowledge in Linux Administration.
- Strong networking skills
- Experience iac tools such as Ansible, Terraform, etc.
- Experience in multi cloud environments
- Strong Security skills
- Microservice architecture implementation experience
- Experience with SaaS production infrastructure