DESCRIPTION
We're seeking a Senior Data DevOps Engineer to join our team and play a crucial role in shaping the future of our CVML (Computer Vision and Machine Learning) platform.
If you're an experienced engineer with a passion for innovation, this could be the perfect opportunity for you.
EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.
Want more jobs like this?
Get Software Engineering jobs in Bahía Blanca, Argentina delivered to your inbox every week.
#REF_CL23_AR
Responsibilities
- As a Senior Data DevOps Engineer a EPAM, you'll be tasked with a range of responsibilities that are integral to our platform's success. Your day-to-day tasks will include a
- Crafting Infrastructure as Code: Develop Terraform and Terragrunt configurations to efficiently manage our infrastructure. Work on GitHub Actions workflows to streamline our development processes
- Data Access Permissions: Troubleshoot and resolve data access permission issues, particularly within AWS S3 and AWS IAM
- Kubeflow Mastery: Take ownership of our Kubeflow ML pipelines, resolving issues related to CPU, memory, GPU, and permissions
- Scripting for Automation: Utilize your programming skills in Python and Golang to develop scripts for various automation tasks, contributing to the efficiency and scalability of our platform
- Terraform and Terragrunt Proficiency: You should be highly skilled in using these tools for infrastructure management
- Kubernetes Expertise: Deep knowledge of Kubernetes, including experience with AWS EKS and KubeSpray
- AWS Mastery: A strong command of AWS services, from network management to LoadBalancer and IAM
- Istio Understanding: Familiarity with Istio, including sidecars, mTLS (mutual TLS), and the ingress gateway
- Monitoring Skills: Experience with monitoring tools like Prometheus and Grafana
- Programming Prowess: Strong Python programming skills
- GitHub Proficiency: Familiarity with GitHub and GitHub Actions
- Distributed Tracing: Familiarity with distributed tracing tools such as Zipkin and Istio
- Golang Proficiency: A background in Golang programming
- Kubeflow Knowledge: Experience with Kubeflow, a popular ML platform
- Pulumi Expertise: Familiarity with Pulumi for infrastructure as code
- Health Insurance
- Life Insurance (SVO)
- Occupational Risk Insurance (ART)
- Paid Time Off - Vacations. 14 calendar days a year, the number of days will increase by seniority based on local law rules
- Sick leave
- Exceptional Leave. Take paid time off for your major life changes (childbirth, marriage, etc.)
- Compensation of costs for internet, electricity, and personal laptop usage (if applicable)
- Stable full-time workload
- Thousands of projects for top brands
- Stable income
- Referral Program
- Certification opportunities
- Unlimited access to LinkedIn learning solutions
- Language courses
- Relocation Assistance Package
- By applying to our role, you are agreeing that your personal data may be used as in set out in EPAM's Privacy Notice and Policy