Skip to main contentA logo with &quat;the muse&quat; in dark blue text.
EPAM Systems

Senior/Lead AI DevOps/SRE

Nurota, Uzbekistan

We are currently seeking an experienced Senior/Lead AI DevOps/SRE to join our team. In this pivotal role, you will collaborate closely with data scientists and software developers to ensure seamless integration and optimize the operational efficiency of our AI deployments. Your expertise will be pivotal in deploying, maintaining, and scaling our cutting-edge AI solutions, encompassing LLMs and RAG systems.

As a key team member, you will spearhead both traditional DevOps responsibilities and innovative approaches to MLOps. Your proactive involvement will be essential in driving the success of our AI initiatives and maximizing their impact across the organization.

#wca-senior-lead-ai-devops
#Big-Data-6-UZ

Want more jobs like this?

Get Data and Analytics jobs in Nurota, Uzbekistan delivered to your inbox every week.

By signing up, you agree to our Terms of Service & Privacy Policy.

#May-Referral-Digest-UZ
#AI-Integration-vacancies-UZ

What You'll Do
  • Implement and maintain CI/CD pipelines for AI and machine learning projects, ensuring robust deployment strategies and continuous integration
  • Monitor and ensure the reliability, availability, and performance of AI applications, particularly those involving LLMs and RAG
  • Collaborate with AI research teams to operationalize machine learning models and systems efficiently
  • Develop and enforce best practices for version control, configuration management, and testing of AI-driven software solutions
  • Utilize MLOps tools such as Kubeflow, MLflow, or TensorFlow Extended (TFX) to streamline the machine learning lifecycle from experimentation to production
  • Implement monitoring solutions that track both system metrics and model performance to facilitate proactive issue resolution
  • Participate in on-call rotations to support the operational health of critical systems, employing SRE principles to meet service-level objectives (SLOs) and reduce downtime
What You Have
  • Bachelor's degree in Computer Science, Engineering, or a related field
  • Proven experience as a DevOps Engineer or SRE, with a strong background in software development and automation
  • Expertise in deployment and management of LLMs, including technologies like RAG
  • Proficient in CI/CD tools (Jenkins, GitLab CI, CircleCI) and infrastructure as code (Terraform, Ansible)
  • Solid knowledge of container orchestration technologies (Kubernetes, Docker)
  • Familiarity with MLOps tools and practices to support machine learning lifecycle management
Nice to have
  • Experience with cloud services (AWS, GCP, Azure), particularly in AI/ML deployments
  • Background in monitoring tools like Prometheus, Grafana, and ELK stack
  • Understanding of Python, particularly in data science and machine learning contexts
  • Certification in Kubernetes, AWS/GCP/Azure, or similar technologies
We Offer
  • We connect like-minded people::
    • Delivering innovative solutions to industry leaders, making a global impact
    • Enjoyable working environment, whether it is the vibrant office or the comfort of your home
    • Opportunity to work abroad for up to two months per year
    • Relocation opportunities within our offices in 50+ countries
    • Corporate and social events
  • We invest in your growth::
    • Leadership development, career advising, soft skills and well-being programs
    • Certifications, including GCP, Azure and AWS
    • Unlimited access to LinkedIn Learning, Get Abstract, O'Reilly, Cloud Guru
    • Free English classes with certified teachers

    • Discounts in local language schools, including offline courses for the Uzbek language
  • We cover it all::
    • Monetary bonuses for engaging in the referral program
    • Comprehensive medical & family care package
    • Four trust days per year (sick leave without a medical certificate)
    • Discounts for fitness clubs, dance schools and sports programs
    • Benefits package (restaurants, beauty salons, hotels, a variety of stores and services)
About EPAM
  • EPAM Uzbekistan is a team of technologists and innovators united by technology. In 2019, we opened our first office in Tashkent. Since then, we've built a continuously learning organization that helps its employees reach their full potential and achieve professional goals through learning. Our agile methodologies, client collaboration frameworks, engineering excellence programs, and hybrid teams offer many career paths and development opportunities

Client-provided location(s): Uzbekistan
Job ID: EPAM-96061
Employment Type: Other