Lead Site Reliability Engineer - 10929
Coupa makes margins multiply through its community-generated AI and industry-leading total spend management platform for businesses large and small. Coupa AI is informed by trillions of dollars of direct and indirect spend data across a global network of 10M+ buyers and suppliers. We empower you with the ability to predict, prescribe, and automate smarter, more profitable business decisions to improve operating margins.
Why join Coupa?
• Pioneering Technology: At Coupa, we're at the forefront of innovation, leveraging the latest technology to empower our customers with greater efficiency and visibility in their spend.
• Collaborative Culture: We value collaboration and teamwork, and our culture is driven by transparency, openness, and a shared commitment to excellence.
• Global Impact: Join a company where your work has a global, measurable impact on our clients, the business, and each other.
Learn more on the Life at Coupa blog and hear from our employees about their experiences working at Coupa.
The Impact of a Lead Site Reliability Engineer at Coupa:
If you are passionate about new technologies, have a strong technical background, and are looking for an environment where you can continuously expand your knowledge, you are the right fit for this role. At Coupa, the "Cloud team" is looking for a Lead Engineer who is ready to constantly question the status quo, combining system design, code development, deployment, automation, and networking with experience managing Machine Learning, GenAI, and Agentic AI platforms.
What You'll Do:
- Build, deploy, and troubleshoot microservices in Kubernetes and Amazon EKS, ensuring scalability and reliability.
- Design secure, highly available web applications with a focus on capacity planning and performance optimization.
- Deploy and manage the lifecycle of LLMs and embedding models, defining KPIs to measure and improve AI application performance.
- Evaluate and integrate emerging technologies such as RAG systems, MCP servers, AI Agents, and agentic workflows into our platform.
- Manage AWS core and GenAI services (S3, IAM, EKS, Bedrock, etc.) using infrastructure-as-code tools like Terraform and Chef, while maintaining observability and alerting through tools like New Relic or PagerDuty.
- Collaborate across product, platform, and engineering teams on architecture design, security patching, incident response, and release management to ensure the reliability of our ML and GenAI infrastructure.
What You Will Bring to Coupa:
- Bachelor's degree and 10+ years of experience managing large-scale cloud applications, with a strong background in Linux administration and troubleshooting.
- Excellent communication skills, a collaborative mindset, and the confidence to take ownership, drive solutions, and deliver results independently while thinking globally.
- Over 8 years of hands-on experience managing cloud infrastructure across AWS, GCP, and Azure environments.
- A solid understanding of today's generative AI ecosystem, with practical experience using LLMs and embedding models (OpenAI, AWS Bedrock, SageMaker); familiarity with vector databases like LanceDB is a plus.
- Strong scripting skills in Bash or Python, and experience with container orchestration platforms like Amazon EKS or Azure AKS.
- Proficiency with DevOps and automation tools such as Chef, GitHub Actions, Rundeck, and IaC frameworks like Terraform, Spacelift, and Helm.
- Working knowledge of DNS, load balancers, and MySQL, along with a good grasp of source control and branching strategies in Git.
Coupa complies with relevant laws and regulations regarding equal opportunity and offers a welcoming and inclusive work environment. Decisions related to hiring, compensation, training, or evaluating performance are made fairly, and we provide equal employment opportunities to all qualified candidates and employees.
Please be advised that inquiries or resumes from recruiters will not be accepted.
By submitting your application, you acknowledge that you have read Coupa's Privacy Policy and understand that Coupa receives/collects your application, including your personal data, for the purposes of managing Coupa's ongoing recruitment and placement activities, including for employment purposes in the event of a successful application and for notification of future job opportunities if you did not succeed the first time. You will find more details about how your application is processed, the purposes of processing, and how long we retain your application in our Privacy Policy.
Perks and Benefits
Health and Wellness
- Health Insurance
- Vision Insurance
- Life Insurance
- Dental Insurance
- FSA With Employer Contribution
- HSA With Employer Contribution
- FSA
- HSA
- Mental Health Benefits
- Virtual Fitness Classes
- Short-Term Disability
- Health Reimbursement Account
Parental Benefits
- Non-Birth Parent or Paternity Leave
- Birth Parent or Maternity Leave
- Adoption Leave
- Fertility Benefits
- Family Support Resources
Work Flexibility
- Remote Work Opportunities
- Hybrid Work Opportunities
Office Life and Perks
- Casual Dress
- Snacks
- Some Meals Provided
- Happy Hours
- Company Outings
- Holiday Events
Vacation and Time Off
- Personal/Sick Days
- Volunteer Time Off
- Paid Vacation
- Paid Holidays
- Unlimited Paid Time Off
Financial and Retirement
- 401(K) With Company Matching
- Pension
- Performance Bonus
Professional Development
- Leadership Training Program
- Mentor Program
- Access to Online Courses
- Lunch and Learns
- Promote From Within
Diversity and Inclusion
- Employee Resource Groups (ERG)
- Unconscious Bias Training
- Diversity, Equity, and Inclusion Program