Skip to main contentA logo with &quat;the muse&quat; in dark blue text.

Site Reliability Engineer II

AT IBM
IBM

Site Reliability Engineer II

Bangalore, India

Introduction

A career in IBM Software means you'll be part of a team that transforms our customer's challenges into solutions.

Seeking new possibilities and always staying curious, we are a team dedicated to creating the world's leading AI-powered, cloud-native software solutions for our customers. Our renowned legacy creates endless global opportunities for our IBMers, so the door is always open for those who want to grow their career.

IBM's product and technology landscape includes Research, Software, and Infrastructure. Entering this domain positions you at the heart of IBM, where growth and innovation thrive.

Your role and responsibilities

As an Sr. Engineer II for the Scale & Performance team, you will play a critical role in ensuring the scalability, performance, and reliability of HashiCorp's cloud and enterprise offerings. Your work will be central to enhancing system resilience, optimizing performance at scale, and ensuring HashiCorp's products deliver high availability in dynamic cloud environments. Your experience in Performance engineering, or systems engineering, or reliability engineering or a related field, you will lead efforts to identify performance bottlenecks, address, and mitigate operational challenges before they impact our customers. Your expertise in load testing, performance analysis, and system hardening will ensure that our services meet the highest standards of scale and performance excellence.

Want more jobs like this?

Get Software Engineering jobs in Bangalore, India delivered to your inbox every week.

By signing up, you agree to our Terms of Service & Privacy Policy.


You'll have the opportunity to dive deep into the architecture of HashiCorp's products, including both our cloud and enterprise offerings. You'll take ownership of building and maintaining an advanced automation framework that powers ephemeral, scalable environments, enabling controlled scaling efforts and performance regression testing.

Your work will directly impact how we validate and optimize performance across our systems. From spinning up environments to scaling them dynamically and tearing them down on demand, you'll own the end-to-end lifecycle of our test engines. Beyond that, you'll play an important role in analysing results, creating insightful dashboards, and delivering actionable reports to help teams identify and resolve performance bottlenecks and throttling issues.

What you'll do (responsibilities)

  • Implement best practices for system reliability, including proactive identification of potential failure points and the development of automated mitigations
  • Design and execute comprehensive performance testing strategies to identify performance bottlenecks and scalability limits across our cloud products
  • Work with the engineering teams to identify potential application and infrastructure bottlenecks and suggest changes.
  • Work closely with engineering and product teams to integrate scale and performance readiness into the development lifecycle, enhancing product stability and user satisfaction.
  • Build and refine tools and frameworks for automated testing, environment simulation, and incident reproduction, reducing manual effort and increasing test coverage.
  • Conduct in-depth analysis of testing results, documenting findings and making actionable recommendations for system enhancements.
  • Drive Systemic Improvements to the products by introducing Chaos Testing and partnering with product development teams.
  • Share your knowledge and expertise with team members, fostering a culture of learning and continuous improvement.
  • Develop and implement disaster recovery and backup strategies to ensure data integrity and system resilience.

Required education

Bachelor's Degree

Required technical and professional expertise

  • 8+ years of experience in performance engineering systems engineering, reliability engineering or non functional testing roles with a focus on performance testing, load testing or system scalability.
  • Strong programming skills in Python / Golang and exposure to scripting languages like javascript or shell script
  • Experience with version control systems such as Git.
  • Strong experience with performance testing tools like K6, Artillery, Vegeta, Locust etc or similar tools for deriving key performance metrics for a product
  • Proven track record of leading successful performance testing and optimization initiatives in cloud and on-prem environments.
  • Experience in creating and managing test environments for automated testing.
  • Experience in creating CI/CD pipelines and maintaining quality gates for system testing.
  • Understanding of monitoring and observability tools such as Datadog or Prometheus to develop dashboards indicating metrics that accurately reflect system performance and load break points and regressions.
  • Exposure to cloud technologies ( AWS, Azure, Or GCP) and container technologies like Nomad or Kubernetes and/Or working in a Hybrid cloud environment.
  • Effective communication and collaboration skills, capable of working with cross-functional teams and articulating technical concepts to diverse audiences.

Preferred technical and professional experience

  • You have experience using HashiCorp products (Terraform, Packer, Waypoint, Nomad, Vault, Boundary, Consul).
  • Experience with Javascript development / using any test framework based on Java script is a plus.
  • Experience in driving systemic improvements through Chaos engineering is a plus. #LI-Hybrid

ABOUT BUSINESS UNIT

IBM Software infuses core business operations with intelligence-from machine learning to generative AI-to help make organizations more responsive, productive, and resilient. IBM Software helps clients put AI into action now to create real value with trust, speed, and confidence across digital labor, IT automation, application modernization, security, and sustainability. Critical to this is the ability to make use of all data, because AI is only as good as the data that fuels it. In most organizations data is spread across multiple clouds, on premises, in private datacenters, and at the edge. IBM's AI and data platform scales and accelerates the impact of AI with trusted data, and provides leading capabilities to train, tune and deploy AI across business. IBM's hybrid cloud platform is one of the most comprehensive and consistent approach to development, security, and operations across hybrid environments-a flexible foundation for leveraging data, wherever it resides, to extend AI deep into a business.

YOUR LIFE @ IBM

In a world where technology never stands still, we understand that, dedication to our clients success, innovation that matters, and trust and personal responsibility in all our relationships, lives in what we do as IBMers as we strive to be the catalyst that makes the world work better.

Being an IBMer means you'll be able to learn and develop yourself and your career, you'll be encouraged to be courageous and experiment everyday, all whilst having continuous trust and support in an environment where everyone can thrive whatever their personal or professional background.

Our IBMers are growth minded, always staying curious, open to feedback and learning new information and skills to constantly transform themselves and our company. They are trusted to provide on-going feedback to help other IBMers grow, as well as collaborate with colleagues keeping in mind a team focused approach to include different perspectives to drive exceptional outcomes for our customers. The courage our IBMers have to make critical decisions everyday is essential to IBM becoming the catalyst for progress, always embracing challenges with resources they have to hand, a can-do attitude and always striving for an outcome focused approach within everything that they do.

Are you ready to be an IBMer?

ABOUT IBM

IBM's greatest invention is the IBMer. We believe that through the application of intelligence, reason and science, we can improve business, society and the human condition, bringing the power of an open hybrid cloud and AI strategy to life for our clients and partners around the world.

Restlessly reinventing since 1911, we are not only one of the largest corporate organizations in the world, we're also one of the biggest technology and consulting employers, with many of the Fortune 50 companies relying on the IBM Cloud to run their business.

At IBM, we pride ourselves on being an early adopter of artificial intelligence, quantum computing and blockchain. Now it's time for you to join us on our journey to being a responsible technology innovator and a force for good in the world.

IBM is proud to be an equal-opportunity employer. All qualifiedapplicants will receive consideration for employment without regard to race,color, religion, sex, gender, gender identity or expression, sexualorientation, national origin, caste, genetics, pregnancy, disability,neurodivergence, age, veteran status, or other characteristics. IBM is alsocommitted to compliance with all fair employment practices regardingcitizenship and immigration status.

OTHER RELEVANT JOB DETAILS

When applying to jobs of your interest, we recommend that you do so for those that match your experience and expertise. Our recruiters advise that you apply to not more than 3 roles in a year for the best candidate experience. For additional information about location requirements, please discuss with the recruiter following submission of your application.

Client-provided location(s): Bengaluru, Karnataka, India
Job ID: IBM-42528
Employment Type: Other

Company Videos

Hear directly from employees about what it is like to work at IBM.