Who We Are
At Kyndryl, we design, build, manage and modernize the mission-critical technology systems that the world depends on every day. So why work at Kyndryl? We are always moving forward - always pushing ourselves to go further in our efforts to build a more equitable, inclusive world for our employees, our customers and our communities.
The Role
"Who We Are At Kyndryl, we design, build, manage and modernize the mission-critical technology systems that the world depends on every day. So why work at Kyndryl? We are always moving forward - always pushing ourselves to go further in our efforts to build a more equitable, inclusive world for our employees, our customers and our communities. The Role Skytap is an IaaS provider deployed globally in Azure that's just joined forces with infrastructure services powerhouse Kyndryl, and the future has never looked brighter! This exciting new chapter combines our cutting-edge cloud platform with Kyndryl's robust infrastructure expertise, creating a unique opportunity to deliver smarter, more scalable capabilities for businesses worldwide. As we integrate the best of both companies, we're looking for talented individuals to help us define the next chapter of our journey. If you're passionate about building products that make a real impact and ready to bring fresh ideas, we want you to be part of our growing team. We're creating a new era of seamless, high-performance solutions that drive real innovation and modernization for clients across industries. Let's build that future together! The DevOps team is part of Skytap on Kyndryl, which builds a public cloud platform product as a dedicated software development team within the broader organization.
Want more jobs like this?
Get jobs delivered to your inbox every week.
Responsibilities:
• Storage and Infrastructure Management:
o Deploy, manage, and optimize storage solutions using ZFS and iSCSI across global data centers.
o Implement and maintain automation and monitoring tools such as Puppet, Grafana, Zabbix, and Jenkins to enhance system performance and reliability.
o Utilize storcli for managing server storage configurations.
o Linux Systems Expertise:
o Manage and maintain Ubuntu-based systems, ensuring security and compliance.
o Conduct performance tuning and capacity planning for Linux servers.
o Develop and implement self-healing systems and automated recovery processes on Linux platforms.
• Reliability Engineering:
o Develop and implement strategies for improving system availability and performance.
o Conduct root-cause analysis and incident response for storage-related issues.
o Collaborate with SDEs to support software development infrastructure and deploy new product features.
• Operational Excellence:
o Manage on-call rotations, leveraging automation to minimize operational load.
o Develop and maintain documentation for operational procedures and best practices.
o Drive continuous improvement and innovation in storage operations.
• Collaboration and Communication:
o Work closely with cross-functional teams, including SDEs and infrastructure engineers.
o Provide technical guidance and support for storage-related challenges.
o Present data-driven insights to stakeholders to support decision-making.
• Hiring:
o Manage recruiting and hiring pipeline, including working with internal and external recruiters, reviewing resumes, designing interview loops, and directly interviewing candidates.
• Team Growth:
o Develop and grow talent through effective mentoring, coaching, succession planning and retention strategies for key talent. Provide semi-annual performance reviews, determine promotions and compensation changes, delegate and give direct feedback weekly in 1on1s, and produce career growth plans for individual direct report employees.
• Processes and Documentation:
o Design, implement, and improve team processes including task triage, tooling, knowledge sharing, brownbags, team review policies, quality assurance, etc. Produce and review documentation artifacts, team health and status reports, and technical designs.
Your future at Kyndryl
There are lots of opportunities to gain certification and qualifications on the job, and you'll continuously grow as a Cloud Hyperscaler. Many of our Infrastructure Specialists are on a path toward becoming either an Architect or Distinguished Engineer, and there are opportunities at every skill level to grow in either of these directions
Who You Are
You're good at what you do and possess the required experience to prove it. However, equally as important - you have a growth mindset; keen to drive your own personal and professional development. You are customer-focused - someone who prioritizes customer success in their work. And finally, you're open and borderless - naturally inclusive in how you work with others.
Required Professional Experience:
8 to 12 Years of experience
o Proven experience in site reliability engineering, with a focus on storage solutions and Linux systems.
o Strong knowledge of ZFS, iSCSI, and Ubuntu.
o Expertise in automation and configuration management tools (e.g., Bash, Ansible, Puppet).
o Familiarity with Hashicorp tools, SSH, and LDAP.
o Experience with storcli for storage configuration.
o Experience with monitoring tools such as Grafana, Zabbix, InfluxDB.
o Ability to conduct root-cause analysis and implement effective solutions.
o High level of ownership for assigned team problem space, including driving predictable delivery, continuous iteration and improvement, consistent and effective communication team, gracefully coordinating with upstream and downstream stakeholders, and project status.
o Project management skills, including experience with task estimation, scheduling, Gantt charts, unblocking dependencies, Agile methodologies (such as sprint planning or Scrum), being detail-oriented, and keeping projects on track. Ability to define broad, complex problems and break into discrete, specific tasks that can be delegated.
o Documentation skills including writing standard operating procedures, design docs, policy documents, runbooks.
Preferred Professional Experience:
o Ability to program in Python, Rust.
o Software development experience, including technical design and deployment.
Being You
Diversity is a whole lot more than what we look like or where we come from, it's how we think and who we are. We welcome people of all cultures, backgrounds, and experiences. But we're not doing it single-handily: Our Kyndryl Inclusion Networks are only one of many ways we create a workplace where all Kyndryls can find and provide support and advice. This dedication to welcoming everyone into our company means that Kyndryl gives you - and everyone next to you - the ability to bring your whole self to work, individually and collectively, and support the activation of our equitable culture. That's the Kyndryl Way.
What You Can Expect
With state-of-the-art resources and Fortune 100 clients, every day is an opportunity to innovate, build new capabilities, new relationships, new processes, and new value. Kyndryl cares about your well-being and prides itself on offering benefits that give you choice, reflect the diversity of our employees and support you and your family through the moments that matter - wherever you are in your life journey. Our employee learning programs give you access to the best learning in the industry to receive certifications, including Microsoft, Google, Amazon, Skillsoft, and many more. Through our company-wide volunteering and giving platform, you can donate, start fundraisers, volunteer, and search over 2 million non-profit organizations. At Kyndryl, we invest heavily in you, we want you to succeed so that together, we will all succeed.
Get Referred!
If you know someone that works at Kyndryl, when asked 'How Did You Hear About Us' during the application process, select 'Employee Referral' and enter your contact's Kyndryl email address.