Job Description:
This Director of Distributed Compute Operations role will be responsible for managing a team of approximately 10 System Administrators to support critical function i.e. Server Operations in Enterprise Infrastructure & Operations(EI&O) including Incident Management support, change execution with a focus on delivering stability, day-to-day operations & managing several operations KPIs and Metrics.
In this role, you will work with senior technology leadership, your peers on other operations teams day to day. You will be representing Distributed Compute and Storage(DCS) on Incident calls where you will be expected to work with your teams to provide meaningful insights of infrastructure, partner with application and DB teams to work towards common goal of mitigating issue and follow though Problem Management process to ensure any learnings are captured. Other key responsibilities include partnering with Global Operations leader to build roadmaps, and delivering highly effective, cost effective, highly reliable solutions to meet the demands of our internal customers. We are truly a global team so you will also be working with teams out of our India offices on a day-to-day basis.
Want more jobs like this?
Get jobs delivered to your inbox every week.
As a leading member of this team, you will continue to evaluate and find opportunities to automate, promote DevOps and EngOps work. You will influence the direction of distributed compute services by providing your input on key compute services. You will be required to present Operations priorities in quarterly meetings and regularly report on the progress of critical projects. You will be leading managing Linux, Unix, Windows operating systems, Hypervisors, Hardware, security compliance patching and responsible for stability of the environment.
The Expertise You Have and The Skills You Bring
- Bachelor's in computer science or engineering or equivalent
- 15-20 years of IT industry experience, 5-10 years of Infrastructure management & engineering experience
- Sound knowledge on Operating System (Linux and AIX), Hypervisor (OpenStack private cloud and VMware), Hardware (HP, Dell, IBM AIX)
- Production Support Infrastructure Management (compute environment) with dedicated focus on Incident, Change, Problem Management and Release practices
- Vendor management and regular service reviews for SLA achievement, Service quality assurance, provide feedback to improve vendor service and control
- Strategic planning and executing IT infrastructure projects keeping future requirements in mind and identifying opportunities for automation, cost savings, cycle time reduction and service improvements
- Preparation of strategic roadmap and overseeing its execution to meet our objectives by collaborating with stakeholders
- Proven leadership through empowerment of individuals/teams, innovation, collaboration and must be eager to learn and teach
- Proven track record of leading, mentoring, and developing associates
- Problem identification and resolution skills
- Collaborative approach to gain consensus and support
- Experience in managing Infrastructure team
- Expert in driving the crisis / high severity incident calls
- Expert in Agile methodology & IT Service Management
- Good understanding of infrastructure tech stack (Linux, AIX, Windows, VMWare, OpenStack, HP, Dell & AIX server hardware). Experience in Oracle Linux Virtualization Manager (OLVM) is a huge plus
- Good knowledge & understanding of public cloud AWS & Azure. Certification will be an added advantage
- Identify gaps and process improvements to enhance the stability of the environment
- Collaborating with various vendors (Redhat, HP, Dell, IBM AIX etc.) on RCA's and implementing the solution
- Demonstrate and encourage innovative problem solving & collaborative decision making
- Manage team members related talent reviews, performance, compensation, goal setting, development & top talent recruitment practices
- Communicate, collaborate and build relationships with partners and key stakeholders including DCS engineering and other operations leaders. We strive to be fastest to deliver and easiest to consume technology services.
Certifications:
Category:
Information Technology
Fidelity's hybrid working model blends the best of both onsite and offsite work experiences. Working onsite is important for our business strategy and our culture. We also value the benefits that working offsite offers associates. Most hybrid roles require associates to work onsite every other week (all business days, M-F) in a Fidelity office.
Please be advised that Fidelity's business is governed by the provisions of the Securities Exchange Act of 1934, the Investment Advisers Act of 1940, the Investment Company Act of 1940, ERISA, numerous state laws governing securities, investment and retirement-related financial activities and the rules and regulations of numerous self-regulatory organizations, including FINRA, among others. Those laws and regulations may restrict Fidelity from hiring and/or associating with individuals with certain Criminal Histories.