Senior Software Engineer, HPC Systems Management & Monitoring

Senior Software Engineer, HPC Systems Management & Monitoring

Job Description:

You will be responsible for defining, designing and prototyping the system management and monitoring infrastructure for exascale high performance computing (HPC) systems. The volume of raw data generated by these systems will overwhelm existing system monitoring and processing tools. You will work closely with system architects and garner customer input to establish the requirements, and validate design decisions. You will be responsible for periodic technical reports and collaboration across the project team.

Responsibilities

  • Develop requirements and validate conclusions internally and externally.
  • Identify and evaluate new technologies, techniques and solutions to include as components in the overall solution, adapting where necessary.
  • Efficiently and effectively prototype large systems, to develop models and validate design decisions.
  • Drives innovation and integration of new technologies and leverage open source software (OSS) into HPC projects and activities.
  • Reviews and evaluates designs and project activities for compliance with systems design and development guidelines and standards; provides tangible feedback to improve product quality and mitigate failure risk.
  • Provides domain-specific expertise and overall software systems leadership and perspective to cross-organization projects, programs, and activities.
  • Provides guidance and mentoring to less- experienced staff members.

Education and Experience

  • Bachelor's or Master's degree in Computer Science, Information Systems, or equivalent desirable
  • Minimum 6-10 years of experience

Knowledge and Skills

  • Deep knowledge of HPC operational activities, including system operation and problem diagnosis.
  • Extensive experience with multiple software systems design tools and languages.
  • Excellent analytical and problem solving skills.
  • Experience in overall architecture of software systems for HPC products and solutions.
  • Designing and integrating software systems running on multiple platform types into overall architecture.
  • Evaluating forms and processes for software systems testing and methodology, including writing and execution of test plans, debugging, and testing scripts and tools.
  • Excellent written and verbal communication skills; mastery in English and local language. Ability to effectively communicate product architectures, design proposals and negotiate options at senior management levels.

Job:

Engineering

Job Level:

Expert

Hewlett Packard Enterprise is EEO F/M/Protected Veteran/ Individual with Disabilities.

HPE will comply with all applicable laws related to the use of arrest and conviction records, including the San Francisco Fair Chance Ordinance and similar laws and will consider for employment qualified applicants with criminal histories.


Meet Some of Hewlett Packard Enterprise's Employees

Rebecca W.

Senior Manager, Community Engagement

Rebecca is responsible for educating and inspiring HPE employees to become involved in the communities around them, then ensuring they’re each recognized for their contributions.

Cat G.

Research Scientist

Cat works in the HPE lab, exploring and designing new devices that become the building blocks for the computation creations of the future.


Back to top