High-Performance Computing (HPC) Architect

Applied Materials

Bangalore, India

About Applied

Applied Materials is the leader in materials engineering solutions used to produce virtually every new chip and advanced display in the world. Our expertise in modifying materials at atomic levels and on an industrial scale enables customers to transform possibilities into reality. At Applied Materials, our innovations make possible the technology shaping the future.

Our Team

Our team is developing a high-performance computing solution for low-latency, high-throughput image processing and deep-learning workloads that will enable our chip manufacturing process control equipment to offer differentiated value to our customers.

Your Opportunity

As an HPC Architect, you will have the opportunity to architect high-performance computing solutions from scratch and to design and optimize all aspects (compute, memory, networking, storage) for a lower cost of ownership.


Roles and Responsibilities

  • As an architect, you will be responsible for designing HPC infrastructure solutions, including compute, networking, storage, and workload management components.


  • You will work closely with cross-functional teams, including hardware, software, product management, and business stakeholders, to understand compute workloads and translate them into platform architectures and designs that meet business needs.


  • You will create and maintain detailed system architecture diagrams and specifications.


  • You will evaluate and select appropriate hardware and software components for HPC environments.


  • You will install, configure, and maintain HPC systems, including hardware, software, and networking components.


  • You will develop and implement automation scripts for system management and deployment.


  • You will act as a subject-matter expert to unblock dependent teams in the HPC domain.


  • You will develop system benchmarks, profile systems to understand bottlenecks, and optimize workflows and processes to improve cost of ownership.


  • You will identify and mitigate technical risks and issues throughout the HPC development life cycle.


  • You will ensure that the compute cluster is resilient, reliable, and maintainable.


  • You will be expected to stay abreast of the latest HPC technologies, including hardware, software, and networking solutions.


  • Your primary focus will be to understand the compute workload and design an HPC cluster with the right combination of nodes, CPUs/GPUs, memory, interconnects, and storage to achieve optimum performance at minimum cost of ownership.

Our Ideal Candidate

Someone who has the drive and passion to learn quickly and the ability to multitask and switch contexts based on business needs.

Qualifications

  • In-depth experience with Linux system administration and hardware/software configuration.


  • Strong knowledge of HPC technologies, including cluster computing, high-speed interconnects (InfiniBand, RoCE), and parallel filesystems (Lustre, GPFS, BeeGFS, etc.).


  • Experience in creating and maintaining operating system images with different installation and boot schemes.


  • Strong proficiency with automation tools such as Ansible, Chef, and SaltStack, and with scripting languages (Python and Bash).


  • Experience in creating and maintaining storage solutions with different RAID configurations.


  • Ability to design storage solutions for different IOPS requirements and access patterns (random vs. sequential read/write), and to tune storage and filesystems for better performance.


  • Good knowledge of networking concepts, including IP addressing, routing, protocols, switch configuration for RDMA, VLAN configuration, and network bonding.


  • Good knowledge of virtualization and of hardware and software hypervisors.


  • Good knowledge of containerization technologies such as Docker and Singularity.


  • Experience in software-defined networking and storage.


  • Experience in setting up remote management protocols such as IPMI and Redfish.


  • Experience in setting up and using monitoring systems such as Prometheus and Grafana.


  • Experience in system profiling and custom tuning for target workloads to achieve higher performance and lower cost of ownership.


  • Very good written and verbal communication skills.


  • Very good at writing technical documentation that serves as manuals for non-experts in the field.

Additional Qualifications:

  • Experience in HPC cluster management and workload orchestration software (e.g., SLURM, Torque, LSF).


  • Experience in setting up deep-learning training/inference solutions.


  • Experience with private cloud infrastructure such as Kubernetes, OpenStack, and CloudStack.


  • Experience in distributed high-performance computing and parallel programming frameworks.


  • Good knowledge of low-latency and high-throughput data transfer technologies (RDMA over InfiniBand or RoCE).

Education:

Bachelor's degree or higher in computer science or a related discipline.

Applied Materials is committed to diversity in its workforce including Equal Employment Opportunity for Minorities, Females, Protected Veterans and Individuals with Disabilities.

Qualifications

Education: Bachelor's Degree

Years of Experience: 4 - 7 Years

Additional Information

Shift: Day (India)

Relocation Eligible: No

Referral Payment Plan: Employee Referral (Standard)

Applied Materials is an Equal Opportunity Employer committed to diversity in the workplace. All qualified applicants will receive consideration for employment without regard to race, color, national origin, citizenship, ancestry, religion, creed, sex, sexual orientation, gender identity, age, disability, veteran or military status, or any other basis prohibited by law.

Client-provided location(s): Bengaluru, Karnataka, India; Chennai, Tamil Nadu, India
Job ID: Applied_Materials-R2514185
Employment Type: Full Time

Perks and Benefits

  • Health and Wellness

    • Health Insurance
    • Dental Insurance
    • Vision Insurance
    • Life Insurance
    • Short-Term Disability
    • Long-Term Disability
    • FSA With Employer Contribution
    • HSA With Employer Contribution
    • Health Reimbursement Account
    • FSA
    • HSA
    • On-Site Gym
    • Pet Insurance
    • Mental Health Benefits
  • Parental Benefits

    • Birth Parent or Maternity Leave
    • Fertility Benefits
    • Adoption Leave
    • Non-Birth Parent or Paternity Leave
    • Adoption Assistance Program
    • Family Support Resources
    • On-site/Nearby Childcare
  • Work Flexibility

    • Flexible Work Hours
    • Work-From-Home Stipend
  • Office Life and Perks

    • On-Site Cafeteria
    • Commuter Benefits Program
    • Casual Dress
    • Holiday Events
  • Vacation and Time Off

    • Paid Vacation
    • Paid Holidays
    • Personal/Sick Days
    • Unlimited Paid Time Off
    • Leave of Absence
  • Financial and Retirement

    • 401(K) With Company Matching
    • Stock Purchase Program
    • 401(K)
    • Performance Bonus
    • Relocation Assistance
    • Financial Counseling
  • Professional Development

    • Tuition Reimbursement
    • Access to Online Courses
    • Internship Program
    • Work Visa Sponsorship
    • Associate or Rotational Training Program
    • Promote From Within
    • Mentor Program
    • Shadowing Opportunities