AI Infrastructure Architect (Habana Labs)

3+ months ago | Austin, TX

Job Description
Habana Labs is hiring an AI Infrastructure Architect to scale up its AI infrastructure. The ideal candidate will have experience developing architectures for completely new data centers. The design will encompass internal and external network connectivity, security, provisioning, authentication, DNS, and other infrastructure tools. You will lead multiple teams in design and implementation. You should have experience with commercial and open-source tools for orchestration and automation, along with hands-on experience running a data center environment. As part of the team, you will work with and lead other specialists and define Habana Labs' capabilities to deploy leading solutions for a broad range of Deep Learning Accelerator-based applications, including directly supporting open-source initiatives.
What you'll be doing:
• Collaborate with multiple teams to understand their orchestration, scaling, security, connectivity and deployment requirements
• Design and coordinate expansions of both cloud and local data-center based environments
• Design, deploy, and operate our orchestration layer over both bare metal and cloud service providers, including future expansions and upgrades
• Collaborate with internal and external researchers and leaders to build scale-out CI/CD solutions for our teams and partners in the open-source community
• Define automation and tools that will increase the efficiency of internal teams and external developers who use Habana Compilers and open-source software


Minimum Qualifications
A BS in Computer Engineering, Computer Science, Information Systems, or a related field with 9+ years of experience
3+ years of experience in Linux system administration, with strong Bash scripting skills and experience using automation tools (Ansible, MaaS, Puppet, Terraform, etc.) to deploy and manage cluster and cloud environments
3+ years of experience with various programming languages such as Python, C, C++, and Java, and with their build tools and environments
2+ years of experience building, managing, and deploying Docker images at scale, including converting existing processes into a Docker container
1+ years of experience with at least one job scheduler such as LSF, SLURM, Mesos/Marathon, Kubernetes, or Docker Swarm
3+ years of CI/CD experience using Jenkins CI, GitLab CI, or similar tools to build and maintain a fully automated build, test, and deploy pipeline
Preferred Qualifications
3+ years of experience with a Python package management system (pip, Conda), including dependency management, package specifications, versioning, and package building for both platform-independent and native binary packages
1+ years of experience with Layer 3 networks, BGP, Calico, VPNs, and external connectivity

Inside this Business Group
The Data Center Group (DCG) is at the heart of Intel's transformation from a PC company to a company that runs the cloud and billions of smart, connected computing devices. The data center is the underpinning for every data-driven service, from artificial intelligence to 5G to high-performance computing, and DCG delivers the products and technologies (spanning software, processors, storage, I/O, and networking solutions) that fuel cloud, communications, enterprise, and government data centers around the world.

Intel Corporation will require all new U.S. employees to be fully vaccinated for COVID-19 as a condition of hire unless they have an approved accommodation in place under applicable law. Newly hired employees will be required to provide proof of vaccination prior to their start date.

Posting Statement

All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, sex, national origin, ancestry, age, physical or mental disability, medical condition, genetic information, military and veteran status, marital status, pregnancy, gender, gender expression, gender identity, sexual orientation, or any other characteristic protected by local law, regulation, or ordinance.
US | Experienced Hire | JR0159656 | Austin | Data Center Group

Job ID: intel-JR0159656