Meta's AI Training and Inference Infrastructure is growing exponentially to support ever increasing use cases of AI. We need to build, scale and evolve our network infrastructure that connects myriads of GPUs together. Simple, elegant, and scalable network design, automation, and data analytics are the keys to meeting our demands. In this role, you will be part of a team that is responsible for conceiving design solutions, developing, testing and deploying network software, systems, and tools that keep the Data Center network operating at maximum reliability, scalability, and efficiency.Engineers in this role are hybrid software and network engineers who leverage their network engineering skills to research and design new generation of network architectures and related systems and use their software development skills to reliably introduce them at scale in production.
Want more jobs like this?
Get jobs in Seattle, WA delivered to your inbox every week.
Production Network Engineer Responsibilities:
- Partner with network hardware, software, and vendor teams on the design and development of network topologies and network platforms (switch and optics)
- Codify the network designs by partnering with the in-house Software Engineer, Tooling, Planning, Simulation, and Delivery teams
- Develop test automation frameworks integrated in Continuous Integration/Continuous Deployment pipeline to qualify network hardware and software stack for both in-house Facebook Open Switching System(FBOSS) and Vendor platforms before push in production
- Develop tests that qualify complex network migration procedures in lab/emulation before executing the same in production
- Work closely with our hardware, software and sourcing teams to develop new networking solutions and influence the future of networking and its associated infrastructure
- Be oncall to learn from real world production challenges and take the lessons to improve current and future generation products
- Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
- 6+ years of experience working on networks supporting large scale training workloads
- Experience in designing, deploying and operating datacenter networks at scale
- Experience coding in languages like Python, C++, Go
- Experience in network automation software leveraging software defined networking principles
- Experience configuring and troubleshooting routing and switching protocols (BGP, IS-IS, OSPF, MPLS, RSVP-TE)
- Working knowledge of network protocols (TCP/UDP, DHCP, DNS) and experience with IPv4 and IPv6
- Understanding of AI training workloads and demands they exert on networks
- Understanding of RDMA congestion control mechanisms on RoCE Networks
- Working knowledge of 40/100/400G Ethernet and CWDM, DWDM and optical transport network technologies
- Understanding of different Optics and internals of a switch ASIC
Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today-beyond the constraints of screens, the limits of distance, and even the rules of physics.
Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. Meta participates in the E-Verify program in certain locations, as required by law. Please note that Meta may leverage artificial intelligence and machine learning technologies in connection with applications for employment.
Meta is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at accommodations-ext@fb.com.
$147,000/year to $208,000/year + bonus + equity + benefits
Individual compensation is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base hourly rate, monthly rate, or annual salary only, and do not include bonus, equity or sales incentives, if applicable. In addition to base compensation, Meta offers benefits. Learn more about benefits at Meta.