Capacity Lab Lead

Facebook's mission is to give people the power to build community and bring the world closer together. Through our family of apps and services, we're building a different kind of company that connects billions of people around the world, gives them ways to share what matters most to them, and helps bring people closer together. Whether we're creating new products or helping a small business expand its reach, people at Facebook are builders at heart. Our global teams are constantly iterating, solving problems, and working together to empower people around the world to build community and connect in meaningful ways. Together, we can help people build stronger communities - we're just getting started.

Facebook is seeking a forward thinking, experienced Capacity Lab Lead to join the Hardware Release to Production team. This position is full-time and will be based in Prineville, Oregon. We seek a Capacity Lab Lead with advanced hands-on technical skills in Server Hardware, Linux, and Networking, ideally in a data center environment. Having depth and breadth of knowledge of managing servers in a large-scale distributed environment is a core competency of this individual. The candidate should also have deep knowledge and experience in one of the following core areas: Networking, Project Management, Tooling and Automation, Hardware, Systems Administration, Validation, or Data Center Operations.

RESPONSIBILITIES

  • Perform general troubleshooting and repairs on Linux-based data center hardware products.
  • Work with hardware design, validation teams, and vendors to test and deploy new server, storage, and networking products in the data center infrastructure.
  • Test and troubleshoot new hardware products and components.
  • Provision, decommission, and manage hardware test racks in a production data center environment.
  • Identify, characterize, and root cause hardware failures and error conditions.
  • Assist hardware engineers by running experiments, collecting data, and providing feedback on failure symptoms for lab and production servers.
  • Provide cross-functional communication with other technical operations group.
  • Provide serviceability feedback on new hardware and coordinate road shows of early hardware for Site Operations teams.
  • Serve as the Site Operations team's local point of contact and subject matter expert regarding hardware.
  • Maintain an efficient, orderly hardware test lab operation within the production data center.
MINIMUM QUALIFICATIONS
  • BS or BA in technical field or commensurate experience
  • 6+ years of experience with Linux and hardware systems support in an Internet operations environment
  • Experience working with Linux (Red Hat/CentOS, SUSE, Ubuntu, Debian, Gentoo), or Unix (Solaris, FreeBSD, OSX)
  • Experience supervising, training, mentoring, and leading other technicians
  • Knowledge of out-of-band/lights-out server communication methods, such as IPMI and serial console
  • Communication experience
  • Project management experience
  • Ability to lift/move 20-30 lbs. equipment on a daily basis
PREFFERED QUALIFICATIONS
  • Bash, PHP, Python, or Perl scripting experience


Meet Some of Facebook's Employees

Lauren W.

Global Marketing Lead, Facebook Blueprint

As the marketing lead for Facebook’s Blueprint program, Lauren focuses on building awareness around the program and the adoption of education and training by businesses and advertisers.

Ariane J.

Software Engineer

Ariane works to improve Android performance for various Facebook products. She drives the entire tooling system and the way it should operate, and fixes logging and instrumentation APIs.


Back to top