Facebook Capacity Lab Lead in Prineville, Oregon


Facebook's mission is to give people the power to build community and bring the world closer together. Through our family of apps and services, we're building a different kind of company that connects billions of people around the world, gives them ways to share what matters most to them, and helps bring people closer together. Whether we're creating new products or helping a small business expand its reach, people at Facebook are builders at heart. Our global teams are constantly iterating, solving problems, and working together to empower people around the world to build community and connect in meaningful ways. Together, we can help people build stronger communities — we're just getting started.


Facebook is seeking a forward thinking, experienced Capacity Lab Lead to join the Hardware Release to Production team. This position is full-time and will be based in Prineville, Oregon. We seek a Capacity Lab Lead with advanced hands-on technical skills in Server Hardware, Linux, and Networking, ideally in a data center environment. Having depth and breadth of knowledge of managing servers in a large-scale distributed environment is a core competency of this individual. The candidate should also have deep knowledge and experience in one of the following core areas: Networking, Project Management, Tooling and Automation, Hardware, Systems Administration, Validation, or Data Center Operations.

Required Skills:

  1. Perform general troubleshooting and repairs on Linux-based data center hardware products.

  2. Work with hardware design, validation teams, and vendors to test and deploy new server, storage, and networking products in the data center infrastructure.

  3. Test and troubleshoot new hardware products and components.

  4. Provision, decommission, and manage hardware test racks in a production data center environment.

  5. Identify, characterize, and root cause hardware failures and error conditions.

  6. Assist hardware engineers by running experiments, collecting data, and providing feedback on failure symptoms for lab and production servers.

  7. Provide cross-functional communication with other technical operations group.

  8. Provide serviceability feedback on new hardware and coordinate road shows of early hardware for Site Operations teams.

  9. Serve as the Site Operations team's local point of contact and subject matter expert regarding hardware.

  10. Maintain an efficient, orderly hardware test lab operation within the production data center.

Minimum Qualifications:

  1. BS or BA in technical field or commensurate experience

  2. 6+ years of experience with Linux and hardware systems support in an Internet operations environment

  3. Experience working with Linux (Red Hat/CentOS, SUSE, Ubuntu, Debian, Gentoo), or Unix (Solaris, FreeBSD, OSX)

  4. Experience supervising, training, mentoring, and leading other technicians

  5. Knowledge of out-of-band/lights-out server communication methods, such as IPMI and serial console

  6. Communication experience

  7. Project management experience

  8. Ability to lift/move 20-30 lbs. equipment on a daily basis

Preferred Qualifications:

  1. Bash, PHP, Python, or Perl scripting experience

Industry: Internet

Equal Opportunity: Facebook is proud to be an Equal Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. If you need assistance or an accommodation due to a disability, you may contact us at accommodations-ext@fb.com or you may call us at +1 650-308-7837.