Facebook Data Engineer, Building 8 in Menlo Park, California


Facebook's mission is to give people the power to share, and make the world more open and connected. Through our growing family of apps and services, we're building a different kind of company that helps billions of people around the world connect and share what matters most to them. Whether we're creating new products or helping a small business expand its reach, people at Facebook are builders at heart. Our global teams are constantly iterating, solving problems, and working together to make the world more open and accessible. Connecting the world takes every one of us—and we're just getting started.


Building 8 brings together world-class experts to develop and ship groundbreaking products at the intersection of hardware, software, and content. We have a clear mandate to ship products at scale. In particular, seemingly impossible products that define new categories and that advance Facebook's mission of connecting the world. The B8 team will apply DARPA-style breakthrough development at the intersection of ambitious science and product development. It will operate on aggressive, fixed timelines, with extensive use of partnerships in universities, small and large businesses. The Technical Project Team has the responsibility for scoping the effort from inception to product, communicating it to others, creating the partnerships necessary to achieve targeted results, and hitting key milestones. We are looking for an experienced Data Engineer that is slightly impatient and willing to face down their fear of failure to accomplish bold things. This is a two-year position based in our Menlo Park office.

Required Skills:

  1. Interface with Technical Project Lead, engineers, and team members to understand product goals and data needs

  2. Build data expertise and own data quality for the awesome pipelines you build

  3. Architect, build and launch new data models that provide intuitive analytics

  4. Design, build and launch extremely efficient & reliable data pipelines to move data (both large and small amounts) to our Data Warehouse

  5. Work across multiple teams in high visibility roles and own the solution end-to-end

  6. Write Hive queries and pull data from Laser, TAO and other data sources

  7. Create scrapers to pull content which can’t be found in existing tables

Minimum Qualifications:

  1. BS or MS degree in Computer Science or a related technical field

  2. 4+ years of Java and/or Python development experience

  3. 4+ years of SQL (Oracle, Vertica, Hive, etc) experience

  4. Ability to write reusable code components

  5. Understanding of dataswarm pipelines and experience in running them

  6. Writing different dataswarm operators/tasks, especially Hive operators

  7. Knowledge of where various Facebook user data lives and various ways of accessing them through dataswarm operators

Preferred Qualifications:

  1. Experience working in www

  2. Experience working with privacy sensitive data

Industry: Internet

Equal Opportunity: As part of our dedication to the diversity of our workforce, Facebook is committed to Equal Employment Opportunity without regard for race, color, national origin, ethnicity, gender, protected veteran status, disability, sexual orientation, gender identity, or religion. We are also committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation due to a disability, you may contact us at accommodations-ext@fb.com or you may call us at 1+650-308-7837.