Senior Data Engineer - Infrastructure Data Science and Engineering

Los Gatos, California

Applications have closed

Netflix

Watch Netflix movies & TV shows online or stream right to your smart TV, game console, PC, Mac, mobile, tablet and more.

Netflix is enjoyed by more than 200 million households globally, entertaining new audiences every day.  Netflix’s cloud platforms and internal tools play a key role in making the Netflix experience great.  We are one of the pioneers of leveraging a highly scalable cloud footprint to deliver delightful experiences to our members worldwide. The Infrastructure Data Science and Engineering team leads the analytic innovation that enables architecting secure, scalable, and efficient platforms. We partner with our internal engineering teams to empower them through intelligent analytic products and integrated models that enable smarter decision making. 
Our team currently has 4 pods, and a successful candidate would have the opportunity to work across any of these areas:

  • Platform Innovation - focuses on large initiatives within the platform engineering org aimed at human efficiency and reduced complexity, enabling engineers and scientists to do their best and most innovative work. Our main partners in this space are the Productivity Engineering teams, which provide the platforms to build, deploy, and run the jobs, services, and applications that enable the Netflix product (and internal products like studio and finance tools, and data products).
  • Platform Scale - focuses on enabling our platform to scale efficiently to support new business areas and on positioning us to reach 100s of millions of new members as the Netflix business grows around the world. We look for innovative ways to help the platform grow and scale for our needs.
  • Security - focuses on keeping the Netflix service secure for our members as well as securing our corporate infrastructure for employees and contractors.
  • Core DSE - a software-engineering-heavy group that builds tooling for our Analysts, Data Scientists and Data Engineers, enabling them to focus more effectively on business problems rather than the boilerplate work needed to build, develop and deploy software at Netflix.

We are looking for a Senior Data Engineer to build reliable, distributed data pipelines and intuitive data products that let our stakeholders use data easily and effectively. As part of this team, you will work on diverse data technologies such as Spark, Presto, Flink, Kafka and others to build insightful, scalable and robust data pipelines; write ETL jobs to aggregate data; and build high quality data models that describe the entities, interactions, and usage of Netflix’s cloud infrastructure, such as our compute platform (internally called Titus), our custom big data platform, productivity tools (like Spinnaker), and network traffic (AWS traffic, security group interactions, etc.). The Infrastructure DSE organization supports the cloud compute platform, demand and capacity planning, the networking infrastructure, and the SRE function for Netflix.

The ideal candidate will have a strong background in distributed data processing, have great and demonstrable data intuition, and share our passion for continuously improving the ways we use data to make the Netflix infrastructure better. 

Who you are:

  • Passionate about building intuitive data models and an expert in distributed data processing patterns 
  • Highly proficient in at least one of Java, Python or Scala
  • Comfortable with complex SQL
  • Expert in engineering data pipelines using big data technologies (Hive, Presto, Spark, Flink, etc.) on large-scale data sets, demonstrated through years of experience
  • Understand the Data Lifecycle and concepts such as lineage, governance, privacy, retention, anonymity, etc.
  • Conceptually familiar with AWS cloud resources (S3, EC2, RDS, etc.)
  • Excel at taking vague requirements and crystallizing them into scalable data solutions 
  • Excited about operating independently, demonstrating excellence, and learning new technologies and frameworks 

What you will do:

  • Engineer efficient, adaptable and scalable data pipelines to process structured and unstructured data 
  • Develop subject-matter expertise in the infrastructure domain at Netflix 
  • Act as a thought partner to the platform and/or security engineering team, understand their challenges, and make opinionated recommendations that empower them with data solutions to efficiently scale Netflix infrastructure and tools 
  • Maintain and rethink existing datasets and pipelines to service a wider variety of use cases 
  • Enable smart analytics by building robust, reliable, and useful data sets that can power various analytic techniques like regression, classification, clustering, etc.
  • Join a stunning team of data experts with diverse skill sets, and deliver excellent solutions that better enable our decision-making process

A few more things to know:

Our culture is unique and we live by our values, so it's worth learning more about Netflix at jobs.netflix.com/culture. We regularly share examples of our work on our tech blog at https://netflixtechblog.com. You will need to be comfortable working in the most agile of environments. Requirements will be vague. Iterations will be rapid. You will need to be nimble and take smart risks. 

Tags: Agile AWS Big Data Classification Data pipelines EC2 Engineering ETL Excel Finance Flink Kafka Pipelines Python Scala Security Spark SQL Unstructured data

Perks/benefits: Career development

Region: North America
Country: United States
Category: Engineering Jobs
