Senior Data Engineer - Infrastructure Data Science and Engineering
Los Gatos, California
Applications have closed
Netflix
Watch Netflix movies & TV shows online or stream right to your smart TV, game console, PC, Mac, mobile, tablet and more.Our team currently has 4 pods, and a successful candidate would have the opportunity to work across any of these areas:Platform Innovation - focuses on large initiatives within the platform engineering org focused on human efficiency and reduced complexity enabling engineers and scientists to do their best and most innovative work. Our main partners in this space are the Productivity Engineering teams which provide the platforms to build, deploy, and run the jobs, services, and applications that enable the Netflix product (and internal products like studio and finance tools, and data products).Platform Scale - focuses on enabling our platform to scale efficiently to support new business areas as well as position ourselves to be able to reach 100s of millions of new members as the Netflix business grows around the world. We look for innovative ways to help the platform grow and scale for our needs.Security - focuses on keeping the Netflix service secure for our members as well as securing our corporate infrastructure for employees and contractors. Core DSE - is a software engineering heavy group which builds tooling for our Analysts, Data Scientists and Data Engineers enabling them to more effectively focus on business problems rather than boiler plate work needed to build, develop and deploy software at Netflix.
We are looking for a Senior Data Engineer to build reliable, distributed data pipelines and intuitive data products that allow our stakeholders to easily leverage data in an effective manner. As part of this team, you will work on diverse data technologies such as Spark, Presto, Flink, Kafka and others to build insightful, scalable and robust data pipelines; write ETL jobs to aggregate data; and build high quality data models that describe the entities and interactions and usage of Netflix’ cloud infrastructure like our compute platform (internally called Titus), our custom big data platform, productivity tools (like Spinnaker, etc), network traffic (AWS traffic, security group interactions, etc).The Infrastructure DSE organization supports the cloud compute platform, the demand and capacity planning, the networking infrastructure, the SRE function for Netflix.
The ideal candidate will have a strong background in distributed data processing, have great and demonstrable data intuition, and share our passion for continuously improving the ways we use data to make the Netflix infrastructure better.
Who you are:
- Passionate about building intuitive data models and an expert in distributed data processing patterns
- Highly proficient in at least one of Java, Python or Scala
- Comfortable with complex SQL
- Expert in engineering data pipelines using big data technologies (Hive, Presto, Spark, Flink etc...) on large scale data sets demonstrated through years of experience
- Understand the Data Lifecycle and concepts such as lineage, governance, privacy, retention, anonymity, etc
- Conceptually familiar with AWS cloud resources (S3, EC2, RDS etc)
- Excel at taking vague requirements and crystallizing them into scalable data solutions
- Excited about operating independently, demonstrating excellence, and learning new technologies and frameworks
What you will do:
- Engineer efficient, adaptable and scalable data pipelines to process structured and unstructured data
- Develop subject-matter expertise in the infrastructure domain at Netflix
- Act as a thought partner to the platform and/or security engineering team, understand their challenges, and make opinionated recommendations that empower them with data solutions to efficiently scale Netflix infrastructure and tools
- Maintain and rethink existing datasets and pipelines to service a wider variety of use cases
- Enable smart analytics by building robust, reliable, and useful data sets that can power various analytic techniques like regression, classification, clustering etc
- Join a stunning team of data experts with diverse skill set, and deliver excellent solutions that better enable our decision-making process
Our culture is unique and we live by our values, so it's worth learning more about Netflix at jobs.netflix.com/culture. We regularly share examples of our work on our tech blog at https://netflixtechblog.com. You will need to be comfortable working in the most agile of environments. Requirements will be vague. Iterations will be rapid. You will need to be nimble and take smart risks.
Tags: Agile AWS Big Data Classification Data pipelines EC2 Engineering ETL Excel Finance Flink Kafka Pipelines Python Scala Security Spark SQL Unstructured data
Perks/benefits: Career development
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Lead Data Analyst jobs
- Open Data Science Manager jobs
- Open MLOps Engineer jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Engineer II jobs
- Open Sr Data Engineer jobs
- Open Data Manager jobs
- Open Principal Data Engineer jobs
- Open Data Analytics Engineer jobs
- Open Power BI Developer jobs
- Open Product Data Analyst jobs
- Open Junior Data Scientist jobs
- Open Business Intelligence Developer jobs
- Open Data Scientist II jobs
- Open Senior Data Architect jobs
- Open Sr. Data Scientist jobs
- Open Business Data Analyst jobs
- Open Manager, Data Engineering jobs
- Open Big Data Engineer jobs
- Open Data Analyst Intern jobs
- Open Data Quality Analyst jobs
- Open Principal Data Scientist jobs
- Open Data Product Manager jobs
- Open Azure Data Engineer jobs
- Open Junior Data Engineer jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open GCP-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open Java-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open APIs-related jobs
- Open Deep Learning-related jobs
- Open PyTorch-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open TensorFlow-related jobs
- Open PhD-related jobs
- Open CI/CD-related jobs
- Open NLP-related jobs
- Open Kubernetes-related jobs
- Open Data governance-related jobs
- Open Airflow-related jobs
- Open Hadoop-related jobs
- Open Databricks-related jobs
- Open LLMs-related jobs
- Open Data warehouse-related jobs