Data Engineer - Remote
New York City
Nielsen
A global leader in audience insights, data and analytics, Nielsen shapes the future of media with accurate measurement of what people listen to and watch.
Nielsen Media would not function without our Technology teams! We are catalysts for delivering quality, on-time, reliable measurements to clients, and we are cultivators, growing our employees through education, skill building and experiences. Around the globe, our Technology teams are relentless in our pursuit of superior analytics, technology, process and support.
Nielsen's mission is to help marketers and media companies measure and improve advertising performance by accurately reporting on what consumers watch, see, or hear. In the Identity Platform Team, we batch-process billions of rows to measure our audience and calibrate against panelists. We have implemented this data pipeline with the following technologies:
- Apache Spark (SQL) with GraphFrames
- Trino (Presto)
- Apache Airflow
- AWS EMR, S3, and RDS
- Scala, Python and Spark/Trino SQL
So if you get excited by clustering in graphs, predicate pushdown in query execution, using typeclasses in Scala, and finding ways to make software less error-prone, then we have a job for you.
Requirements
- Fluency in Scala as a functional programming language
- 5+ years of hands-on experience in server-side development using Java or Python
- 2+ years of hands-on experience building data pipelines with Apache Spark
- Degree in Engineering, Computer Science, or another quantitative field
Nice to Have
- Understanding of graph algorithms
- Strong understanding of distributed systems design
- Contributions to open source software
Tags: Computer Science Data pipelines Distributed Systems Engineering Open Source Pipelines Python Scala Spark SQL
Region:
North America
Country:
United States
Category:
Engineering Jobs