Sr. Data Engineer
United States
Nielsen
A global leader in audience insights, data and analytics, Nielsen shapes the future of media with accurate measurement of what people listen to and watch.
Technology is where the brightest minds in data and engineering connect, providing the essential measurement that helps fuel media businesses. The work this team does today will change the entertainment of tomorrow–the way we watch and listen to what we love.
Nielsen‘s mission is to help marketers and media companies measure and improve advertising performance by accurately reporting on what consumers watch, see, or hear. In the Identity Platform Team, we are processing, in batch, billions of rows for the purpose of measuring our audience and calibrating against panelists. We have implemented this data pipeline with the following technologies:
Apache Spark (SQL) with GraphframesTrino (Presto)Apache AirflowAWS EMR, S3, and RDSScala, Python and Spark/Trino SQL
So if you get excited by clustering in graphs, predicate pushdown in query execution, using typeclasses in scala and finding ways to make software less error-prone, then we have a job for you.
Do you want to move the industry forward with Nielsen? Our people are the driving force. Your thoughts, ideas and expertise can propel us forward. Whether you have fresh thinking around maximizing a new technology or you see a gap in the market, we are here to listen and take action. Our team is made strong by a diversity of thoughts, experiences, skills, and backgrounds. You’ll enjoy working with smart, fun, curious colleagues, who are passionate about their work. Come be part of a team that motivates you to do your best work!
Nielsen is committed to hiring and retaining a diverse workforce. We are proud to be an Equal Opportunity/Affirmative Action-Employer, making decisions without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability status, age, marital status, protected veteran status or any other protected class.
#LI-Remote#LI-BH1
Nielsen‘s mission is to help marketers and media companies measure and improve advertising performance by accurately reporting on what consumers watch, see, or hear. In the Identity Platform Team, we are processing, in batch, billions of rows for the purpose of measuring our audience and calibrating against panelists. We have implemented this data pipeline with the following technologies:
Apache Spark (SQL) with GraphframesTrino (Presto)Apache AirflowAWS EMR, S3, and RDSScala, Python and Spark/Trino SQL
So if you get excited by clustering in graphs, predicate pushdown in query execution, using typeclasses in scala and finding ways to make software less error-prone, then we have a job for you.
Requirements
- Fluency in Scala as a functional programming language
- 5+ years of hands-on experience in server-side development using Java or Python
- 2+ years of hands-on experience making data pipelines with Apache Spark
- Degree in Engineering or Computer Science or in a quantitative field
Nice to have
- Understanding of graph algorithms
- Strong understanding of distributed systems design
- Contributes to Open Source Software
Do you want to move the industry forward with Nielsen? Our people are the driving force. Your thoughts, ideas and expertise can propel us forward. Whether you have fresh thinking around maximizing a new technology or you see a gap in the market, we are here to listen and take action. Our team is made strong by a diversity of thoughts, experiences, skills, and backgrounds. You’ll enjoy working with smart, fun, curious colleagues, who are passionate about their work. Come be part of a team that motivates you to do your best work!
Nielsen is committed to hiring and retaining a diverse workforce. We are proud to be an Equal Opportunity/Affirmative Action-Employer, making decisions without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability status, age, marital status, protected veteran status or any other protected class.
#LI-Remote#LI-BH1
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Clustering Computer Science Data pipelines Distributed Systems Engineering Open Source Pipelines Python Scala Spark SQL Streaming
Regions:
Remote/Anywhere
North America
Country:
United States
Job stats:
6
1
0
Category:
Engineering Jobs
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Lead Data Analyst jobs
- Open MLOps Engineer jobs
- Open Data Science Manager jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Manager jobs
- Open Data Engineer II jobs
- Open Power BI Developer jobs
- Open Principal Data Engineer jobs
- Open Sr Data Engineer jobs
- Open Data Analytics Engineer jobs
- Open Business Intelligence Developer jobs
- Open Junior Data Scientist jobs
- Open Data Scientist II jobs
- Open Product Data Analyst jobs
- Open Senior Data Architect jobs
- Open Sr. Data Scientist jobs
- Open Business Data Analyst jobs
- Open Big Data Engineer jobs
- Open Data Analyst Intern jobs
- Open Azure Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Data Product Manager jobs
- Open Data Quality Analyst jobs
- Open Junior Data Engineer jobs
- Open Principal Data Scientist jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open GCP-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open Java-related jobs
- Open Privacy-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open APIs-related jobs
- Open Deep Learning-related jobs
- Open PyTorch-related jobs
- Open TensorFlow-related jobs
- Open Snowflake-related jobs
- Open Consulting-related jobs
- Open PhD-related jobs
- Open CI/CD-related jobs
- Open NLP-related jobs
- Open Kubernetes-related jobs
- Open Data governance-related jobs
- Open Airflow-related jobs
- Open Hadoop-related jobs
- Open LLMs-related jobs
- Open Databricks-related jobs
- Open Generative AI-related jobs