Sr. Data Engineer
United States
Nielsen
A global leader in audience insights, data and analytics, Nielsen shapes the future of media with accurate measurement of what people listen to and watch.
Nielsen Media would not function without our Technology teams! We are catalysts for delivery quality, on-time, reliable measurements to clients, and we are cultivators, growing our employees through education, skill building and experiences. Around the globe, our Technology teams are relentless in our pursuit of superior analytics, technology, process and support.
Nielsen‘s mission is to help marketers and media companies measure and improve advertising performance by accurately reporting on what consumers watch, see, or hear. In the Identity Platform Team, we are processing, in batch, billions of rows for the purpose of measuring our audience and calibrating against panelists. We have implemented this data pipeline with the following technologies:
Apache Spark (SQL) with GraphframesTrino (Presto)Apache AirflowAWS EMR, S3, and RDSScala, Python and Spark/Trino SQL.
So if you get excited by clustering in graphs, predicate pushdown in query execution, using typeclasses in scala and finding ways to make software less error-prone, then we have a job for you.
Do you want to move the industry forward with Nielsen? Our people are the driving force. Your thoughts, ideas and expertise can propel us forward. Whether you have fresh thinking around maximizing a new technology or you see a gap in the market, we are here to listen and take action. Our team is made strong by a diversity of thoughts, experiences, skills, and backgrounds. You’ll enjoy working with smart, fun, curious colleagues, who are passionate about their work. Come be part of a team that motivates you to do your best work!
Nielsen is committed to hiring and retaining a diverse workforce. We are proud to be an Equal Opportunity/Affirmative Action-Employer, making decisions without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability status, age, marital status, protected veteran status or any other protected class.
#LI-Remote#LI-BH1
Nielsen‘s mission is to help marketers and media companies measure and improve advertising performance by accurately reporting on what consumers watch, see, or hear. In the Identity Platform Team, we are processing, in batch, billions of rows for the purpose of measuring our audience and calibrating against panelists. We have implemented this data pipeline with the following technologies:
Apache Spark (SQL) with GraphframesTrino (Presto)Apache AirflowAWS EMR, S3, and RDSScala, Python and Spark/Trino SQL.
So if you get excited by clustering in graphs, predicate pushdown in query execution, using typeclasses in scala and finding ways to make software less error-prone, then we have a job for you.
Requirements
- Fluency in Scala as a functional programming language
- 5+ years of hands-on experience in server-side development using Java or Python
- 2+ years of hands-on experience making data pipelines with Apache Spark
- Degree in Engineering or Computer Science or in a quantitative field
Nice to have
- Understanding of graph algorithms
- Strong understanding of distributed systems design
- Contributes to Open Source Software
Do you want to move the industry forward with Nielsen? Our people are the driving force. Your thoughts, ideas and expertise can propel us forward. Whether you have fresh thinking around maximizing a new technology or you see a gap in the market, we are here to listen and take action. Our team is made strong by a diversity of thoughts, experiences, skills, and backgrounds. You’ll enjoy working with smart, fun, curious colleagues, who are passionate about their work. Come be part of a team that motivates you to do your best work!
Nielsen is committed to hiring and retaining a diverse workforce. We are proud to be an Equal Opportunity/Affirmative Action-Employer, making decisions without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability status, age, marital status, protected veteran status or any other protected class.
#LI-Remote#LI-BH1
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Clustering Computer Science Data pipelines Distributed Systems Engineering Open Source Pipelines Python Scala Spark SQL Streaming
Regions:
Remote/Anywhere
North America
Country:
United States
Job stats:
4
0
0
Category:
Engineering Jobs
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open MLOps Engineer jobs
- Open Lead Data Analyst jobs
- Open Data Science Manager jobs
- Open Data Manager jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Engineer II jobs
- Open Sr Data Engineer jobs
- Open Principal Data Engineer jobs
- Open Power BI Developer jobs
- Open Business Intelligence Developer jobs
- Open Junior Data Scientist jobs
- Open Data Analytics Engineer jobs
- Open Product Data Analyst jobs
- Open Data Scientist II jobs
- Open Sr. Data Scientist jobs
- Open Senior Data Architect jobs
- Open Business Data Analyst jobs
- Open Data Analyst Intern jobs
- Open Big Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Azure Data Engineer jobs
- Open Data Product Manager jobs
- Open Data Quality Analyst jobs
- Open Principal Data Scientist jobs
- Open Junior Data Engineer jobs
- Open GCP-related jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open ML models-related jobs
- Open Java-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open Deep Learning-related jobs
- Open PhD-related jobs
- Open APIs-related jobs
- Open TensorFlow-related jobs
- Open PyTorch-related jobs
- Open NLP-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open CI/CD-related jobs
- Open Kubernetes-related jobs
- Open LLMs-related jobs
- Open Generative AI-related jobs
- Open Data governance-related jobs
- Open Hadoop-related jobs
- Open Airflow-related jobs
- Open Docker-related jobs