Data Engineer (01086)
Oldsmar, FL
Nielsen
A global leader in audience insights, data and analytics, Nielsen shapes the future of media with accurate measurement of what people listen to and watch.
Data Science is at the core of Nielsen’s business. Our team of researchers come from diverse disciplines and they drive innovation, new product ideation, experimental design and testing, complex analysis and delivery of data insights around the world. We support all International Media clients and are located where our clients are.
DESCRIPTIONConduct one-on-one meetings with the manager and team meetings to gather the information needed for developing new software or code. Analyze methodology documents to understand client requirements. Design and develop software and related database infrastructure that will feed into the media data lake (ecosystem) through Nielsen’s data flow processes. Conduct code review with senior developers before pushing the code into production. Support analytics performed by the Data Science team by creating or modifying and then deploying and supporting software applications and data infrastructure to respond to custom analytics requests and client inquiries, and incorporate changing analytical and statistical methodologies, standards and best practices. Combine software engineering expertise with knowledge of Nielsen TV measurement tools and services. Create job flow, automate the jobs and run jobs. Implement data pipeline Directed Acyclic Graphs (DAGs) and maintenance DAGs. Configure and setup DAGs based on the data to run Spark commands in parallel and sequential. Create hooks, as well as Python, Bash, Spark and custom operators. Schedule a pipeline to run the DAG to fetch data for loading into a database. Collaborate with cross-functional Nielsen Media teams to develop and implement customized software solutions that will validate enhancements to the universe estimate methodologies. Perform unit testing using test cases and fix any bugs. Write and run test scripts for Media and Annual Estimate data. Run test scripts for data generated from team members. Validate the local Return Path Data Universe Estimates (RPD UEs). Collaborate with Data Scientists by leveraging software engineering expertise to identify and address software development and data quality issues, and detect and resolve quality escapes. Perform detailed analysis of data quality issues, report to other teams on significant data changes, and rectify the problem. Respond to internal data analytics inquiries by determining how to execute the analyses with the platforms and applications available, modifying specifications as needed, providing an estimated delivery date, and accurately delivering data-driven analyses on time. Oversee (without supervisory authority) 2-3 data scientists. Involves domestic travel, 1-2 times/year for 1-2 days/trip. Involves the opportunity to telecommute from within the Oldsmar, FL area for up to 3 days/week, as feasible. Tools used: Python, PySpark, SQL, Databricks, Intelligence Studio, EC2, Airflow and Tableau. QUALIFICATIONSMinimum requirements: Bachelor’s degree in computer science, engineering or a related field with an information technology focus plus 2 years of experience in software design, development and testing. This must include 2 years of experience in/with: solving complex design and coding problems; SQL and large-scale databases; UNIX; writing Python code, and converting SAS and/or C code to Python code; creating and loading tables in the Nielsen Media Data Lake, and migrating tables from 1.0 to 2.0; creating RPD (Return Path Data) universe estimates, and updating and maintaining the methodology with any new additions; and producing cluster analyzes for exclusion of Broadband Only UEs and RPD. 1 year of experience in/with: creating S3 buckets in the AWS environment to store and retrieve files, using Iterative Proportional Fitting (IPV); and designing databases. Willingness to travel domestically 1-2 times/year for 1-2 days/trip. About Nielsen: Nielsen is committed to hiring and retaining a diverse workforce. We are proud to be an Equal Opportunity/Affirmative Action-Employer, making decisions without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability status, age, marital status, protected veteran status or any other protected class. #LI-DNI#IND-DNS
DESCRIPTIONConduct one-on-one meetings with the manager and team meetings to gather the information needed for developing new software or code. Analyze methodology documents to understand client requirements. Design and develop software and related database infrastructure that will feed into the media data lake (ecosystem) through Nielsen’s data flow processes. Conduct code review with senior developers before pushing the code into production. Support analytics performed by the Data Science team by creating or modifying and then deploying and supporting software applications and data infrastructure to respond to custom analytics requests and client inquiries, and incorporate changing analytical and statistical methodologies, standards and best practices. Combine software engineering expertise with knowledge of Nielsen TV measurement tools and services. Create job flow, automate the jobs and run jobs. Implement data pipeline Directed Acyclic Graphs (DAGs) and maintenance DAGs. Configure and setup DAGs based on the data to run Spark commands in parallel and sequential. Create hooks, as well as Python, Bash, Spark and custom operators. Schedule a pipeline to run the DAG to fetch data for loading into a database. Collaborate with cross-functional Nielsen Media teams to develop and implement customized software solutions that will validate enhancements to the universe estimate methodologies. Perform unit testing using test cases and fix any bugs. Write and run test scripts for Media and Annual Estimate data. Run test scripts for data generated from team members. Validate the local Return Path Data Universe Estimates (RPD UEs). Collaborate with Data Scientists by leveraging software engineering expertise to identify and address software development and data quality issues, and detect and resolve quality escapes. Perform detailed analysis of data quality issues, report to other teams on significant data changes, and rectify the problem. Respond to internal data analytics inquiries by determining how to execute the analyses with the platforms and applications available, modifying specifications as needed, providing an estimated delivery date, and accurately delivering data-driven analyses on time. Oversee (without supervisory authority) 2-3 data scientists. Involves domestic travel, 1-2 times/year for 1-2 days/trip. Involves the opportunity to telecommute from within the Oldsmar, FL area for up to 3 days/week, as feasible. Tools used: Python, PySpark, SQL, Databricks, Intelligence Studio, EC2, Airflow and Tableau. QUALIFICATIONSMinimum requirements: Bachelor’s degree in computer science, engineering or a related field with an information technology focus plus 2 years of experience in software design, development and testing. This must include 2 years of experience in/with: solving complex design and coding problems; SQL and large-scale databases; UNIX; writing Python code, and converting SAS and/or C code to Python code; creating and loading tables in the Nielsen Media Data Lake, and migrating tables from 1.0 to 2.0; creating RPD (Return Path Data) universe estimates, and updating and maintaining the methodology with any new additions; and producing cluster analyzes for exclusion of Broadband Only UEs and RPD. 1 year of experience in/with: creating S3 buckets in the AWS environment to store and retrieve files, using Iterative Proportional Fitting (IPV); and designing databases. Willingness to travel domestically 1-2 times/year for 1-2 days/trip. About Nielsen: Nielsen is committed to hiring and retaining a diverse workforce. We are proud to be an Equal Opportunity/Affirmative Action-Employer, making decisions without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability status, age, marital status, protected veteran status or any other protected class. #LI-DNI#IND-DNS
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Airflow AWS Computer Science Data Analytics Databricks EC2 Engineering PySpark Python SAS Spark SQL Tableau Testing
Region:
North America
Job stats:
1
0
0
Category:
Engineering Jobs
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open AI Engineer jobs
- Open Data Science Manager jobs
- Open Senior Business Intelligence Analyst jobs
- Open MLOps Engineer jobs
- Open Data Manager jobs
- Open Data Engineer II jobs
- Open Power BI Developer jobs
- Open Sr Data Engineer jobs
- Open Principal Data Engineer jobs
- Open Data Analytics Engineer jobs
- Open Business Intelligence Developer jobs
- Open Data Scientist II jobs
- Open Junior Data Scientist jobs
- Open Product Data Analyst jobs
- Open Senior Data Architect jobs
- Open Business Data Analyst jobs
- Open Sr. Data Scientist jobs
- Open Big Data Engineer jobs
- Open Data Analyst Intern jobs
- Open Manager, Data Engineering jobs
- Open Azure Data Engineer jobs
- Open Junior Data Engineer jobs
- Open Data Product Manager jobs
- Open Data Quality Analyst jobs
- Open Principal Data Scientist jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open ML models-related jobs
- Open GCP-related jobs
- Open Data management-related jobs
- Open Java-related jobs
- Open Privacy-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open APIs-related jobs
- Open Deep Learning-related jobs
- Open PyTorch-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open TensorFlow-related jobs
- Open PhD-related jobs
- Open CI/CD-related jobs
- Open Kubernetes-related jobs
- Open NLP-related jobs
- Open Data governance-related jobs
- Open LLMs-related jobs
- Open Airflow-related jobs
- Open Data warehouse-related jobs
- Open Hadoop-related jobs
- Open Databricks-related jobs