Junior Data Engineer - Analytics (All Genders)
Paris, France
Applications have closed
Dailymotion
The latest news, sports, music and entertainment videos on Dailymotion
Company Description
Dailymotion is the leading video discovery destination & technology that learns about your tastes over time, constantly surfacing the best, most relevant content on the web. Our mission is to provide the best video user experience for consumers on the market, connecting publishers and advertisers to engaged viewers who turn to Dailymotion for their daily fix of the most compelling music, entertainment, news and sports content around.
Through partnerships with the world's leading publishers and content creators, France Télévisions, Le Parisien, CBS, Bein Sports, CNN, GQ, Universal Music Group, VICE and more, Dailymotion commands 3 billion monthly pageviews across its mobile app, desktop and connected TV experiences. Dailymotion is owned by Vivendi, one of the largest mass-media corporations in the world.
At Dailymotion, we’re storytellers. We build the best place for people to enjoy the videos that matter. We do this by using and developing cutting-edge technology and pushing the envelope to bring discoverable stories to life through premium content from the world’s best publishers. We do this by helping these publishers grow their audiences and monetize their content, their way.
Dailymotion is proud to be an equal employment opportunity and affirmative action employer. We value inclusion and we want you to help us strive for a more diverse community.
Job Description
Dailymotion is seeking a Data (Analytics) Engineer for the Analytics Engineering team.
You will join the Data Engineering & Machine Learning craft. A craft consists of multiple teams of engineers and machine learning experts who collaborate daily to create and run Data products at Dailymotion. Inside this craft, the Analytics Engineering team’s mission is to provide trustworthy and available data to enable analysis & insights throughout the company (B2C, B2B products, and business teams).
The Analytics Engineering team builds and maintains products like our multi-petabyte data warehouse, event processors (at tens of thousands of messages per second), highly scalable client-facing analytics, data ingestion & distribution, synchronization of data across databases & systems, etc. The team is responsible for making cost-performance trade-offs around data modeling & architecture. The team is also involved in training users of our data on SQL and analytics best practices and in spearheading a significant effort around data governance.
Analytics Engineering is a new and emerging space within the Data sphere. As an Analytics Engineer, you bring a software engineering mindset and best practices to maintaining analytics code and to modeling data from its source to its use in the data warehouse as business and reporting data. It requires a mix of programming skills and data skills on a day-to-day basis. If you are interested in solving challenging business problems with your skills, consider applying to this role. Your impact will be broad and across all of Dailymotion’s businesses.
What you will do:
Collect vast amounts of raw data from internal sources and external sources in batch and streaming modes.
Expose the data through APIs, flat files, data marts, etc., for internal and external users.
Design Druid datasets for external-facing consumers, optimizing for speed, consistency, cost, and efficiency.
Write complex and optimal SQL queries to transform data in our data lake into reliable business entities and then into reporting aggregates. Identify dependencies for these transformations. Schedule these transformations through Airflow.
Investigate data discrepancies and data quality issues. Debug performance issues using query plans.
Design BigQuery table data models that efficiently answer business use cases, considering cost and performance.
Ensure data is clean, consistent, and available. Perform data quality checks and create monitors.
Catalog and document the business entities, data marts, dimensions, metrics, business rules, etc.
Be a knowledge guide on the various business entities and data marts. Train users of our data on SQL and analytics best practices.
Come up with new tools, processes, documents and explore new tech during the cool-down periods.
Qualifications
BS/MS in Computer Science, Engineering or related field
2+ years of experience with Big Data, data warehousing, and writing and debugging complex SQL.
1+ years of experience developing and debugging software in Python.
Good business modeling skills: going from a stakeholder’s expressed requirements to an actual data model.
Ability to work with multiple stakeholders - Product, Engineers, Analysts, Product managers, DevOps, etc.
Comfortable working with Linux and the GCP stack
Experience with Pub/Sub, Dataflow, Data Processor, Airflow, Kafka, Spark, or other streaming technologies is a plus.
Experience in real-time analytics databases like Apache Druid is a plus.
Familiarity with NoSQL technologies such as Aerospike is a plus.
Writing and speaking proficiency in English
Technologies used by the team:
Google Cloud Platform (BigQuery, Cloud Storage, Beam/Dataflow, Compute Engine, etc.), Python, Go, Airflow, SQL, Git, Java, JSON, Bash, Docker, Druid, Kubernetes, etc.
Additional Information
At Dailymotion, we empower candidates to take action. If this job sounds like a great opportunity for you, be confident in your skills: we are always happy to meet you! If needed, we can adapt our recruitment process to accommodate your special abilities.
Location: Remote in France / Sophia Antipolis / Paris
Type of contract: Permanent
Start Date: ASAP
For the France offices 🇫🇷
🏡 Hybrid Work Framework (4 types of remote work: Full office / Flex office (1/2 days remote) / Flex remote (1/2 days at the office) / Full remote + ability to work 3 months abroad)
💰 International Group Savings Plan offered through the Vivendi Group
🍼 8 weeks paid Paternity leave or Co-parental leave
🕶️ Excellent Employee Culture (Company Events / Training / Parties / All hands …)
🚀 Career development support (training / career check-in with HR / internal mobility / compensation cycle / quarterly 360 feedback review …)
🏥 Company-paid Health Insurance and Personal Services Vouchers (CESU)
🚆 Commuter benefit coverage - Public Transport and Bike refund
⛱️ Paid Time off – RTT and Saving time plan (CET)
✅ Meal Vouchers
🎡 Workers' representatives committee (sports memberships / cinema vouchers / gift vouchers / discounts)
Feel free to explore Dailymotion culture a little further, please check out: