Senior Data Engineer

Madrid

Daniel J Edelman Holdings

Edelman is an award-winning global public relations consultancy firm. We partner with businesses to evolve, promote and protect their brands and reputations.

View company page

We currently seeking a Senior Data Engineer with 5-7 years’ experience. The ideal candidate would have the ability to work independently within an AGILE working environment and have experience working with cloud infrastructure leveraging tools such as Apache Airflow, Databricks, DBT and Snowflake. A familiarity with real-time data processing and AI implementation is advantageous.  

Responsibilities:

  • Design, build, and maintain scalable and robust data pipelines to support analytics and machine learning models, ensuring high data quality and reliability for both batch & real-time use cases.
  • Design, maintain, optimize data models and data structures in tooling such as Snowflake and Databricks.
  • Leverage Databricks for big data processing, ensuring efficient management of Spark jobs and seamless integration with other data services.
  • Utilize PySpark and/or Ray to build and scale distributed computing tasks, enhancing the performance of machine learning model training and inference processes.
  • Monitor, troubleshoot, and resolve issues within data pipelines and infrastructure, implementing best practices for data engineering and continuous improvement.
  • Diagrammatically document data engineering workflows.
  • Collaborate with other Data Engineers, Product Owners, Software Developers and Machine Learning Engineers to implement new product features by understanding their needs and delivery timeously. 

Qualifications:

  • Minimum of 3 years experience deploying enterprise level scalable data engineering solutions.
  • Strong examples of independently developed data pipelines end-to-end, from problem formulation, raw data, to implementation, optimization, and result.
  • Proven track record of building and managing scalable cloud-based infrastructure on AWS (incl. S3, Dynamo DB, EMR).
  • Proven track record of implementing and managing of AI model lifecycle in a production environment.
  • Experience using Apache Airflow (or equivalent) , Snowflake, Lucene-based search engines.
  • Experience with Databricks (Delta format, Unity Catalog).
  • Advanced SQL and Python knowledge with associated coding experience.
  • Strong Experience with DevOps practices for continuous integration and continuous delivery (CI/CD).
  • Experience wrangling structured & unstructured file formats (Parquet, CSV, JSON).
  • Understanding and implementation of best practices within ETL end ELT processes.
  • Data Quality best practice implementation using Great Expectations.
  • Real-time data processing experience using Apache Kafka Experience (or equivalent) will be advantageous.
  • Work independently with minimal supervision.
  • Takes initiative and is action-focused.
  • Mentor and share knowledge with junior team members.
  • Collaborative with a strong ability to work in cross-functional teams.
  • Excellent communication skills with the ability to communicate with stakeholders across varying interest groups.
  • Fluency in spoken and written English.
#LI-RT9
Edelman Data & Intelligence (DXI) is a global, multidisciplinary research, analytics and data consultancy with a distinctly human mission.
We use data and intelligence to help businesses and organizations build trusting relationships with people: making communications more authentic, engagement more exciting and connections more meaningful.
DXI brings together and integrates the necessary people-based PR, communications, social, research and exogenous data, as well as the technology infrastructure to create, collect, store and manage first-party data and identity resolution. DXI is comprised of over 350 research specialists, business scientists, data engineers, behavioral and machine-learning experts, and data strategy consultants based in 15 markets around the world.
To learn more, visit: https://www.edelmandxi.com
Apply now Apply later
  • Share this job via
  • or

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Tags: Agile Airflow AWS Big Data CI/CD CSV Databricks Data pipelines Data quality Data strategy dbt DevOps ELT Engineering ETL JSON Kafka Machine Learning ML models Model training Parquet Pipelines PySpark Python Research Snowflake Spark SQL

Region: Europe
Country: Spain
Job stats:  5  1  0
Category: Engineering Jobs

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.