Data Engineer
Poland - Krakow Office
We are on an exciting data journey, aiming to transform our business through the connectivity of our machines and the MyDyson app.
About the role
The Lead Data Engineer will be responsible for developing, constructing, testing, and maintaining data pipelines, ensuring optimal data delivery architecture through ongoing projects. This role requires a deep understanding of data architecture, data engineering, and data analysis, as well as hands-on experience in orchestrating data flows.
You will play an important part in maintaining and improving our Connected Intelligence Platform, Connected Control Tower, and Data Science toolset to ensure that the answers to crucial questions can either be self-served or provided through modelling and investigation of the data.
Responsibilities
Lead and take complete responsibility for data projects from start to finish.
Create and maintain data pipelines to extract, transform, and load data from various sources into a central repository for analysis.
Work closely with data scientists, analysts, and business stakeholders to understand their data needs and implement solutions that enable efficient data analysis.
Ensure data is accurate, complete, consistent, and accessible by performing data quality checks, tracking data sequence, and applying data security measures.
Use cloud computing platforms such as Google Cloud Platform (GCP) to deploy and scale analytics solutions, and manage the scalability, security, and cost optimization of the data infrastructure.
Optimize the performance of data storage and retrieval systems, data pipelines, and machine learning models to ensure they can handle the volume and complexity of data.
Use programming languages and tools such as Python and SQL to implement data pipelines, data models, and machine learning models.
Keep up to date with the latest technologies and trends in data engineering, machine learning, and analytics, and continually seek opportunities to improve the organization’s data infrastructure and analytics capabilities.
Build and maintain relationships with external partners, such as Google, to ensure we fully use their toolset and remain at the forefront of data technology.
Qualifications
Proven experience as a Senior Data Engineer, Data & Analytics Engineer, or similar role.
Strong proficiency in at least one major programming language (e.g., Java, Scala, Python) and comfort working with SQL to implement data pipelines, data models, and machine learning models.
Strong background in at least one of the following: distributed data processing or software engineering of data services.
Experience with database technologies (relational, NoSQL) and dimensional data modeling techniques and best practices.
Experience with big data tools and data pipeline orchestration tools (e.g., Apache Airflow); experience with dbt is a plus.
Experience with version control systems like Git.
Experience with cloud computing platforms like AWS, GCP, or Azure to deploy and scale solutions.
Experience with industry-standard visualization and Business Intelligence tools such as Tableau, Looker, and Power BI.
Familiarity with Infrastructure as Code principles.
Benefits:
Financial:
Performance related bonus
Life Assurance
Pension scheme with competitive employer contributions
Recognition Program
Holiday Allowance
Lifestyle:
Free fruit delivered for office staff, free coffee and tea
Cafeteria Benefit – wellness programme, cinema tickets, Multisport card etc.
Health:
Medical: Employee cover + opportunity to buy additional cover for family
Employee Assistance Program for employee and dependents
Dyson is an equal opportunity employer. We know that great minds don’t think alike, and it takes all kinds of minds to make our technology so unique. We welcome applications from all backgrounds, and employment decisions are made without regard to race, colour, religion, national or ethnic origin, sex, sexual orientation, gender identity or expression, age, disability, protected veteran status, or any other dimension of diversity.