Senior Data Engineer
London
Spotify
We grow and develop and make wonderful things happen together every day. It doesn't matter who you are, where you come from, what you look like, or what music you love. Join the band!
What You'll Do
- Build large-scale speech and audio data pipelines using frameworks like Google Cloud Platform and Apache Beam
- Work on machine learning projects powering new generative AI experiences and helping to build state-of-the-art text-to-speech models
- Learn and contribute to the team's understanding of best practices and techniques for building data pipelines for large-scale generative models, including cleaning, filtering, classifying and labelling
- Collaborate with other engineers, researchers, product managers and stakeholders, taking on learning and leadership opportunities that arise
- Deliver scalable, testable, maintainable, and high-quality code. Share knowledge and promote standard methodologies, making your team the best version of itself through mentorship and constructive accountability
Who You Are
- You have data engineering experience and you know how to work with high-volume, heterogeneous data, preferably with distributed systems such as Hadoop, Bigtable, Cassandra, GCP, AWS or Azure
- You have experience with one or more higher-level Python- or Java-based data processing frameworks such as Beam, Dataflow, Crunch, Scalding, Storm, Spark, Flink etc. You have strong Python programming abilities
- Experience using pre-trained ML models is a plus
- You might have worked with Docker as well as Luigi, Airflow, or similar tools
- You care about quality and you know what it means to ship high-quality code
- You have experience managing data retention policies
- You care about agile software processes, data-driven development, reliability, and responsible experimentation
- You understand the value of collaboration and partnership within teams
Where You'll Be
- This role is located in London, UK or Stockholm, Sweden
Tags: Agile Airflow AWS Azure Bigtable Cassandra Dataflow Data pipelines Distributed Systems Docker Engineering Flink GCP Generative modeling Google Cloud Hadoop Java Machine Learning ML models Pipelines Python Spark
Perks/benefits: Career development