Data Engineer - Content Platform

New York, NY

Applications have closed

Spotify

We grow and develop and make wonderful things happen together every day. It doesn't matter who you are, where you come from, what you look like, or what music you love. Join the band!

Spotify is looking for a Data Engineer to join the Content Platform team! Content Platform is a central enabler for Spotify. Together, our teams ensure that Spotify has a complete, available, and enriched catalog of music, podcasts, videos, and more. Sitting at the intersection of our consumer and creator offerings, we use a combination of cutting-edge machine learning, crowd-sourced wisdom, and deep industry expertise to discover and organize structured information about the world of audio. In doing so, we help Spotify make well-informed decisions and build impactful products.

We are looking for a Data Engineer to join our team that is working on applying and experimenting with state-of-the-art machine learning techniques to improve the quality of Spotify’s catalog content. As an engineer on the team, you will not only coordinate and influence the direction within the team but also work with multiple stakeholders to drive and deliver Data and ML-centric products.

What you’ll do

  • Build large-scale batch and real-time testable data pipelines in Scala and Python in support of ML systems, using data processing frameworks like Scio, Beam, Spark, Storm, and Scalding, running in the cloud via Google Cloud Platform.
  • Design, develop, and deploy backend services in Java with a focus on high availability, robustness, and monitoring.
  • Use best practices in continuous integration and delivery.
  • Help drive optimization, testing, and tooling to improve data and systems quality.
  • Work in multi-functional agile teams to continuously experiment, iterate, and deliver on new product objectives.
  • Take operational responsibility for the services that are owned by your team.
  • Work in an environment that supports your individual growth by providing you with ambitious tasks to tackle and the time needed to acquire new skills.

Who you are

  • Have 3+ years of professional experience working in a product-facing environment.
  • Worked with high-volume, heterogeneous data, preferably with distributed systems such as Hadoop, Bigtable, and Cassandra.
  • Experienced in writing distributed, high-volume services in Java or Scala.
  • Have an understanding of system design, data structures, and algorithms.
  • Knowledgeable about data modeling, data access, and data storage techniques.
  • Deployed and operated services in a cloud environment such as GCP or AWS.
  • Enjoy close collaboration with web engineers and are passionate about the software architecture across the backend, the frontend, and the APIs that glue them together.
  • Care about agile software processes, data-driven development, reliability, and responsible experimentation.

Where you'll be

  • We are a distributed workforce enabling our band members to find a work mode best for them!
  • Where in the world? For this role, it can be anywhere within the Americas region in which we have a work location.
  • Prefer an office to work from home instead? Not a problem! We have plenty of options for your working preferences. Find more information about our Work From Anywhere options here.
  • Working hours? We operate within the Eastern Standard Time zone for collaboration.

Spotify is an equal opportunity employer. You are welcome at Spotify for who you are, no matter where you come from, what you look like, or what’s playing in your headphones. Our platform is for everyone, and so is our workplace. The more voices we have represented and amplified in our business, the more we will all thrive, contribute, and be forward-thinking! So bring us your personal experience, your perspectives, and your background. It’s in our differences that we will find the power to keep revolutionizing the way the world listens.

Spotify transformed music listening forever when we launched in 2008. Our mission is to unlock the potential of human creativity by giving a million creative artists the opportunity to live off their art and billions of fans the chance to enjoy and be passionate about these creators. Everything we do is driven by our love for music and podcasting. Today, we are the world’s most popular audio streaming subscription service with a community of more than 345 million users.

Tags: Agile APIs AWS Bigtable Cassandra Data pipelines Distributed Systems GCP Google Cloud Hadoop Machine Learning Pipelines Python Scala Spark Streaming Testing

Perks/benefits: Career development

Region: North America
Country: United States
Category: Engineering Jobs
