Staff Machine Learning Engineer - Content Intelligence
New York City
Applications have closed
Spotify
We grow and develop and make wonderful things happen together every day. It doesn't matter who you are, where you come from, what you look like, or what music you love. Join the band!Music attribution at scale is one of the great unsolved technical problems of the music industry, and we’re building groundbreaking technology to solve it. Our goal is to solve this problem for the more than 60 million music tracks playable on Spotify, building a knowledge graph through machine learning models, deep domain expertise, and close integration with human-in-the-loop processes across Spotify and the industry. Content Platform’s catalog data powers Spotify experiences from Artist pages in the app, search and recommendations, human playlist curation, Spotify for Artists, and our music industry strategy!
We are looking for a Staff Engineer to help us define and build Spotify’s Music Knowledge Graph. The team is composed of product, machine learning, data and backend engineers, and domain experts who average 11 years behind the scenes in the music industry. We expand the state of the art in AI-based machine technology, which enables thoughtful, efficient, and intuitive ways to search, re-use, explore or process metadata. You will use world-class engineering and machine learning techniques on real-world, internal, and external big data to directly impact the evolution of our music catalog!
What you'll do
- Oversee and guide the design, development, and evolution of our knowledge graph ecosystem.
- Coordinate with Product and Engineering leadership to identify both the long-term and short-term needs of the knowledge graph.
- Build and deploy robust ML/DL models that improve entity extraction, classification, resolution, and disambiguation within the Music Knowledge Graph across multiple languages (e.g. English, Korean, etc.), time dimensions, and territories.
- Collaborate with data engineers, applied ML engineers, software engineering, data/content analysts, research scientists & front-end engineers to support tooling for an increasing number of Music Knowledge Graph use cases within Spotify.
- Collaborate with technical and non-technical business partners to develop analytics and metrics that describe the performance of matching systems and the quality of our data.
- As a multi-functional resource, you will have the opportunity to work on the problems where you are needed most, whether that is with an existing project or cutting a path for something new.
- Take on complex data-related problems involving some of the most diverse datasets available and determine the feasibility of projects through quick prototyping with respect to performance, quality, time, and cost using Agile methodologies.
- Architect best-in-class infrastructure (platforms, tools, and approaches) to accelerate our research to the production phase and to unblock efficient deployment, optimization, and testing of ML models.
- Be a leading voice in an active community of machine learning practitioners across Spotify and use existing state-of-the-art tooling in the Spotify ecosystem. (TensorFlow, Kubeflow, DataFlow, python-beam, Google Cloud Platform).
- Contribute to our team-wide product ideation in collaboration with other engineers, researchers, product managers, and subject-matter experts on the team.
- Your critical projects will involve building enriched canonical versions of the knowledge graph from discrete data sources.
Who you are
- Extensive experience working in a product and data-driven environment (Python, Scala, Java, SQL, or C++, with Python experience required) and cloud platforms (GCP or AWS).
- Understand storage solutions and when to use them (e.g. Graph Database, Cassandra, Relational database).
- Familiarity with Graph ML and graph learning problems & solutions (e.g., graph embedding and graph neural networks).
- Deep expertise in graph building, graph processing, graph querying, and graph analytics.
- You have experience architecting data pipelines and are self-sufficient in getting the data you need to build and evaluate models, using tools like Dataflow, Apache Beam, or Spark.
- Academic and/or proven experience in knowledge graphs, data management, natural language processing.
- Familiar with the industry trends and keep up with the latest product offerings, and can understand trade-offs of existing solutions.
- Have excellent communication skills and the ability to translate business intuition into data-driven hypotheses that result in impactful engineering solutions.
- Love your customers even more than your code.
- Have experience and passion for mentoring and encouraging collaborative teams.
- Have experience in encouraging a strong engineering culture in an agile environment.
Where you'll be
- We are a distributed workforce enabling our band members to find a work mode best for them!
- Where in the world? For this role, it can be within the Americas region in which we have a work location.
- Prefer an office to work from home instead? Not a problem! We have plenty of options for your working preferences. Find more information about our Work From Anywhere options here.
- Working hours? We operate within the Eastern Standard time zone for collaboration.
Spotify transformed music listening forever when we launched in 2008. Our mission is to unlock the potential of human creativity by giving a million creative artists the opportunity to live off their art and billions of fans the chance to enjoy and be passionate about these creators. Everything we do is driven by our love for music and podcasting. Today, we are the world’s most popular audio streaming subscription service with a community of more than 381 million users.
This position is not eligible to be performed in Colorado.
Tags: Agile AWS Big Data Cassandra Classification Dataflow Data management Data pipelines Engineering GCP Google Cloud Machine Learning ML models NLP Pipelines Prototyping Python Research Scala Spark SQL Streaming TensorFlow Testing
Perks/benefits: Flex vacation Team events
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Lead Data Analyst jobs
- Open MLOps Engineer jobs
- Open Data Science Manager jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Manager jobs
- Open Data Engineer II jobs
- Open Power BI Developer jobs
- Open Principal Data Engineer jobs
- Open Sr Data Engineer jobs
- Open Data Analytics Engineer jobs
- Open Business Intelligence Developer jobs
- Open Junior Data Scientist jobs
- Open Data Scientist II jobs
- Open Product Data Analyst jobs
- Open Senior Data Architect jobs
- Open Sr. Data Scientist jobs
- Open Business Data Analyst jobs
- Open Big Data Engineer jobs
- Open Data Analyst Intern jobs
- Open Manager, Data Engineering jobs
- Open Azure Data Engineer jobs
- Open Data Product Manager jobs
- Open Data Quality Analyst jobs
- Open Junior Data Engineer jobs
- Open Principal Data Scientist jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open GCP-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open Java-related jobs
- Open Privacy-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open APIs-related jobs
- Open Deep Learning-related jobs
- Open PyTorch-related jobs
- Open TensorFlow-related jobs
- Open PhD-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open NLP-related jobs
- Open CI/CD-related jobs
- Open Kubernetes-related jobs
- Open Data governance-related jobs
- Open Airflow-related jobs
- Open Hadoop-related jobs
- Open LLMs-related jobs
- Open Generative AI-related jobs
- Open Databricks-related jobs