Senior Data Engineer, POI Data
Remote US
Mapbox
APIs and SDKs for AI-powered maps, location search, turn-by-turn navigation, and geospatial data in mobile or web apps. Get started for free.Maps are no longer static. Our maps represent the ever-evolving world, accessing, aggregating, and adapting anonymous data from millions of sensors and phones in real-time. Mapbox has the exciting opportunity to power devices and products across the next frontier in location-based data, such as Internet of Things and AR/VR.
Whether you’re watching the delivery of your grocery order on Instacart, looking at a gym on ClassPass, sending your snaps on Snap, tracking your personal best on Strava, monitoring your gas budget on Metromile, or checking today’s forecast on The Weather Channel, Mapbox is the location and maps within those apps. We’re changing how people move by live-mapping the world. We are the developer platform for location.
What We Do
The Points of Interest Data (POI) team at Mapbox is responsible for providing a complete, accurate collection of the places people want to go, with all the information users need to get there. We handle the problem end-to-end. On the data engineering side, this means sourcing input datasets and deduplicating them to produce a whole greater than the sum of its parts. On the information retrieval front, we operate our POI Search API and infrastructure, powering search requests from thousands of customers and hundreds of millions of end-users every month.
The POI team works in a domain where precision and recall are carefully balanced. We deal with challenges in normalizing, comparing, aggregating, and canonicalizing our datasets. After we produce that data, we must respond to end-user queries, with high availability and low latency, with results that balance relevance in accordance to the query, proximity, user engagement, and notability of our POIs. To deliver an excellent end-user experience, we track progress by measuring our coverage, completeness, and accuracy, as well as industry-standard information retrieval metrics.
What You’ll Do
As a data engineer working on POI search quality, you’ll be responsible for, and expected to:
- Build and maintain high performance data pipelines that ingest and conflate POI data
- Improve industry-standard information retrieval metrics for POI search, across dozens of countries and languages
- Collaborate with the Address and Federation Search teams in their quality efforts
- Rigorously test your proposed changes
- Use your software development and operations experience to rigorously test your changes and release them uneventfully
What We Believe are Important Traits for This Role
- Ability to use software development experience to onboard onto the POI search problem space to make informed decisions of what to improve
- Background or interest in statistical methods and data science is helpful, or a desire to use your software engineering skills to further specialize in this area
- Experience building and maintaining ingestion data pipelines
- Experience with our tech stack (Spark, Airflow/Dagster, Python, and/or node.js, Docker, AWS) is a plus, as is experience operating applications on cloud-based big data platforms like AWS EMR
What We Value
In addition to our core values, which are not unique to this position and are necessary for Mapbox leaders:
- We value high-performing creative individuals who dig into problems and opportunities.
- We believe in individuals being their whole selves at work. We commit to this through supportive health care, parental leave, flexibility for the things that come up in life, and innovating on how we think about supporting our people.
- We emphasize an environment of teaching and learning to equip employees with the tools needed to be successful in their function and the company.
- We strongly believe in the value of growing a diverse team and encourage people of all backgrounds, genders, ethnicities, abilities, and sexual orientations to apply.
By applying for this position, you acknowledge that you have received the Mapbox Non-US Privacy Notice for applicants, which is linked here. Completing this application requires you to provide personal data, such as your name and contact information, which is mandatory for Mapbox to process your application.
Mapbox is an EEO Employer - Minority/Female/Veteran/Disabled/Sexual Orientation/Gender Identity
Tags: Airflow APIs AWS Big Data Dagster Data pipelines Docker Engineering Node.js Pipelines Python Spark VR
Perks/benefits: Career development Parental leave
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Lead Data Analyst jobs
- Open Data Science Manager jobs
- Open MLOps Engineer jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Manager jobs
- Open Data Engineer II jobs
- Open Power BI Developer jobs
- Open Principal Data Engineer jobs
- Open Sr Data Engineer jobs
- Open Data Analytics Engineer jobs
- Open Business Intelligence Developer jobs
- Open Junior Data Scientist jobs
- Open Data Scientist II jobs
- Open Product Data Analyst jobs
- Open Senior Data Architect jobs
- Open Sr. Data Scientist jobs
- Open Business Data Analyst jobs
- Open Big Data Engineer jobs
- Open Data Analyst Intern jobs
- Open Manager, Data Engineering jobs
- Open Azure Data Engineer jobs
- Open Junior Data Engineer jobs
- Open Data Quality Analyst jobs
- Open Data Product Manager jobs
- Open Principal Data Scientist jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open ML models-related jobs
- Open GCP-related jobs
- Open Data management-related jobs
- Open Java-related jobs
- Open Privacy-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open APIs-related jobs
- Open Deep Learning-related jobs
- Open PyTorch-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open TensorFlow-related jobs
- Open PhD-related jobs
- Open CI/CD-related jobs
- Open NLP-related jobs
- Open Kubernetes-related jobs
- Open Data governance-related jobs
- Open LLMs-related jobs
- Open Airflow-related jobs
- Open Hadoop-related jobs
- Open Data warehouse-related jobs
- Open Databricks-related jobs