Staff Software Engineer, Trust Data Engineering
San Francisco, CA
Airbnb
Airbnb is a mission-driven company dedicated to helping create a world where anyone can belong anywhere. It takes a unified team committed to our core values to achieve this goal. Airbnb's various functions embody the company's innovative spirit and our fast-moving team is committed to leading as a 21st-century company.
What is Data Engineering at Airbnb?
We need to ensure every area of the business has trustworthy data to fuel insight and innovation. Understanding the business need, securing the right data sources, designing usable data models, and building robust & dependable data pipelines are essential skills to meet this goal.
At the same time, the technology used to create great data is continually evolving. We are moving to a reality where both batch & stream processing are leveraged to meet the latency requirements for the business. The Data Engineering paved path is still taking shape, and we want to collaboratively develop this to support the entire company. We need senior engineers who are passionate not only about the data, but also about improving the technology we leverage for Data Engineering.
We are looking for talented senior Data Engineers who are excited about redefining what it means to do Data Engineering. Data Engineering is part of our Engineering org as we believe great Data Engineering depends on solid Software Engineering fundamentals. However, we also recognize that each Data Engineer has a unique blend of skills. Whether your strength is in data modeling or in stream processing, we want to talk to you.
What is Trust Fraud Detection at Airbnb?
Over two million people stay on Airbnb every night, and the Trust Engineering team keeps our hosts and guests safe and supported throughout the entire Airbnb experience.
As part of the Trust Data Engineering team, you will be in charge of designing and building scalable and robust data processing systems to detect and mitigate fraud across our entire platform. You will be deeply involved in the technical details of building highly available and real-time risk detection data applications in close collaboration with product, data science and operations teams to understand ever-evolving attack vectors and to make Airbnb the safest and most trusted community.
Projects:
- Build real-time data pipelines that power the data foundations for fraud detection, covering cases such as account takeover (ATO), newly added payout methods on accounts with a history of ATO, ghosted users, etc.
- Design and implement a unified data interface to facilitate data access across various data warehouses (e.g., Hive, Presto, StarRocks) for Trust applications and fraud management tooling
- Provide an end-to-end near real-time alerting framework to automate fraud trend detection
- Collaborate with Operations, Data Science, and engineering teams to design and build an offline rule management system to enable creation, testing, and execution of offline rules used for async fraud detection and mitigation
- Define a domain specific language (DSL) to support an intuitive query builder user interface to achieve low-code/no-code queries for fraud alerting, actioning, and investigative tooling
- Design and implement a data validation framework for fraud rule backtesting and automate alert efficacy analysis
- Standardize fraud ground truth data and labels to power critical fraud metrics, automate rule efficacy analysis, and improve fraud detection across Trust.
Minimum Requirements:
- 10+ years of relevant software development industry experience in a fast-paced, high-growth tech environment.
- Bachelor’s in CS or a related field; Master’s or PhD preferred.
- Excellent technical background in big data, including Spark/Flink/Hive/SQL, and strong programming experience with Java/Scala/Python/Airflow, etc.
- Strong communication skills and the ability to work well within a team and across engineering teams.
- Strong sense of ownership, with experience building and operating high-scale, distributed systems across the full software life cycle.
- Passionate about system efficiency, availability, quality and scalability.
- Related experience in fraud detection or Trust & Safety is a plus.
- Ability to influence and have an impact.