Data Engineer (Remote)

San Francisco

Applications have closed

AllTrails

Search over 400,000 trails with trail info, maps, detailed reviews, and photos curated by millions of hikers, campers, and nature lovers like you.

View company page

Data Engineer About AllTrails
AllTrails is the most trusted and used outdoors platform in the world. We help people explore the outdoors with hand-curated trail maps along with photos, reviews, and user recordings crowdsourced from our community of millions of registered hikers, mountain bikers, and trail runners in 150 countries. AllTrails is frequently ranked as a top-5 Health and Fitness app and has been downloaded by over 40 million people worldwide.
Every day, we solve incredibly hard problems so that we can get more people outside having healthy, authentic experiences and a deeper appreciation of the outdoors. Join us!  

What You’ll Be Doing:

  • Work cross-functionally to ensure the company has access to clean, reliable, and secure data required to make informed business decisions and provide the backbone for new product features
  • Build, deploy, and orchestrate large-scale batch and stream data pipelines to transform and move data to/from our data warehouse and third-party systems
  • Deliver scalable, testable, maintainable, and high-quality code
  • Investigate, test-for, monitor, and alert on inconsistencies in our data, data systems, or processing costs
  • Create tools to improve data discoverability and documentation
  • Ensure data collection and storage adheres to GDPR and other privacy and legal compliance requirements
  • Uphold best data-quality standards and practices, promoting such knowledge throughout the organization
  • Work closely with a small tight-knit data team consisting of data analysts and scientists, providing the team with engineering expertise

Requirements:

  • 3+ years of experience in a Data Engineering role
  • Experience with one or more high-level non-SQL-based data processing languages (e.g. Python, Java, Ruby)
  • Proficiency with SQL and experience working with high volume datasets in SQL-based warehouses such as BigQuery, Redshift, Snowflake, or others
  • Deep understanding of data modeling, access, storage, caching, replication, and optimization techniques
  • Ability to orchestrate data pipelines through tools such as Apache Airflow
  • Experienced in container orchestration (e.g. Docker)
  • Understanding of the software development lifecycle and CI/CD
  • Monitoring and metrics-gathering (Datadog, NewRelic, Cloudwatch, etc)
  • Proficiency with git
  • Excellent documentation skills
  • Self-motivation and a deep sense of pride in your work
  • Passion for the outdoors
  • Comfort with ambiguity, and an instinct for moving quickly
  • Humility, empathy and open-mindedness - no egos

Bonus Points:

  • Experience with parallelized data processing frameworks such as Apache Beam, Apache Spark, Google Dataflow, AWS Glue, etc
  • Infrastructure-as-code, such as Terraform and basic dev-op principles
  • Moving data within a Multi-Cloud architecture (AWS and Google)
  • Experience with ELT tools such as Fivetran, dbt or Dataform
  • Experience with feature collection for ML systems
  • Software development experience with Ruby on Rails

Our Commitment to You:

  • A competitive and equitable compensation plan. This is a full-time, salaried position that includes equity
  • Physical & mental well-being including health, dental and vision benefits + a monthly stipend for wellness expenses
  • Trail Days: First Friday of each month off to hit the trails!
  • Unlimited PTO
  • Flexible parental leave
  • Remote employee equipment stipend to create a great remote work environment
  • Annual continuing education stipend
  • Discounts on subscription and merchandise for you and your friends & family
  • An authentic investment in you as a human being and your career as a professional
Nature celebrates you just the way you are and so do we! At AllTrails we’re passionate about nurturing an inclusive workplace that values diversity. It’s no secret that companies that are diverse in background, age, gender identity, race, sexual orientation, physical or mental ability, ethnicity, and perspective are proven to be more successful. We’re focused on creating an environment where everyone can do their best work and thrive.

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Tags: Airflow AWS BigQuery CI/CD Dataflow Data pipelines Docker ELT Engineering FiveTran Git Machine Learning Pipelines Python Redshift Ruby Snowflake Spark SQL Terraform

Perks/benefits: Career development Competitive pay Equity Fitness / gym Flex hours Flex vacation Health care Home office stipend Parental leave Salary bonus Unlimited paid time off Wellness

Regions: Remote/Anywhere North America
Country: United States
Job stats:  13  10  0
Category: Engineering Jobs

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.