Senior Analytics Data Engineer

San Francisco or Remote

Trunk

Automated Code Quality for Teams : universal formatting, linting, static analysis, and security

View company page

At Trunk, our mission is to help teams create high-quality software quickly. Merge conflicts, poor code quality or consistency, flaky tests, and dozens of other distractions quickly drain the productivity and morale of those teams. Engineering teams that can stay focused on designing, implementing, and delivering software will build magical, high-quality projects - and they will be happier doing it. We're building the tools that empower teams to land code faster and develop happier.
We are building the foundation for a modern software engineering team. Our founders started this journey in 2021 and have designed, delivered, and scaled software at some of the world's largest and fastest-growing tech companies - Uber, Google, YouTube, and Microsoft. We're building a game-changing company, and we hope you are excited to be a part of that audacious goal.
Software has eaten the world; almost every company produces software in some form or fashion, so our addressable market is virtually every company on earth. We're going after every engineering team on the planet - we're starting with smaller teams, but there are literally hundreds of thousands of companies out there for us to empower and maybe only a handful (Google, Facebook, Amazon), that are outside our scope. We are building the DevEx platform to empower the world.
In 2022, we raised a $25M Series A led by Initialized Capital (Garry Tan) and a16z (Peter Levine), with investments from Haystack Ventures, Garage VC, Tom Preston Warner (Founder/CEO of GitHub), Geoff Schmidt (Founder/CEO Apollo GraphQL), Nicolas Dessaigne (Founder/CEO Algolia), and Oleg Rognysky (Founder/CEO Peopl.ai).

What you'll do 🧑‍💻

  • Build data pipelines, text analysis algorithms, query engines, and decision making engines
  • Apply robust and fault-tolerant approaches to create scalable ingestion and data-processing systems
  • Debug, profile and optimize distributed data-intensive applicating, improving their latency, accuracy, resource consumption, and throughput
  • Work with existing applications built with Spark, S3, Timescale, Python and Rust
  • Directly implement services and features that leverage the results of your data pipelineImplement and improve machine learning and data pipelines

We're looking for 🔎

  • 5+ years of experience as an engineer with a strong understanding of key concepts in distributed systems
  • 3+ years of extensive experience in building and deploying data applications
  • Fluency in at least one, and ideally more than one, of these languages: Java/Scala/Kolin, Python, Go, Rust, or C++
  • Good understanding of following concepts: partitioning, replication, map-reduce, indexing, and CAP
  • Experience with distributed storage systems (S3, HDFS, Hive, ClickHouse, Elastic, etc), distributed processing engines (Spark, etc), and message queues (Kafka, SQS, etc)
  • Passion for building large-scale ML applications and improving software engineers' productivity
  • Some understanding of key concepts in natural language processing, machine learning, or statistical analysis
  • Some experience with machine learning stack (pandas, PyTorch, numpy, sci-kit, transformers, etc)

What we offer 🎁

  • Unlimited PTO
  • Competitive salary and equity
  • Work-life balance
  • Flexibility to be fully or partly remote
  • Few meetings, so you can ship fast and focus on building
  • One Medical membership on us!
  • Top-notch medical, dental, vision, short-term disability, long-term disability, and life insurance
  • All insurance is 100% company-paid ($0 premiums) for employees and highly subsidized for dependants
  • FSA, HSA with company contributions, and pre-tax commuter benefits
  • 401(k) plan
  • Paid parental leave ( up to 12 weeks)
The salary and equity range for this role are: $170K - $210K and .15% - .35%.
Don’t meet every single requirement? At Trunk, we are dedicated to building a diverse and inclusive workplace, so if you’re excited about this role but your past experience doesn’t align perfectly with every qualification in the job description, we encourage you to apply anyways. You may be just the right candidate for this or other roles.
If you need assistance or an accommodation due to a disability, we're happy to help accommodate. Please contact us at recruiting@trunk.io.
Apply now Apply later
  • Share this job via
  • or

Tags: Data pipelines Distributed Systems Engineering GitHub GraphQL Haystack HDFS Java Kafka Machine Learning NLP NumPy Pandas Pipelines Python PyTorch Rust Scala Spark Statistics Transformers

Perks/benefits: Career development Competitive pay Equity Health care Insurance Medical leave Parental leave Unlimited paid time off

Regions: Remote/Anywhere North America
Country: United States
Job stats:  12  2  0

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.