Staff Software Engineer, ML Platform

San Francisco, CA

Applications have closed

Reddit

The front page of the internet

Reddit is a community of communities where people can dive into anything through experiences built around their interests, hobbies, and passions. Our mission is to bring community, belonging, and empowerment to everyone in the world. Reddit users submit, vote, and comment on content, stories, and discussions about the topics they care about the most. From pets to parenting, there’s a community for everybody on Reddit and with over 50 million daily active users, it is home to the most open and authentic conversations on the internet. For more information, visit redditinc.com.

“The front page of the internet,” Reddit brings over 430 million people together each month through their common interests, inviting them to share, vote, comment, and create across thousands of communities.

The Machine Learning Platform team at Reddit is a high-impact team that owns the infrastructure powering recommendations, content discovery, and user and content quantification. Its work directly affects other teams such as Growth, Ads, Feeds, and Core Machine Learning.

How You'll Have Impact:

As the 6th largest site on the internet, Reddit generates billions of events and terabytes of data every day. You will own projects from ideation to production instead of being stuck making small incremental gains on enterprise systems. We are looking for the best and the brightest Machine Learning Platform Engineers to join us in solving hard problems in order to enable products that millions of users will love, and ultimately bring community and belonging to Reddit’s users. We are a team of builders who value impact, personal growth, openness, and kindness.

What You’ll Do

You will be instrumental in architecting, implementing, and maintaining the foundational Machine Learning (ML) infrastructure that powers Feeds Ranking, Content Understanding, Recommendations, and much more to fulfill Reddit’s mission of bringing community and belonging to everyone in the world. You will build systems and tools that support machine learning engineers (MLEs) and data scientists (DSs) and continuously improve the ML software development lifecycle. You will deliver a self-service ML platform that enables the continuous iteration and improvement of systems that use ML techniques including Deep Learning, Natural Language Processing, Recommendation Systems, Representation Learning, and Computer Vision.

Responsibilities:

  • Provide technical leadership throughout the engineering lifecycle
  • Serve as the top escalation point for resolving complex technical issues
  • Identify and lead strategic initiatives to advance the ML infrastructure at Reddit
  • Guide other engineers in resolving complex technical design problems
  • Be the go-to expert in designing and building high-performance ML platform solutions that address bottlenecks in the model development lifecycle
  • Collaborate with other teams and translate requirements into reliable, scalable platform solutions
  • Set high standards for a rigorous DevOps approach to maintain and improve the health and quality of ML platform components and services

Requirements:

  • 7+ years of work experience in software development and ML or data infrastructure
  • 5+ years of experience building production-quality code, incorporating testing, evaluation, and monitoring, in object-oriented programming languages, e.g. Python, Scala, etc.
  • 2+ years acting as a leader and architect in the computation and data storage domains to evolve and refine ML infrastructure
  • Ability to lead, coach, and mentor other engineers 
  • Superior organizational skills and cross-functional collaboration
  • Superior communication skills

Pluses:

  • Expertise in the relevant technology stack, e.g. Go, Python, AWS, GCP, Kubernetes, Kubeflow, Airflow, and Ray
  • Experience designing and developing applications using a large-scale data stack, e.g. BigQuery, GraphQL, Kafka, Flink, Cassandra, Redis
  • Experience in recommendation, search engines, or content classification systems
  • Interest in advancing infrastructure for Deep Learning, NLP, or Computer Vision
  • Interest in open source development and working with applied researchers in the advancement of ML Systems

Benefits: 

  • Comprehensive Health benefits
  • 401k Matching 
  • Workspace benefits for your home office
  • Personal & Professional development funds
  • Family Planning Support
  • Flexible Vacation & Reddit Global Days Off
  • 4+ months paid Parental Leave  
  • Paid Volunteer time off

Pay Transparency:

This job posting may span more than one career level.

In addition to base salary, this job is eligible to receive equity in the form of restricted stock units, and depending on the position offered, it may also be eligible to receive a commission. Additionally, Reddit offers a wide range of benefits to U.S.-based employees, including medical, dental, and vision insurance, 401(k) program with employer match, generous time off for vacation, and parental leave. To learn more, please visit https://www.redditinc.com/careers/.

To provide greater transparency to candidates, we share base pay ranges for all US-based job postings regardless of state. We set standard base pay ranges for all roles based on function, level, and country location, benchmarked against similar stage growth companies. Final offer amounts are determined by multiple factors, including skills, depth of work experience, and relevant licenses/credentials, and may vary from the amounts listed below.

The base pay range for this position is: $198,200 - $297,300.

Reddit is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation due to a disability, please contact us at ApplicationAssistance@Reddit.com.

Regions: Remote/Anywhere North America
Country: United States
