Software Engineer, Data Platform
San Francisco, CA
Applications have closed
Benchling
Benchling is a cloud-based platform for biotechnology research and development and the only biology-first platform for scientific data, collaboration, and insights. Biotechnology is rewriting life as we know it, from the medicines we take, to the crops we grow, the materials we wear, and the household goods that we rely on every day. But moving at the new speed of science requires better technology.
Benchling’s mission is to unlock the power of biotechnology. The world’s most innovative biotech companies use Benchling’s R&D Cloud to power the development of breakthrough products and accelerate time to milestone and market.
Come help us bring modern software to modern science.
ROLE OVERVIEW
As one of Benchling's Data Platform engineers, you'll join a rapidly growing engineering team and form the foundation of our data pillar, encompassing customer-facing products, internal analytics, and the customer-facing data warehouse. You will build the next generation of our Data Platform services that enable internal developers to build multi-tenant data applications and analytical products easily. Benchling is growing quickly, and you'll set the bar for high-quality data and a metrics-driven culture as we scale. You'll serve as a key contributor and thought leader and work closely with the product teams to deliver data-driven capabilities to our internal and external customers.
RESPONSIBILITIES
- Own projects end-to-end, from initial design, to prototype, to large-scale rollout.
- Partner with product managers, architects, internal developers, and data analysts to ensure we build a Data Platform vision with high scalability, reliability, and usability.
- Build and operate OLTP and OLAP databases for complex scientific customer data and Benchling's internal data.
- Design and implement Data Platform services with API & SQL endpoints while ensuring that data integrity and data privacy are first-class concerns, addressed proactively rather than reactively.
- Design and create CI/CD pipelines for platform provisioning and full lifecycle management. Build the platform control panel to operate the fleet of systems efficiently.
- Work closely with teams across Application and Platform to establish Service Level Objectives and best practices around the usage of our Data Platform.
- Help rapidly scale our product and team. As a member of the engineering team, you'll be an integral part of how we mature our tooling, best practices, engineering processes, and hiring.
Check out our engineering blog for some examples of past work. Here are some other examples of recent and future projects:
- Design and build a Schemas system with foundational domain model primitives (such as inheritance/interface support, uniqueness constraints, or filterable fields) so customers can fully model their scientific concepts as domain models.
- Support schema evolution of domain models so customers can iterate at scale with high reliability: asynchronous data migrations over billions of records with zero downtime, while maintaining our data integrity guarantees. As customers' science evolves, they need the ability to evolve their domain models' schemas accordingly at any data scale.
- Re-architect the customer-facing warehouse to be a cost-effective multi-tenant data system with high scalability to meet ongoing customer demand. The warehouse is powering customer-facing data products like Insights and will be leveraged for Machine Learning & Product Data Science.
- Architect and build a next-generation internal analytics platform that empowers our internal data analysts to derive business intelligence from product-analytics data and customer metadata.
- Build a Common Application Platform framework to develop Benchling's Core domain models via DSLs. This framework will auto-generate persistence layers and highly scalable ingress/egress APIs, thus reducing internal engineering teams' effort to build Core domain models from a few months to a few weeks.
QUALIFICATIONS
- Have 7+ years of experience or a proven track record in software engineering
- Experience with data analytics and warehouse solutions such as Snowflake, Delta Lake, AWS Redshift, etc.
- Experience with data processing technologies such as Kafka, Kinesis, Spark, Flink, or other open-source or commercial software
- Experience in schema design, schema registry, SQL and OLTP databases like Postgres
- Strong experience with a scripting language such as Python
- Experience with deployment and configuration management frameworks such as Terraform, Ansible, or Chef and container management systems such as Kubernetes or Amazon ECS.
- Driven by creating positive impact for our customers and Benchling's business, and ultimately accelerating the pace of research in the Life Sciences
- Comfortable with complexity in the short term but can build towards simplicity in the long term
- Strong communicator with both words and data - you understand what it takes to go from raw data to something a human understands
- Willing to work onsite in our SF office 3 days a week.
SALARY RANGE
Benchling takes a market-based approach to pay. The candidate's starting pay will be determined based on job-related skills, experience, qualifications, interview performance, and work location. For this role the base salary range is:
ZONE 1: 177,735 - 240,465
To help you determine which zone applies to your location, please see this resource. If you have questions regarding a specific location's zone designation, please contact a recruiter for additional information.
Total Compensation includes the following:
- Competitive salary and equity
- 100% premiums covered for health, dental, and vision for employees
- Fertility healthcare and family-forming benefits
- Four months of fully paid parental leave
- 401(k) + Employer Match
- Commuter benefits
- Home office stipend
- Mental health benefits + Wellness stipend
- Learning and development stipend
- Generous PTO
- Company-wide Summer & Winter holiday shutdown
- Sabbaticals for 5-year and 10-year anniversaries
Benchling welcomes everyone.
We believe diversity enriches our team so we hire people with a wide range of identities, backgrounds, and experiences.
We are an equal opportunity employer. That means we don’t discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. We also consider for employment qualified applicants with arrest and conviction records, consistent with applicable federal, state and local law, including but not limited to the San Francisco Fair Chance Ordinance.
For applicants for U.S.-based roles only: All new hires in the United States are required to be fully vaccinated against COVID-19 and to provide proof of vaccination prior to their first day of employment. If you are unable to be vaccinated for medical or religious reasons, we will explore potential reasonable accommodations; however, Benchling may not be able to grant such accommodations in all cases if doing so would cause undue hardship.