Data Scientist (Lab Informatics)

Berkeley, California, United States

Profluent

Born out of pioneering research in artificial intelligence and biology, Profluent develops machine learning models that can read and write biomolecules for human health and industrial applications.

Profluent is an AI-first protein design company. Founded in 2022, we develop deep generative models to design and validate novel, functional proteins to revolutionize biomedicine. Based in Berkeley, CA, we are backed by leading investors including Spark Capital, Insight Partners, Air Street Capital, AIX Ventures, and Convergent Ventures.

Profluent is seeking a creative, passionate, and detail-oriented Data Scientist with a focus on laboratory informatics. The successful candidate will work closely with experimental biologists to automate the capture, organization, and analysis of experimental data and to build software that makes scientists more productive.

This is an excellent opportunity to shape the future of AI-driven protein design and to work cross-functionally with a diverse team of experts across machine learning, protein engineering, cell biology, and gene editing.

Responsibilities

  • Provide computational support for experimental biology workflows (guide RNA design, primer design, barcoding, codon optimization, cloning)
  • Write robust software that captures experimental data and loads it into Benchling and other relational databases
  • Collaborate with scientists to create relational data models that represent experimental workflows
  • Implement bioinformatics tools for analysis of next-generation sequencing (NGS) data
  • Act as a data steward who manages, organizes, and provides access to datasets generated across a protein engineering campaign
  • Clearly document code and communicate outcomes to colleagues

Qualifications

  • BS or MS in Computer Science, Bioengineering, Bioinformatics, or a related quantitative bioscience field
  • 3+ years of academic or biotechnology industry research or laboratory experience
  • Experience with Benchling or other laboratory information management systems (LIMS)
  • Experience analyzing data from molecular biology experiments
  • Fluency in Python data analysis tools (NumPy, pandas, Jupyter Notebook, Biopython)
  • Experience with Linux environments and version control (Git)
  • Detail-oriented, highly organized, and able to excel in a fast-paced work environment

Preferred (but not required)

  • Experience working with high throughput data and lab automation systems
  • Experience designing relational data models and writing SQL queries (e.g. MySQL, PostgreSQL)
  • Experience working with Google Cloud Platform (GCP) or other cloud-based compute services (e.g. AWS)
  • Experience with at least one data analysis & visualization tool

Actual salary will be determined based on relevant skills, qualifications, experience, training, and market data. Benefits package may vary depending on company policies and eligibility criteria.

Salary Range: $115,000 to $165,000 USD

What we offer at Profluent

  • A high-growth opportunity with meaningful impact
  • Competitive compensation package
  • Health insurance (health/dental/vision)
  • Generous paid time off (PTO) policy
  • Commitment to physical and mental well-being
  • More benefits and perks to be added!

Profluent Bio, Inc. is an equal opportunity employer promoting diversity and inclusion in the workplace. We do not discriminate on the basis of race, color, religion, marital status, age, national origin, ancestry, physical or mental disability, medical conditions, veteran status, sexual orientation, gender (including gender identity and gender expression), sex (which includes pregnancy, childbirth, and breastfeeding), genetic information, taking or requesting statutorily protected leave, or any other basis protected by law.

Legal authorization to work in the United States is required. In compliance with federal law, all persons hired will be required to verify identity and eligibility to work in the United States and to complete the required employment eligibility verification form upon hire.
