Data Scientist (Lab Informatics)
Berkeley, California, United States
Full Time Mid-level / Intermediate USD 115K - 165K
Profluent
Born out of pioneering research in artificial intelligence and biology, Profluent develops machine learning models that can read and write biomolecules for human health and industrial applications.Profluent is an AI-first protein design company. Founded in 2022, we develop deep generative models to design and validate novel, functional proteins to revolutionize biomedicine. Based in Berkeley, CA, we are backed by leading investors including Spark Capital, Insight Partners, Air Street Capital, AIX Ventures, and Convergent Ventures.
Profluent is currently seeking a creative, passionate, and detail-oriented Data Scientist with a focus on laboratory informatics. The candidate will work closely with experimental biologists to automate the capture, organization, and analysis of experimental data and create software that enhances productivity of scientists.
This is an excellent opportunity to shape the future of AI-driven protein design and to work cross-functionally with a diverse team of experts across machine learning, protein engineering, cell biology, and gene editing.
Responsibilities
- Provide computational support for experimental biology workflows (guide RNA design, primer design, barcoding, codon optimization, cloning)
- Write robust software that captures experimental data and streamlines it into Benchling and other relational databases
- Collaborate with scientists to create relational data models that represent experimental workflows
- Implement bioinformatics tools for analysis of NGS sequencing data
- Act as a data steward, who manages, organizes, and provides access to datasets generated across a protein engineering campaign
- Clearly document code and communicate outcomes to colleagues
Qualifications
- BS or MS in Computer Science, Bioengineering, Bioinformatics, or a related quantitative bioscience field
- 3+ years of academic or biotechnology industry research or laboratory experience
- Experience with Benchling or other laboratory information management systems (LIMS)
- Experience analyzing data from molecular biology experiments
- Fluent in Python data analysis tools (numpy, pandas, Jupyter notebook, biopython)
- Experience with Linux environments and version control (git)
- Pays attention to detail, highly organized, and excels in a fast-paced work environment
Preferences (but not required)
- Experience working with high throughput data and lab automation systems
- Experience designing relational data models and writing SQL queries (e.g. MySQL, PostgreSQL)
- Experience working with Google Cloud Platform (GCP) or other cloud-based compute services (e.g. AWS)
- Experience with at least one data analysis & visualization tool
Actual salary will be determined based on relevant skills, qualifications, experience, training, and market data. Benefits package may vary depending on company policies and eligibility criteria.
Salary Range$115,000—$165,000 USDWhat we offer at Profluent
- A high-growth opportunity with meaningful impact
- Competitive compensation package
- Health insurance (health/dental/vision)
- Generous paid time off (PTO) policy
- Commitment to physical and mental well-being
- More benefits and perks to be added!
Profluent Bio, Inc is an equal opportunity employer promoting diversity and inclusion in the workspace. We do not discriminate on the basis of race, color, religion, marital status, age, national origin, ancestry, physical or mental disability, medical conditions, veteran status, sexual orientation, gender (including gender identity and gender expression), sex (which includes pregnancy, childbirth, and breastfeeding), genetic information, taking or requesting statutorily protected leave, or any other basis protected by law.
Legal authorization to work in the United States is required. In compliance with federal law, all persons hired will be required to verify identity and eligibility to work in the United States and to complete the required employment eligibility verification form upon hire.
Tags: AWS Bioinformatics Biology Biopython Computer Science Data analysis Engineering GCP Generative modeling Git Google Cloud Jupyter Linux Machine Learning MySQL NumPy Pandas PostgreSQL Protein engineering Python RDBMS Research Spark SQL
Perks/benefits: Career development Competitive pay Health care Insurance Medical leave Startup environment
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Business Intelligence Engineer jobs
- Open Data Engineer II jobs
- Open Lead Data Analyst jobs
- Open Power BI Developer jobs
- Open Marketing Data Analyst jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Science Manager jobs
- Open Junior Data Scientist jobs
- Open MLOps Engineer jobs
- Open Business Data Analyst jobs
- Open Data Scientist II jobs
- Open Product Data Analyst jobs
- Open Business Intelligence Developer jobs
- Open Data Analytics Engineer jobs
- Open Sr Data Engineer jobs
- Open Data Analyst Intern jobs
- Open Senior Data Architect jobs
- Open Principal Data Scientist jobs
- Open Sr. Data Scientist jobs
- Open Azure Data Engineer jobs
- Open Big Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Research Scientist jobs
- Open Junior Data Engineer jobs
- Open Data Quality Analyst jobs
- Open GCP-related jobs
- Open Java-related jobs
- Open Data quality-related jobs
- Open ML models-related jobs
- Open Business Intelligence-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open PhD-related jobs
- Open Deep Learning-related jobs
- Open Data visualization-related jobs
- Open NLP-related jobs
- Open Finance-related jobs
- Open PyTorch-related jobs
- Open TensorFlow-related jobs
- Open APIs-related jobs
- Open LLMs-related jobs
- Open Generative AI-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open CI/CD-related jobs
- Open Kubernetes-related jobs
- Open Hadoop-related jobs
- Open Data governance-related jobs
- Open Databricks-related jobs
- Open Data warehouse-related jobs