Principal Data Engineer, Knowledge Graphs and Data Semantics

San Francisco Bay Area, CA

Applications have closed

Altos Labs

Altos Labs is a biotechnology company focused on restoring cell health and resilience through cell rejuvenation to reverse disease, injury, and the disabilities that occur throughout life. Learn more about Altos.

View company page

Our Mission

To restore cell health and resilience through cellular rejuvenation programming to reverse disease, injury, and the disabilities that can occur throughout life.

For more information, see our website at altoslabs.com.

Diversity at Altos

We believe that diverse perspectives are foundational to scientific innovation and inquiry. 

We are building a company where exceptional scientists and industry leaders from around the world work side by side to advance a shared mission. 

Our intentional focus is on Belonging, so that all employees know that they are valued for their unique perspectives. 

At Altos, we are all accountable for sustaining a diverse and inclusive environment.

Who You Are

The Altos Labs Scientific Computing & Data (SCD) group consists of software, data, and machine learning engineers implementing scalable data engineering solutions enabling Altos Labs' scientific mission. We are currently hiring a Principal Data Engineer, Knowledge Graphs, and Data Semantics to lead the development of knowledge graphs integrating data at multiple biological scales. 

This role will be responsible for designing ontologies and information models capturing multimodal genomics, imaging, mass spectrometry, and clinical data generating unique insights on cell rejuvenation and health. The knowledge graph will be the engine to integrate internal and external experimental datasets, reference datasets, and ontologies, and provide the foundation to drive AI/ML research at Altos Labs. The Principal Data Engineer will drive standardization of the datasets being generated across Altos Labs using ontologies and controlled vocabularies, including Gene Ontology, Drug Target Ontology (DTO), BioAssay Ontology, and semantic models representing cells and cell lines. This role would own the strategy and implementation of knowledge graph and semantic data management across various modalities and use cases, working closely with researchers across Altos Science & Medical Institutes. As a Principal data engineer, you will shape the culture, strategy, and technology roadmap of the group.

Responsibilities

  • Integrate multimodal genomics, imaging, mass spectrometry and clinical data to represent knowledge about proteins, genes, transcription factors, pathways to generate unique insights on cell rejuvenation and health.
  • Building conceptual models for representing perturbations (single and combinational) and resultant response from cells and networks.
  • Develop knowledge graph representing information at multiple biological scales, providing an integrated view of internal and external datasets,  enabling computational and data science research 
  • Integrate internal and external experimental datasets, reference datasets and ontologies
  • Standardize datasets being generated across Altos and provide single source of truth
  • Own the strategy and implementation of knowledge graph and semantic data management across various modalities and use cases, and work closely with stakeholders across global research organization 
  • Be a trusted partner for scientific teams in order to ideate and develop engineering solutions to support and accelerate research across various labs.
  • Be a thought leader bridging Altos Labs Scientific Computing & Data group with other leading engineering organizations and academic institutions, and to promote Altos Labs as best place to work for top data engineering talent 
  • Engage extensively with major consortiums for medical R&D, large genomics research institutes, and leading startups leveraging latest system design and implementations to support Altos Labs mission

Requirements

  • Masters or PhD in Computer Science, Bioinformatics with strong emphasis on data and knowledge modeling 
  • 8+ years of experience in academia and/or industry working with research data sets (genomics, imaging & microscopy)
  • Hands-on experience in knowledge graph & ontology development, and programming in python, R or Java 
  • Experience in successfully bringing data products from inception, ideation, prototyping and implementation
  • Experience or familiarity with biology, bioinformatics, or common biological data analysis methods and experience working with biologists or bioinformaticians
  • Exposure to large language models within biomedical domain and beyond
  • Ability to work in cross functional and cross location teams 


The salary for this role is: $229,000 - $310,000.


#LI-LY1

What We Want You To Know

We are a culture of collaboration and scientific freedom, and we believe in the values of diversity, inclusion and belonging to inspire innovation.

Altos Labs provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. 

This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation and training. 

Altos currently requires all employees to be fully vaccinated against COVID-19, subject to legally required exemptions (e.g., due to a medical condition or sincerely-held religious belief).

Thank you for your interest in Altos Labs where we strive for a culture of Scientific Freedom, Learning, and Belonging.

Note: Altos Labs will not ask you to download a messaging app for an interview or outlay your own money to get started as an employee. If this sounds like your interaction with people claiming to be with Altos, it is not legitimate and has nothing to do with Altos. Learn more about a common job scam at https://www.linkedin.com/pulse/how-spot-avoid-online-job-scams-biron-clark/

Tags: Bioinformatics Biology Computer Science Data analysis Data management Engineering Java LLMs Machine Learning PhD Prototyping Python R R&D Research

Perks/benefits: Career development

Region: North America
Country: United States
Job stats:  18  0  0
Category: Engineering Jobs

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.