Principal Software Engineer, Data Architecture

Somerville, MA

Flagship Pioneering, Inc.

About Generate Biomedicines

Generate Biomedicines is a new kind of therapeutics company – existing at the intersection of machine learning, biological engineering, and medicine – pioneering Generative Biology™ to create breakthrough medicines where novel therapeutics are computationally generated, instead of being discovered. Generate has built a machine learning-powered biomedicines platform with the potential to generate new drugs across a wide range of biologic modalities. This platform represents a potentially fundamental shift in what is possible in the field of biotherapeutic development.

We pursue this audacious vision because we believe in the unique and revolutionary power of generative biology to radically transform the lives of billions, with an outsized opportunity for patients in need. We are seeking collaborative, relentless problem solvers who share our passion for impact to join us!

Generate was founded in 2018 by Flagship Pioneering and has received over $420 million in funding, providing the resources to rapidly scale the organization. The company has offices in Somerville and Andover, Massachusetts, and more than 200 employees.

The Role:

We are seeking a creative, motivated Software Engineer to help build our data platform. Generate Biomedicines is an innovative platform company that uses machine learning to design novel protein therapeutics. This role will be integral to building a powerful, integrated data platform that streamlines data flow between wet and dry lab functions. The successful candidate will work closely with scientists from across the organization to understand both platform and project requirements and translate them into high-quality software that rapidly advances our scientific programs.

Responsibilities

  • Design, plan, and lead the development of a highly scalable and flexible assay data management platform, which:
    • Captures and standardizes assay results and experiment context
    • Enables optimized experiment design
    • Integrates with automation for high-throughput data generation
    • Produces training-ready data for rapid model development
  • Design and develop data pipelines leveraging internal and external data sources
  • Maintain deep technical understanding of Generate’s data inventory and support scientists’ access to data
  • Design and implement data models to support lab and ML workflows
  • Work with lab and computational scientists to:
    • Identify needs and opportunities for platform improvement
    • Prioritize, plan, and implement improvements
    • Drive adoption
  • Design and develop analytical reports and dashboards
  • Lead development efforts and mentor or manage software engineers

Qualifications

  • Self-motivated and curious with strong desire to both learn from and teach others
  • Outstanding communication and interpersonal skills
  • Bachelor’s or Master’s in Computer Science, Data Science, or related field plus 7 years of experience working in the pharmaceutical or biotech domain
  • Deep understanding of data storage technologies (SQL, NoSQL, object storage, etc.)
  • Demonstrated proficiency designing and implementing data models, and an understanding of the tradeoffs involved in technology selection
  • Demonstrated understanding of data management best practices and biological data
  • Experience building and maintaining production data pipelines
  • Experience preparing data sets for machine learning
  • Proficiency with cloud technologies (AWS a plus)

Generate Biomedicines is committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, or veteran status.

COVID Safety:

Generate Biomedicines enforces a mandatory vaccination policy for COVID-19. All employees must be fully vaccinated and have received a booster. The purpose of this policy is to safeguard the health of our employees, their families, and the community at large from infectious disease that may be reduced by vaccination. The company will make exceptions to this policy if required by applicable law and will consider requests for an exemption from this policy due to a medical reason, a sincerely held religious belief, or any other exemption that may be recognized by applicable law.

Recruitment & Staffing Agencies: Generate Biomedicines does not accept unsolicited resumes from any source other than candidates. The submission of unsolicited resumes by recruitment or staffing agencies to Generate Biomedicines or its employees is strictly prohibited unless the agency has been contacted directly by Generate Biomedicines' internal Talent Acquisition team. Any resume submitted by an agency in the absence of a signed agreement will automatically become the property of Generate Biomedicines, and Generate Biomedicines will not owe any referral or other fees with respect thereto.

Perks/benefits: Career development Flex hours

Region: North America
Country: United States