Principal Software Engineer, Data Architecture
Somerville, MA
Applications have closed
Flagship Pioneering, Inc.
We are Flagship Pioneering. We are a biotechnology company that invents platforms and builds companies that change the world.
About Generate Biomedicines
Generate Biomedicines is a new kind of therapeutics company – existing at the intersection of machine learning, biological engineering, and medicine – pioneering Generative Biology™ to create breakthrough medicines where novel therapeutics are computationally generated, instead of being discovered. Generate has built a machine learning-powered biomedicines platform with the potential to generate new drugs across a wide range of biologic modalities. This platform represents a potentially fundamental shift in what is possible in the field of biotherapeutic development.
We pursue this audacious vision because we believe in the unique and revolutionary power of generative biology to radically transform the lives of billions, with an outsized opportunity for patients in need. We are seeking collaborative, relentless problem solvers who share our passion for impact to join us!
Generate was founded in 2018 by Flagship Pioneering and has received over $420 million in funding, providing the resources to rapidly scale the organization. The company has over 200 employees and offices in Somerville and Andover, Massachusetts.
The Role:
We are seeking a creative, motivated Software Engineer to help build our data platform. Generate Biomedicines is an innovative platform company leveraging Machine Learning to design novel protein therapeutics. This role will be an integral part of building a powerful, integrated data platform that streamlines data flow between wet and dry lab functions. The successful candidate will work closely with scientists from across the organization to understand both platform and project requirements, and translate those into high-quality software that rapidly advances our scientific programs.
Responsibilities
- Design, plan, and lead the development of a highly scalable and flexible assay data management platform, which:
  - Captures and standardizes assay results and experiment context
  - Enables optimized experiment design
  - Integrates with automation for high-throughput data generation
  - Produces training-ready data for rapid model development
- Design and develop data pipelines leveraging internal and external data sources
- Maintain deep technical understanding of Generate’s data inventory and support scientists’ access to data
- Design and implement data models to support lab and ML workflows
- Work with lab and computational scientists to:
  - Identify needs and opportunities for platform improvement
  - Prioritize, plan, and implement those improvements
  - Drive adoption
- Design and develop analytical reports and dashboards
- Lead development efforts, mentor/manage software engineers
Qualifications
- Self-motivated and curious with strong desire to both learn from and teach others
- Outstanding communication and interpersonal skills
- Bachelor’s or Master’s in Computer Science, Data Science, or related field plus 7 years of experience working in the pharmaceutical or biotech domain
- Deep understanding of data storage technologies (SQL, NoSQL, object storage, etc.)
- Demonstrated proficiency designing and implementing data models, and an understanding of the tradeoffs involved in technology selection
- Demonstrated understanding of data management best practices and biological data
- Experience building and maintaining production data pipelines
- Experience preparing data sets for machine learning
- Proficiency with cloud technologies (AWS a plus)
Generate Biomedicines is committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, or veteran status.
COVID Safety:
Generate Biomedicines enforces a mandatory vaccination policy for COVID-19. All employees must be fully vaccinated and have received a booster. The purpose of this policy is to safeguard the health of our employees, their families, and the community at large from infectious disease that may be reduced by vaccinations. The company will make exceptions to this policy if required by applicable law and will consider requests for an exemption from this policy due to a medical reason, a sincerely held religious belief, or any other exemption that may be recognized by applicable law.
Recruitment & Staffing Agencies: Generate Biomedicines does not accept unsolicited resumes from any source other than candidates. The submission of unsolicited resumes by recruitment or staffing agencies to Generate Biomedicines or its employees is strictly prohibited unless the agency has been contacted directly by Generate Biomedicines' internal Talent Acquisition team. Any resume submitted by an agency in the absence of a signed agreement will automatically become the property of Generate Biomedicines, and Generate Biomedicines will not owe any referral or other fees with respect thereto.
Tags: Architecture, AWS, Biology, Computer Science, Data management, Data pipelines, Engineering, Machine Learning, ML models, NoSQL, Pipelines, SQL
Perks/benefits: Career development, Flex hours