Data Scientist Large Language Model Generative AI

San Francisco, CA

Applications have closed

Pivotal Life Sciences

PLS envisions becoming the life sciences investment industry's best tech-enabled investment platform. The AI team aims to provide best-in-class intelligence support across all steps of the investment process, from deal sourcing to exit.

Pivotal Life Sciences – Large Language Model/Generative AI Data Scientist – Job Description 



If WHO you work for and with and WHY your company exists are at the top of your list of criteria for choosing your next opportunity, we encourage you to take a look at the roles we are hiring for at Pivotal Life Sciences (PLS) and join our growing team of data scientists and engineers working together toward a common goal: the health and care of the patient. We’re looking for people not only to join us but to be a big part of the solution. To do more and be more. To do well and do good. If you have the ability to see the bigger picture and bring it to life, let's connect!


Pivotal Life Sciences (“PLS”), a part of the Nan Fung Group, is a global investment platform focusing on the life sciences. Leveraging Nan Fung Group’s strong capital base and long-term commitment to the space, the company aims to become the ideal partner for scientists, entrepreneurs, corporations, and investors in the life science space. Through direct investments via Pivotal bioVenture Partners funds (in both the US and China) and fund investments covering the full spectrum of the industry (including therapeutics, medical devices, and diagnostics) and different development stages, Nan Fung Life Sciences has a significant presence in both the US and Greater China. Learn more at 


Locations: San Francisco, CA, USA; Shanghai, China


The Vision 

PLS envisions becoming the life sciences investment industry's best tech-enabled investment platform. The AI (Artificial Intelligence) team aims to provide best-in-class intelligence support across all steps of the investment process, from deal sourcing to exit. The system will function as an additional team member and help augment the investment team to make better investments and build better companies for unmet therapeutic needs. Ultimately, PLS will become a scientifically driven investor across the life science ecosystem from academic spinouts to venture rounds and to exit. Come be part of our vision! 


The Role 

As a PLS Large Language Model/Generative AI Data Scientist, you will be a member of our new global AI and Data Intelligence team. This team’s goal is to build state-of-the-art data and AI technology with strong research fundamentals for our life sciences investment arm. You will use your data science expertise to investigate and work with relevant data sources and to build AI products that support the investment process, such as disease mapping, portfolio management predictions, financial planning, and optimization of management operations. This is a great opportunity to work on a range of AI-powered projects in a growing team with exposure to the best life science companies today.



  • Work with researchers, software engineers, and other data scientists to develop an AI and analytics platform that supports our investment team
  • Map and analyze multiple data sets, with emphasis on text-based data, for example: competitive landscape analysis for a drug target or company, tracking of companies in a sector, talent mapping and analysis, etc.
  • Develop LLM-based AI models for due diligence using biological and financial data sources, including but not limited to PubMed, patents, SEC (Securities and Exchange Commission) filings, and other biological and financial data sets
  • Work closely with the investment team to understand the business needs, and provide the investment team with insights on scientific and financial due diligence of new investment opportunities  
  • Design and implement AI solutions working within a Software Development Life Cycle (SDLC)
  • Maintain awareness of state-of-the-art AI, including generative models as well as more classic NLP models, and use it where appropriate
  • Collaborate closely with different functions to evangelize an AI-supported investment process
  • Extract text and develop intelligence and insights from biological, financial, and operations data sets
  • Optimize operational work such as talent management, deal structuring, and portfolio management with data solutions and AI models, using extracted CRM (Customer Relationship Management), legal, and financial data, SEC filings, etc.
  • Benchmark, evaluate and document model performance, and provide recommendations for continuous improvement of models 



  • Master’s degree or PhD in computer science, artificial intelligence, applied mathematics, statistics, machine learning or related discipline 
  • 3-5 years of applied experience in machine learning, deep learning methods, statistical data analysis, and complex data visualization; experience in the life science industry is a plus
  • Deep experience with Python  
  • Experience with recent generative models (GPT-4, Stable Diffusion models, and others, including more focused language models)
  • Experience or strong interest in working with cloud computing systems (preferably AWS (Amazon Web Services)) 
  • Experience with AI platforms such as SageMaker, MLflow, or others preferred
  • Experience building machine learning and deep learning models with at least one common framework such as PyTorch, TensorFlow, Keras, or scikit-learn
  • Knowledge of relational database architecture and data management with expertise in SQL 
  • Familiarity with software development practices such as unit testing, code reviews, and version control 
  • Excellent analytical skills and presentation skills  
  • Strong verbal and written communication skills and ability to work independently and cooperatively 
  • Proficiency in English 
  • US Work Visa 
  • Hybrid work schedule: Able to be in San Francisco office, in-person at least 3 days per week, option to work from home 2 days per week 
  • Salary Range $142k-$200k 


Tags: Architecture AWS Computer Science Data analysis Data management Data visualization Deep Learning Diffusion models Engineering Generative AI Generative modeling GPT GPT-4 Keras LLMs Machine Learning Mathematics MLFlow NLP PhD Python PyTorch RDBMS Research SageMaker Scikit-learn SDLC SQL Stable Diffusion Statistics TensorFlow Testing

Perks/benefits: Career development Competitive pay

Region: North America
Country: United States
