Principal Data Scientist

New York - New York City

Full Time Senior-level / Expert
Veeva Systems logo
Veeva Systems
Apply now Apply later

Posted 1 week ago

Veeva [NYSE: VEEV] is the leader in cloud-based software for the global life sciences industry. Committed to innovation, product excellence, and customer success, our customers range from the world’s largest pharmaceutical companies to emerging biotechs. Veeva’s software helps our customers bring medicines and therapies to patients faster.
We are the first public company to become a Public Benefit Corporation. As a PBC, we are committed to making the industries we serve more productive, and we are committed to creating high-quality employment opportunities.
Veeva is a Work Anywhere company which means that you can choose to work in the environment that works best for you - on any given day. Whether you choose to work remotely from home or work in an office - it’s up to you.
The Role
Veeva Data Cloud is a family of data products aimed at bringing more innovative solutions and greater choice to the life sciences data market.  Life sciences companies license our data to inform high-impact commercial initiatives, such as patient journey mapping, healthcare professional (HCP) targeting, and field force alignment.  Veeva Data Cloud leverages software and cloud technology to develop and deliver better data, faster. 
As the Principal Data Scientist for the Veeva Data Cloud team, you will be focused on designing and building our methodologies to bring Projected Data Products to life for our customers. You are excited about statistics and data science at scale on big data and taking billions of records to tell a story, from sample curation, projection methodologies, anomaly detection, scaling approaches, clustering, and more. You design and build algorithms in a computationally efficient and statistically effective manner, while being able to keep the business problems we are working to solve in mind. While ML is an important part of your toolkit, it's not your only skill. The ability to dissect the problem and to select from a variety of techniques is key. This is a great opportunity for someone who is excited about using their statistics and data science expertise to design and build the algorithms and models used at the core of launching Veeva’s Projected Data Products. You’ll collaborate closely with the Product Management and Engineering team to productize the methodologies and get to see enterprise Life Science customers leverage your work every day.

What You'll Do

  • Apply statistical, machine learning, and data mining techniques to large health data sets to build new products and methodologies
  • Collaborate closely with a team of Data Scientists, Product Managers, Software Engineers and Data Engineers to discover and deliver product offerings from prototype to scale, then iterate and enhance 
  • Explore and find meaning in high volumes of data, identify signals and patterns to identify relationships to infer the universe from imperfect data.  Important skills include querying, data cleansing, experiment design, solution assessment, identifying scaling challenges.
  • Rapidly build prototype product solutions, communicate findings, and iterate
  • Draw from prior experience and technical expertise to identify product improvements and inform testing plans; break overall objectives down into underlying problems that can be prioritized and solved


  • 10+ years of hands-on data science and statistics experience, demonstrating increasing responsibility and impact over time, including experience as the point person on projects
  • M.S. or Ph.D. in Applied Statistics, Mathematics, Computer Science, Machine Learning or other quantitative discipline
  • Highly proficient in Python (packages: pandas, scikit-learn, statsmodels) and SQL; experience working with AWS preferred
  • Experience working with large quantities of data to develop models that work in a stable, production approach with live data
  • Advanced knowledge of statistical analysis and data mining techniques (regression, multilevel regression, poststratification, semi-supervised learning, forecasting, decision trees, clustering, A/B testing, etc.)
  • Experience working with engineering to productionalize models including scaling, monitoring, and documentation
  • Comfortable (and excited!) about ambiguity and breaking goals down into tangible and actionable work plans
  • Strong communication skills and ability to work across internal teams 

Nice to Have

  • Statistician in a health-related field, such as epidemiology

Perks & Benefits

  • Flexible PTO
  • Allocations for continuous learning & development
  • Health & wellness programs
Veeva’s headquarters is located in the San Francisco Bay Area with offices in more than 15 countries around the world.
Veeva is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, sex, sexual orientation, gender identity or expression, religion, national origin or ancestry, age, disability, marital status, pregnancy, protected veteran status, protected genetic information, political affiliation, or any other characteristics protected by local laws, regulations, or ordinances.
Job tags: AWS Big Data Data Mining Engineering Healthcare Machine Learning ML Pandas Python Scikit-Learn SQL
Job region(s): North America
Job stats:  6  0  0
  • Share this job via
  • or