Sr. Data Scientist

Remote

Applications have closed

Trace Genomics

Trace Genomics delivers comprehensive, precise soil analysis to help farmers, ag retailers and product companies turn insights into action.

View company page

Trace Genomics has developed the first analytics engine that learns as it maps the living soil. Founded in 2015 to serve the farming community, Trace Genomics helps growers maximize the value of every acre. The company is building the largest, most actionable body of soil intelligence, making thousands of growers experts on what’s underground. Working collaboratively across the agriculture ecosystem, Trace Genomics helps growers optimize costs, manage risk and protect their soil. Headquartered in Silicon Valley—led by industry veteran Dan Vradenburg and founded by two female PhDs from Stanford University—Trace Genomics is having a meaningful impact on the future of food.
At Trace Genomics, our goal is to leverage our unique data on the soil environment to provide new tools for agriculture to solve important problems. Our database is composed of information on the microbial, chemical, and physical attributes of soil, one of the most complex habitats on the planet. This data can inform solutions to challenges facing agriculture today, from the need to grow crops more sustainably to the uncertain effect of new products on crop health. We are hiring a Senior Data Scientist with deep expertise in machine learning, statistics, building data products, and data tools to expand our ability to apply data to solve problems facing agriculture. You will work with a science and analytics team that is responsible for a wide range of challenging problems including, but not limited to, bioinformatics, machine learning, statistics, customer analyses, and product evaluations.
Responsibilities: Lead and enhance Trace Genomics’s predictive capabilities by: ●      Extracting new features from our unique soil and microbiome data layers;●      Identifying and guiding development of products that leverage statistics and/or state of the art machine learning based methodologies;●      Designing, optimizing, and deploying scalable machine learning based tools into production;●      Evaluating and optimizing existing machine learning processes; and●      Collaborating with engineering and commercial teams to employ best coding practices and to understand customer needs Requirements: ●      Masters or PhD in a quantitative or statistical science●      4+ years of work experience after degree, with demonstrated solid data science knowledge and practice, emphasizing the implementation of machine learning tools in production●      Expertise in machine learning: classification, regression, boosted decision trees (e.g. scikit-learn, XGBoost, LightGBM, etc.)●      Experience with data munging, feature extraction and engineering, data pipeline development and optimization●      Fluency in Python with the ability to read, write, and debug code●      Demonstrated ability to take ownership of significant projects that guide company product development●      Ability to independently interpret signals and draw meaningful conclusions from data●      Ability to communicate well in a multi-disciplinary team of scientists and engineers●      Ability to collaborate on vertical (cross-functional) teams to tackle high-visibility, high-value business opportunities Nice to Haves: ●      Experience with geospatial data, including agricultural data layers●      Experience with accessing public APIs and building tools to incorporate new data layers, e.g. satellite imagery, climate data●      Experience with genomic and/or microbiome datasets●      Experience with Bayesian modeling (e.g. PyMC3)●      Experience with PyTorch, Tensorflow, or other deep learning packages●      Experience with relational databases●      Experience with data visualization and associated toolsKey Personal Attributes Include:Committed to our core values: Customer-Driven, Scientific Integrity, Creative Problem Solving, Diversity & Inclusion, and Fearless Determination

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Tags: APIs Bayesian Classification Data visualization Deep Learning Engineering LightGBM Machine Learning PhD Python PyTorch RDBMS Scikit-learn Statistics TensorFlow XGBoost

Perks/benefits: Career development

Region: Remote/Anywhere
Job stats:  13  2  0
Category: Data Science Jobs

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.