Principal Data Scientist (NLP)

remote

Applications have closed

LineSlip Solutions

With LineSlip, automatically extract and organize commercial insurance documents so you can easily visualize data and automate reporting to make smarter business decisions.

View company page

Abstract:

Build our NLP capabilities for the mining of unstructured and quasi-structured data.

 

Role:

The Principal Data Scientist must be an expert in the full data life cycle. The candidate must review the data we have, understand the business problems we are trying to solve, and craft the best solutions given the limitations of data, and the current state of various NLP technologies. In addition, the Principal Data Scientist must produce deliverable results and take them from development to production in collaboration with our engineers.

 

Required Experience

-Min 3 years commercial experience in a job function whose title was NLP engineer or Data Scientist.

-Min 5 years of direct commercial NLP production experience.

-Must be fluent at the highest level both in written and spoken English.

  • Expertise in the following: Custom Named Entity Extraction, Document Classification, Topic Modeling, tabular data extraction, and general machine learning techniques and the math behind them.
  • Experience with preprocessing noisy and/or unstructured textual data and various traditional extraction techniques such as regular expressions.
  • Strong knowledge of Spacy and BERT.
  • Strong Experience with container technology such as K8 for NLP model deployment.
  • Experience deploying models on Azure.
  • Experience refining and creating DL models namely CNN's, transformers, and BiLSTM models for NLP tasks.
  • Industrial experience using one of the following (Pytorch, Tensorflow, Keras)
  • Expert in Python 3.4 and above and TSQL.
  • Expertise in guiding annotation, producing, processing, evaluating, and utilizing training data.
  • Familiar with various ML/DL cloud architectures.

MSc./PhD in Computer Science, Statistics, Computational Linguistics or related quantitative fields

Tags: Azure BERT Classification Computer Science Industrial Keras Machine Learning Model deployment NLP PhD Python PyTorch spaCy Statistics TensorFlow Topic modeling Transformers T-SQL

Region: Remote/Anywhere
Job stats:  430  42  0

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.