NLP Data Scientist

Remote US

Applications have closed

Verusen is a leading technology company that uses artificial intelligence to provide visibility, digitization and prediction of materials data and inventory for complex supply chains. The company’s AI software harmonizes disparate material data across ERP instances/systems while providing accurate MDM across the enterprise to optimize inventory costs. Intelligent controls enforce inventory procedures to help prevent future inventory spikes, while predictive capabilities optimize allocation and procurement needs. The result is a data foundation you can trust to move quickly to innovate and support related Industry 4.0 initiatives.

Verusen is venture-backed by leading investors from San Francisco to Boston, and is a Signature Company at Georgia Tech’s Advanced Technology Development Center (ATDC). Partnerships including SAP and Accenture. Verusen is a portfolio company of SAP.iO.

In seeking a NLP Data Scientist, Verusen Data Science is aiming to add a member of staff who will provide the following to the organization:

  1. Assistance in the design of new experiments in our natural language processing pipelines.
  2. Assistance in the development of new NLP pipelines including data collection (including working with annotators), data extraction, data cleaning, exploratory data analysis, evaluative metric selection, baseline modeling, prototype modeling, error analysis, production modeling, and ongoing production support. 

Have you been affected by supply chain shortages? 

Verusen is seeking a Data Scientist with a focus on natural language processing and specifically in information retrieval, machine translation, and text summarization. We are looking for a contributor ready to help us build more efficient, resilient, and connected supply chains. As a Data scientist at Verusen, you will be supported by several highly skilled data scientists, including those specializing in NLP. You will have available to you: a weekly study group, design review forums, code reviews, as well as access to annotators and several tasks to sharpen your skills. 

The ideal candidate has experience of ML pipeline development. This candidate can come from academics, personal projects, competitions, open source, or professional work. Additionally, the applicant is expected to have a strong theoretical foundation in classical and modern NLP techniques. More experienced candidates can be considered for Senior level positions. 

Major Responsibilities Include:

  • Deliver a ML project from beginning to end, including understanding the business need, aggregating data, exploring data, building & validating predictive models, and deploying completed models with monitoring.
  • Use Airflow, AWS ML platforms (eg. SageMaker), and scientific frameworks (e.g., TensorFlow, PyTorch, MXNet,  SparkML) to help us build and support ML models.
  • Python (at least 1 year experience), including scientific libraries like Scikit-learn, Pandas, and Numpy.
  • Research and implement novel ML approaches.
  • Work with consultants to analyze, extract, normalize, and label relevant data.

Qualifications

  • Masters degree in quantitative discipline or higher preferred. Less traditional candidates will be seriously considered based on the strength of their portfolio. 
  • Statistics: theoretical foundation in various common distributions, MAP/MLE, bayes, probability theory.
  • Familiarity with some statistical analysis software/library like Scikit-learn, SciPy, R or SPSS, SAS, etc.
  • Deep knowledge of Natural Language Processing.
  • Understanding of generative and discriminative models.
  •  Data engineering: SQL, Spark, Hadoop. 

Nice to Haves

  • Experience building pipelines with ML platforms using Airflow, Amazon SageMaker, Microsoft Azure ML, etc. 
  • Experience creating production code including object oriented programming principles.
  • Unit & integration testing.
  • Experience with algorithmic design, complexity analysis, and scientific computing. 
  • Experience with supply chains.

Why you want to join us

  • Awesome team culture of proven winners with a high sense of urgency and getting stuff done
  • Learning from the best - passionate co-workers and a hands-on and engaged leadership team
  • Great colleagues in a remote-first environment
  • Ability to contribute to sales and marketing strategies in a disruptive market
  • We set you up for success, equipping you with the latest in tech stack advantages
  • Join an ambitious tech company reshaping the way global supply chains work. 
  • Competitive benefits
  • Compensation includes equity

Commitment to Diversity and Inclusion

At Verusen, we are committed to providing an environment of mutual respect where equal employment opportunities are available to all applicants and teammates without regard to race, color, religion, sex, national origin, age, physical and mental disability, sexual orientation, gender identity and/or expression, status as a veteran and any other characteristic protected by applicable law. We respect and seek to empower each individual and support a diverse culture, perspectives, skills, and experiences within our workforce.  We believe that diversity and inclusion among our teammates are critical to our success, and we seek to recruit, develop and retain the most talented people from a diverse candidate pool.

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Tags: Airflow AWS Azure Data analysis EDA Engineering Hadoop Machine Learning ML models MXNet NLP NumPy Open Source Pandas Pipelines Probability theory Python PyTorch R Research SageMaker SAS Scikit-learn SciPy Spark SparkML SQL Statistics TensorFlow Testing

Perks/benefits: Career development Competitive pay Equity

Regions: Remote/Anywhere North America
Country: United States
Job stats:  35  9  0

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.