NLP Data Scientist
Remote US
Applications have closed
Verusen is a leading technology company that uses artificial intelligence to provide visibility, digitization and prediction of materials data and inventory for complex supply chains. The company’s AI software harmonizes disparate material data across ERP instances/systems while providing accurate MDM across the enterprise to optimize inventory costs. Intelligent controls enforce inventory procedures to help prevent future inventory spikes, while predictive capabilities optimize allocation and procurement needs. The result is a data foundation you can trust to move quickly to innovate and support related Industry 4.0 initiatives.
Verusen is venture-backed by leading investors from San Francisco to Boston, and is a Signature Company at Georgia Tech’s Advanced Technology Development Center (ATDC). Partnerships including SAP and Accenture. Verusen is a portfolio company of SAP.iO.
In seeking a NLP Data Scientist, Verusen Data Science is aiming to add a member of staff who will provide the following to the organization:
- Assistance in the design of new experiments in our natural language processing pipelines.
- Assistance in the development of new NLP pipelines including data collection (including working with annotators), data extraction, data cleaning, exploratory data analysis, evaluative metric selection, baseline modeling, prototype modeling, error analysis, production modeling, and ongoing production support.
Have you been affected by supply chain shortages?
Verusen is seeking a Data Scientist with a focus on natural language processing and specifically in information retrieval, machine translation, and text summarization. We are looking for a contributor ready to help us build more efficient, resilient, and connected supply chains. As a Data scientist at Verusen, you will be supported by several highly skilled data scientists, including those specializing in NLP. You will have available to you: a weekly study group, design review forums, code reviews, as well as access to annotators and several tasks to sharpen your skills.
The ideal candidate has experience of ML pipeline development. This candidate can come from academics, personal projects, competitions, open source, or professional work. Additionally, the applicant is expected to have a strong theoretical foundation in classical and modern NLP techniques. More experienced candidates can be considered for Senior level positions.
Major Responsibilities Include:
- Deliver a ML project from beginning to end, including understanding the business need, aggregating data, exploring data, building & validating predictive models, and deploying completed models with monitoring.
- Use Airflow, AWS ML platforms (eg. SageMaker), and scientific frameworks (e.g., TensorFlow, PyTorch, MXNet, SparkML) to help us build and support ML models.
- Python (at least 1 year experience), including scientific libraries like Scikit-learn, Pandas, and Numpy.
- Research and implement novel ML approaches.
- Work with consultants to analyze, extract, normalize, and label relevant data.
Qualifications
- Masters degree in quantitative discipline or higher preferred. Less traditional candidates will be seriously considered based on the strength of their portfolio.
- Statistics: theoretical foundation in various common distributions, MAP/MLE, bayes, probability theory.
- Familiarity with some statistical analysis software/library like Scikit-learn, SciPy, R or SPSS, SAS, etc.
- Deep knowledge of Natural Language Processing.
- Understanding of generative and discriminative models.
- Data engineering: SQL, Spark, Hadoop.
Nice to Haves
- Experience building pipelines with ML platforms using Airflow, Amazon SageMaker, Microsoft Azure ML, etc.
- Experience creating production code including object oriented programming principles.
- Unit & integration testing.
- Experience with algorithmic design, complexity analysis, and scientific computing.
- Experience with supply chains.
Why you want to join us
- Awesome team culture of proven winners with a high sense of urgency and getting stuff done
- Learning from the best - passionate co-workers and a hands-on and engaged leadership team
- Great colleagues in a remote-first environment
- Ability to contribute to sales and marketing strategies in a disruptive market
- We set you up for success, equipping you with the latest in tech stack advantages
- Join an ambitious tech company reshaping the way global supply chains work.
- Competitive benefits
- Compensation includes equity
Commitment to Diversity and Inclusion
At Verusen, we are committed to providing an environment of mutual respect where equal employment opportunities are available to all applicants and teammates without regard to race, color, religion, sex, national origin, age, physical and mental disability, sexual orientation, gender identity and/or expression, status as a veteran and any other characteristic protected by applicable law. We respect and seek to empower each individual and support a diverse culture, perspectives, skills, and experiences within our workforce. We believe that diversity and inclusion among our teammates are critical to our success, and we seek to recruit, develop and retain the most talented people from a diverse candidate pool.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Airflow AWS Azure Data analysis EDA Engineering Hadoop Machine Learning ML models MXNet NLP NumPy Open Source Pandas Pipelines Probability theory Python PyTorch R Research SageMaker SAS Scikit-learn SciPy Spark SparkML SQL Statistics TensorFlow Testing
Perks/benefits: Career development Competitive pay Equity
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Lead Data Analyst jobs
- Open MLOps Engineer jobs
- Open Data Science Manager jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Engineer II jobs
- Open Sr Data Engineer jobs
- Open Data Manager jobs
- Open Principal Data Engineer jobs
- Open Data Analytics Engineer jobs
- Open Power BI Developer jobs
- Open Junior Data Scientist jobs
- Open Business Intelligence Developer jobs
- Open Senior Data Architect jobs
- Open Data Scientist II jobs
- Open Product Data Analyst jobs
- Open Sr. Data Scientist jobs
- Open Business Data Analyst jobs
- Open Manager, Data Engineering jobs
- Open Big Data Engineer jobs
- Open Data Quality Analyst jobs
- Open Data Analyst Intern jobs
- Open Data Product Manager jobs
- Open Azure Data Engineer jobs
- Open Junior Data Engineer jobs
- Open ETL Developer jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open ML models-related jobs
- Open GCP-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open Java-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open APIs-related jobs
- Open Deep Learning-related jobs
- Open PyTorch-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open TensorFlow-related jobs
- Open PhD-related jobs
- Open CI/CD-related jobs
- Open NLP-related jobs
- Open Kubernetes-related jobs
- Open Data governance-related jobs
- Open Airflow-related jobs
- Open Hadoop-related jobs
- Open LLMs-related jobs
- Open Databricks-related jobs
- Open DevOps-related jobs