Senior Data Scientist

Remote

Alkymi

Alkymi is the end-to-end solution for managing your investment document processing.

View company page

At Alkymi, we’re building the first AI-powered business system for unstructured data for financial services. As the first technology service to offer safe and secure large language models inside of financial document workflows, Alkymi is changing the way the alts industry accesses their data - empowering them to service more clients, quickly pivot investment strategies and increase their revenue per employee. Founded in 2017 in New York City, Alkymi works with some of the world’s leading businesses and financial services firms to automate their highest-impact workflows by delivering an unparalleled product experience. We’re laser focused on understanding our customers’ workflows from top to bottom, and building easy-to-use, powerful, tools to meet their objectives. We combine cutting edge data science and machine learning with best-in-class software engineering to delight our users at some of the world’s most demanding and sophisticated businesses.

Core Responsibilities:

  • The Senior Data Scientist is responsible for developing and implementing cutting-edge Natural Language Processing (NLP) algorithms that automate the processing of business documents.
  • The Senior Data Scientist will work closely with our product and engineering teams to ensure that our solutions are scalable, efficient, and effective.
  • Specific duties include: (1) design NLP pipelines to meet customer requirements; (2) Use different approaches for training entity tagging models (e.g. CRF, RNN, CNN, Transformer); (3) Analyze large datasets to identify patterns and insights that drive business decisions; (4) Stay up-to-date on the latest developments in computer vision and NLP research, and incorporate new techniques into our solutions; (5) Communicate technical concepts and insights to both technical and non-technical stakeholders; (6) Build and deploy systems for automating business processes; (7) use Python and source code management, debugging, testing, and deployment to develop software; (8) use text pre-processing and normalization techniques including tokenization and POS tagging; (9) use asynchronous programming and frameworks such as Tensorflow, Pytorch, and Natural Language Processing deep learning algorithms; and (10) oversee the work of and mentor junior data scientists.
  • Telecommuting available from anywhere in US.
  • HQ at 228 Park Ave S, Ste. 63730, New York, NY 10003.

Qualifications:

  • This position requires a Master’s degree or the equivalent in Computer Science, Mathematics, or a related field.
  • Must have 2 years of related experience. Must also have 12 months of experience, as demonstrated through employment or academic coursework, with each of the following: 1) Analyzing large datasets to identify patterns and data insights; 2) Build and deploy systems for automating business processes; 3) Experience with Python, CRF, RNN, CNN, Transformer; Tensorflow; Pytorch; 4) Experience with source code management, debugging, testing, and deployment; 5) Experience with text pre-processing and normalization techniques, including tokenization and POS tagging; and 6) Experience using Natural Language Processing deep learning algorithms.
  • Employer will accept experience gained before, during, or after a Master’s program.
  • Full-time,telecommuting available from anywhere in US.
  • HQ at 228 Park Ave S, Ste. 63730, New York, NY 10003
  • Please apply online at https://www.alkymi.io/company/careers.

Salary: $151,000 to $185,000/year
Apply now Apply later
  • Share this job via
  • or

Tags: Computer Science Computer Vision Deep Learning Engineering LLMs Machine Learning Mathematics NLP Pipelines Python PyTorch Research RNN TensorFlow Testing Unstructured data

Region: Remote/Anywhere
Job stats:  14  6  0
Category: Data Science Jobs

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.