Director of Data and Model Optimisation

New York City, United States - Remote


PolyAI is at the forefront of automating customer service through voice technology. Our voice assistants empower businesses to deliver exceptional customer service at every interaction. We are seeking a dedicated and innovative Director of Data and Model Optimisation to join our team and elevate our machine learning models to new heights.

As the Director of Data and Model Optimisation, you will be responsible for shaping and implementing the strategy for data collection, data pipeline development, and model optimisation. Your primary focus will be on Automatic Speech Recognition (ASR) and Large Language Models (LLMs). You will play a critical role in ensuring the delivery of high-quality data and annotations, iterating on model improvements, and driving the overall performance of our production-grade machine learning models.

Salary range: $200,000 - $300,000 + bonus + equity

Requirements

  • PhD in Computer Science, Machine Learning, or a related field, or equivalent industry experience.
  • At least 5 years of experience working with deep learning and statistical models, with at least 3 years of industry experience.
  • Experience in training and optimising large language models (LLMs).
  • Strong expertise in developing and managing data pipelines for machine learning applications.
  • In-depth knowledge of data quality standards and annotation processes.
  • Proficiency in Python and familiarity with relevant ML frameworks and libraries.
  • Experience with cloud services such as AWS, GCP, or Azure.
  • Demonstrated ability to lead and mentor technical teams, promoting a collaborative and innovative work environment.
  • Excellent verbal and written communication skills, with the ability to convey complex technical concepts to diverse audiences.
  • A passion for tackling technical challenges and driving practical solutions.

Responsibilities:

  • Develop and manage data pipelines to efficiently and compliantly transport data into our machine learning data lake.
  • Define the structure and quality standards for data collection to ensure high-quality annotations and robust datasets.
  • Lead initiatives to improve data quality and quantity, focusing on the specific needs of ASR models and LLMs.
  • Iterate on model quality improvements, employing cutting-edge techniques and methodologies to enhance performance.
  • Collaborate closely with cross-functional teams, including data engineers, machine learning engineers, and product teams, to align data and model optimisation strategies with business goals.
  • Oversee the end-to-end lifecycle of model development, from data collection and preprocessing to model training, evaluation, and deployment.
  • Stay abreast of the latest advancements in machine learning, ASR, and LLMs to ensure the continuous evolution of our technologies.
  • Mentor and guide a team of data scientists and machine learning engineers, fostering a culture of innovation and excellence.

Benefits

  • 💰 Participation in the company's employee share options plan
  • 🏥 100% of employee-only cost and 50% of dependent cost covered for medical, dental & vision
  • 👪 Life insurance
  • ◻️ Short-term and long-term disability (STD and LTD)
  • 💰 The opportunity to contribute to the company's 401k plan
  • 🏝 20 days vacation + 10 public holidays + paid sick leave
  • 📚 Annual learning and development allowance
  • 🏡 One-off WFH allowance when you join
  • 🧡 Enhanced parental leave
  • 👨‍👩‍👧 Company-funded fertility and family-forming programmes
  • 🌸 Menopause care programme with Maven

Equal Opportunity Statement:

PolyAI is proud to be an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.

All employment decisions at PolyAI will be based on business needs, without regard to ethnicity, religion, sexual orientation, gender identity, family or parental status, national origin, neurodiversity status or disability status.



Regions: Remote/Anywhere North America
Country: United States
