Director of Data and Model Optimisation
New York City, United States - Remote
PolyAI is at the forefront of automating customer service through voice technology. Our voice assistants empower businesses to deliver exceptional customer service at every interaction. We are seeking a dedicated and innovative Director of Data and Model Optimisation to join our team and elevate our machine learning models to new heights.
As the Director of Data and Model Optimisation, you will be responsible for shaping and implementing the strategy for data collection, data pipeline development, and model optimisation. Your primary focus will be on Automatic Speech Recognition (ASR) and Large Language Models (LLMs). You will play a critical role in ensuring the delivery of high-quality data and annotations, iterating on model improvements, and driving the overall performance of our production-grade machine learning models.
Salary range: $200,000 - $300,000 + bonus + equity
Requirements
- PhD in Computer Science, Machine Learning, or a related field, or equivalent industry experience.
- At least 5 years of experience working with deep learning and statistical models, including at least 3 years of industry experience.
- Experience in training and optimising large language models (LLMs).
- Strong expertise in developing and managing data pipelines for machine learning applications.
- In-depth knowledge of data quality standards and annotation processes.
- Proficiency in Python and familiarity with relevant ML frameworks and libraries.
- Experience with cloud services such as AWS, GCP, or Azure.
- Demonstrated ability to lead and mentor technical teams, promoting a collaborative and innovative work environment.
- Excellent verbal and written communication skills, with the ability to convey complex technical concepts to diverse audiences.
- A passion for tackling technical challenges and driving practical solutions.
Responsibilities:
- Develop and manage data pipelines to efficiently and compliantly transport data into our machine learning data lake.
- Define the structure and quality standards for data collection to ensure high-quality annotations and robust datasets.
- Lead initiatives to improve data quality and quantity, focusing on the specific needs of ASR models and LLMs.
- Iterate on model quality improvements, employing cutting-edge techniques and methodologies to enhance performance.
- Collaborate closely with cross-functional teams, including data engineers, machine learning engineers, and product teams, to align data and model optimisation strategies with business goals.
- Oversee the end-to-end lifecycle of model development, from data collection and preprocessing to model training, evaluation, and deployment.
- Stay abreast of the latest advancements in machine learning, ASR, and LLMs to ensure the continuous evolution of our technologies.
- Mentor and guide a team of data scientists and machine learning engineers, fostering a culture of innovation and excellence.
Benefits
- Participation in the company's employee share options plan
- 100% of employee (single) cost and 50% of dependent cost covered for medical, dental & vision
- Life insurance
- Short-term and long-term disability cover (STD and LTD)
- The opportunity to contribute to the company's 401k plan
- 20 days vacation + 10 public holidays + paid sick leave
- Annual learning and development allowance
- One-off WFH allowance when you join
- Enhanced parental leave
- Company-funded fertility and family-forming programmes
- Menopause care programme with Maven
Equal Opportunity Statement:
PolyAI is proud to be an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.
All employment decisions at PolyAI will be based on business needs, without regard to ethnicity, religion, sexual orientation, gender identity, family or parental status, national origin, neurodiversity status or disability status.