Lead Machine Learning Operations Engineer (PySpark, Python, R)
Johnston, RI, United States
Factory Mutual Insurance Company
FM Global's multinational presence and capabilities allow us to provide seamless insurance solutions, services and claims response around the world.Overview
FM Global is a leading property insurer of the world's largest businesses, providing more than one-third of FORTUNE 1000-size companies with engineering-based risk management and property insurance solutions. FM Global helps clients maintain continuity in their business operations by drawing upon state-of-the-art loss-prevention engineering and research; risk management skills and support services; tailored risk transfer capabilities; and superior financial strength.
To do so, we rely on a dynamic, culturally diverse group of employees, working in more than 100 countries, in a variety of challenging roles.
FM Global is seeking a Lead Machine Learning Engineer to join our AI/ML team to lead Machine Learning Engineering, working very closely with Data Science, Data Engineering, Subject Matter Experts and Solution Architecture teams. As a part of our dynamic team, you will be an Azure AI/ML Ops Engineer focused on building a robust data platform and pipelines that enable advanced analytics.
This role offers the unique opportunity to develop AI/ML-based applications that have a meaningful impact on our customers. Our machine learning platform helps manage the various components of the ML application development life cycle, starting from data ingestion, and experimentation, to model training, deployment, and monitoring.
All of these components are interdisciplinary, so you will be working closely with cross-functional teams across the organization.
As a Lead Machine Learning Engineer you enable the Data Science team by developing platform tooling, by undertaking Data Engineering, by deploying Data Science models to production and monitoring production performance of Data Science models.
As the Lead MLOps Engineer, you will Lead Machine Learning projects end-to-end and develop platform tooling for the Data Science team. You will be responsible for Machine Learning Operations outcomes: Velocity of Model Deployments, Validation of Model Deployed Code and Versioning of Data, Model and Infrastructure.
Responsibilities
Leadership: As a Lead Machine Learning Engineer on the FM Global AI/ML Ops team, you will develop data pipelines, take data science prototype models to production, fix production bugs, monitor operations and provision the necessary infrastructure in Azure.
Collaboration: Work with data scientists to understand their data needs and put together data pipelines to ingest data. Work with data scientists to take data science model prototypes to production. Mentor and train junior team members.
Continuous Improvement: Enhance and improve the code deployment and model monitoring frameworks and project operations documentation.
Technical Competency: Design, provision and maintain the cloud infrastructure needed to support Data Engineering, Data Science, Machine Learning Engineers, and Machine Learning Operations. Write high quality code that has high test coverage, Lead code reviews to help improve code quality, help adopt best practice standards and guidelines.
Passion for Technology: Work closely with IT to Stay up-to-date with the latest trends and technologies in MLOps, LLMOps, machine learning, and artificial intelligence, and share your knowledge with the team to help us stay at the forefront of the field.
Technologies/Tools we use: Python, PySpark, SQL, R Studio, Databricks, Azure ADO, Azure AI/ML Data Factory, Azure (Virtual Machines, Azure Web Apps, Cloud Storage, Azure ML), Anaconda packages, Git, GitHub, GitHub Actions, Terraform, Artifactory, Airflow, Docker, Kubernetes, Linux/Windows VMs, Dynatrace, Slack for monitoring and alerting.
Qualifications
Technical Knowledge:
Minimum 7 years of hands-on experience implementing AI/ML solutions and platform tooling for Data Science. Expert in Spark SQL, PySpark, (Python and/or R programming language) which includes experience in libraries such as Pandas, scikit-learn, R (tidyverse, glm, caret etc…), MLFlow, Experimentation, Tracking, Productionizing and proficient in SQL.
5 or more years of professional experience in MLOps, Data Engineering, software engineering, or a related field.
Infrastructure Operations: Minimum 5 to 7+ years of hands-on experience in some combination of the following technologies: Azure (VMs, Web Apps, Managed Databases), GitHub Actions, Terraform, Packer, Airflow, Docker, Kubernetes, Linux/Windows VM administration, Shell scripting (primary Bash but PowerShell as well).
A solid understanding of modern security and networking principles and standards. Knowledge of best practices in software engineering is necessary.
Collaborative Spirit: Enjoys working in a team environment, with the ability to effectively communicate and collaborate with technical and non-technical team members alike.
Curiosity and Innovation: Possesses a profound curiosity about AI/ML and a strong desire to explore how to improve Machine Learning Models from Design to Deployment.
Education Qualifications: A foundational knowledge of Data Science is strongly preferred. Bachelor's or higher degree in Computer Science, Statistics, Mathematics, Data Science, and/or related quantitative degree is preferred from an accredited institution.
Compensation, Grade, and Job Title will be determined based on qualifications, experience, and technical skillset.
The position is eligible to participate in FM Global’s comprehensive Total Rewards program that includes an incentive plan, generous health and well-being programs, a 401(k) and pension plan, career development opportunities, tuition reimbursement, flexible work, paid time off allowances and much more.
FM Global is an Equal Opportunity Employer and is committed to attracting, developing, and retaining a diverse workforce.
#FMG
#LI-TA1
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Airflow Anaconda Architecture Azure Computer Science Databricks Data pipelines Docker Engineering Git GitHub Kubernetes Linux LLMOps Machine Learning Mathematics MLFlow ML models MLOps Model training Pandas Pipelines PySpark Python R Research Scikit-learn Security Shell scripting Spark SQL Statistics Terraform
Perks/benefits: Career development Flex hours Flex vacation Health care
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Business Intelligence Engineer jobs
- Open Data Science Intern jobs
- Open Data Engineer II jobs
- Open Lead Data Analyst jobs
- Open Data Science Manager jobs
- Open Senior Business Intelligence Analyst jobs
- Open Marketing Data Analyst jobs
- Open Junior Data Scientist jobs
- Open Data Scientist II jobs
- Open MLOps Engineer jobs
- Open Business Data Analyst jobs
- Open Business Intelligence Developer jobs
- Open Data Analytics Engineer jobs
- Open Product Data Analyst jobs
- Open Data Analyst Intern jobs
- Open Sr Data Engineer jobs
- Open Sr. Data Scientist jobs
- Open Principal Data Scientist jobs
- Open Research Scientist jobs
- Open Big Data Engineer jobs
- Open Senior Data Architect jobs
- Open Data Engineering Manager jobs
- Open Junior Data Engineer jobs
- Open Azure Data Engineer jobs
- Open Data Quality Analyst jobs
- Open GCP-related jobs
- Open Java-related jobs
- Open ML models-related jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open PhD-related jobs
- Open Deep Learning-related jobs
- Open Data visualization-related jobs
- Open NLP-related jobs
- Open Finance-related jobs
- Open PyTorch-related jobs
- Open TensorFlow-related jobs
- Open LLMs-related jobs
- Open APIs-related jobs
- Open Generative AI-related jobs
- Open Snowflake-related jobs
- Open CI/CD-related jobs
- Open Consulting-related jobs
- Open Hadoop-related jobs
- Open Kubernetes-related jobs
- Open Data governance-related jobs
- Open Airflow-related jobs
- Open Databricks-related jobs