Principal Machine Learning Platform Engineer
Beaverton, OR
Applications have closed
Lucid Motors
With extraordinary design, performance, range, convenience, and utility, Lucid Gravity is the future of sustainable mobility, reimagining the luxury electric SUV.Principal Machine Learning Platform Engineer is responsible to define and lead the Data and Machine learning Architecture, Performance, and Scalability of lucid ML operations, for both Vehicle and Operational data, ingesting, processing, and storing Trillions of rows of data per day. This hands-on role helps solve real big data problems, which most of the standard tools on the market are not capable of handling. You will be designing solutions, writing codes and automation, defining standards, and establish best practices across the company.
The Role
- Build a world class ML platform for the Lucid engineering and data teams to use, supporting all aspects of future car design
- Design the next generation of ML architecture that will power large-scale data science projects
- Lead the design and implementation of ML training and inference frameworks
- Design Lucid’s deployment strategy for pipelines using open source technologies (e.g. Kubernestes, MLFlow, Spark, Airflow) and modern software design principles to promote modularity, repeatability, scalability, and fault tolerance
- Design and implement robust, automated, production-level software and machine learning components using horizontally scalable components
- Integrate effectively with cross-functional teams of engineers, product managers, and domain experts to promote company productivity
- Present project metrics and complex ML concepts to both technical and non-technical audiences to inform timely decision-making
- Deep understanding of data system design and experience handling large data sets
- Apply and manage industry best practice tools and processes such as Data Lake, Delta Lake, S3, Spark ETL, Airflow, Hive Catalog, Ranger, Redshift, Spline, Kafka, MQTT, Timeseries Database, Cassandra, Redis, Presto, Kubernetes, Docker, CI/CD, DevOps
- Contribute to the overall architecture, implementation and ongoing maintenance of our codebase
- Assist infrastructure team to optimize the performance and scale our data ingestion and processing infrastructure to server ever-increasing volume.
- Translate big data and analytics requirements into data models that will operate at a large scale and high performance and guide the data analytics engineers on these data models.
- Articulate direction and focus in areas of high ambiguity
- Mentor junior team members
Qualifications
- M.S. or PhD degree in Computer Science, or a closely related engineering discipline.
- 10+ years of hands-on experience in ML pipeline, ETL, and data modeling to implement large-scale systems for decision support or other time-sensitive applications
- 5+ years of hands-on experience in productionizing and deploying ML platforms and applications and high velocity big data applications;
- 5+ years working with: RDBMS/SQL, distributed data stores, NoSQL databases, time-series databases, Spark streaming, publish/subscribe, columnar data formats
- Extensive experience in creating production level, highly concurrent, multithreaded or distributed ML models for training, validation, and inference leveraging real-time systems
- Extensive experience with Kubernetes-based ML training and deployment architectures using technologies such as KubeFlow, Seldon, and KServe.
- Experience with heterogeneous computing and GPU programming.
- Experience processing high velocity distributed vehicle telemetry or other Internet-of-Things sensor data, both centrally and at the edge
- Expert proficiency in Spark, Kafka, Presto, Airflow, or similar data streaming/processing technologies.
- Proficiency with machine/deep learning frameworks such as TensorFlow, Keras, Pytorch, Caffe, MXNet, etc
- Proven hands-on experience building microservice-based solutions to implement large-scale data infrastructures and ML pipelines
- Strong knowledge and understanding of machine learning pipelines from standardization, normalization, clustering, modeling, scoring, validation
- Understanding of ETL engineering and tools so you can interface with data integration teams
Notice regarding COVID-19 protocols At Lucid, we prioritize the health and wellbeing of our employees, families, and friends above all else. In response to the novel Coronavirus all new Lucid employees, whose job will be based in the United States may or may not be required to provide original documentation confirming status as having received the prescribed inoculation (doses). Vaccination requirements are dependent upon location and position, please refer to the job description for more details. Individuals in positions requiring vaccinations may seek a medical and/or religious exemption from this requirement and may be granted such an accommodation after submitting a formal request to and the subsequent review and approval thereof by our dedicated Covid-19 Response team. To all recruitment agencies: Lucid Motors does not accept agency resumes. Please do not forward resumes to our careers alias or other Lucid Motors employees. Lucid Motors is not responsible for any fees related to unsolicited resumes.
Tags: Airflow Architecture Big Data Caffe Cassandra CI/CD Clustering Computer Science Data Analytics Deep Learning DevOps Docker Engineering ETL GPU Kafka Keras Kubeflow Kubernetes Machine Learning MLFlow ML models MXNet NoSQL Open Source PhD Pipelines PyTorch RDBMS Redshift Spark SQL Streaming TensorFlow
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Data Science Manager jobs
- Open MLOps Engineer jobs
- Open AI Engineer jobs
- Open Senior Business Intelligence Analyst jobs
- Open Sr Data Engineer jobs
- Open Data Engineer II jobs
- Open Data Manager jobs
- Open Principal Data Engineer jobs
- Open Power BI Developer jobs
- Open Data Analytics Engineer jobs
- Open Product Data Analyst jobs
- Open Junior Data Scientist jobs
- Open Senior Data Architect jobs
- Open Data Scientist II jobs
- Open Business Intelligence Developer jobs
- Open Sr. Data Scientist jobs
- Open Manager, Data Engineering jobs
- Open Data Analyst Intern jobs
- Open Data Quality Analyst jobs
- Open Big Data Engineer jobs
- Open Business Data Analyst jobs
- Open Principal Data Scientist jobs
- Open Junior Data Engineer jobs
- Open ETL Developer jobs
- Open Data Product Manager jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open GCP-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open Java-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open APIs-related jobs
- Open Deep Learning-related jobs
- Open PyTorch-related jobs
- Open Consulting-related jobs
- Open TensorFlow-related jobs
- Open Snowflake-related jobs
- Open PhD-related jobs
- Open NLP-related jobs
- Open CI/CD-related jobs
- Open Kubernetes-related jobs
- Open Airflow-related jobs
- Open Data governance-related jobs
- Open Databricks-related jobs
- Open Hadoop-related jobs
- Open LLMs-related jobs
- Open Data warehouse-related jobs