Senior ML Engineer, AI/ML Platform & Data
San Jose
Full Time Senior-level / Expert USD 170K - 325K
Our Company
Changing the world through digital experiences is what Adobe’s all about. We give everyone—from emerging artists to global brands—everything they need to design and deliver exceptional digital experiences! We’re passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies interact with customers across every screen.
We’re on a mission to hire the very best and are committed to creating exceptional employee experiences where everyone is respected and has access to equal opportunity. We realize that new ideas can come from everywhere in the organization, and we know the next big idea could be yours!
Our Company
Changing the world through digital experiences is what Adobe’s all about. We provide everyone—from emerging artists to global brands—the tools they need to craft and deliver exceptional digital experiences! We’re passionate about empowering people to build beautiful and powerful images, videos, and apps, and transform how companies interact with customers across every medium.
We’re on a mission to hire the very best and are committed to creating exceptional employee experiences where everyone is respected and has access to equal opportunity. We realize that new insights can come from anywhere in the organization, and we know the next big idea could be yours!
The Opportunity
Firefly is the new family of creative generative AI models coming to Adobe products that offers a new way to conceptualize, build, and scale content. It’s a natural extension of the technology Adobe has produced over the past 40 years.
At the core of Firefly are our commercially safe AI models trained on hundreds of millions of images owned or licensed by Adobe. We are hiring for a highly strategic and visible role to help evolve these models. This is an opportunity to reach millions of creatives, helping them reinvent the way they work.
What you will Do
Design, develop, and maintain robust AI/ML infrastructure solutions to support the training and deployment of large-scale AI models, using Kubernetes and Python on AWS cloud
Implement and optimize distributed training frameworks leveraging GPUs to improve performance and scalability. Improve resiliency, elasticity, data loading and provide out-of-the-box support for GPU optimization methods such as FP8, FSDP and model parallelism
Write high quality, product level code that is easy to maintain and test following standard methodologies
Collaborate closely with ML Researchers and Machine Learning Engineers to accelerate the training of the cutting-edge ML models
Keep track of the latest innovation in academia and open-source community to implement rapid adoption of pioneering technologies to improve the performance of the ML platform
Help train better models by improving orchestration and scheduling, scaling the number of jobs, faster experimentation with AutoML and similar
Collaborate with data scientists and ML researchers to streamline the model training pipeline and ensuring efficient resource utilization
Drive innovation in infrastructure practices to support pioneering machine learning research and development
What you need to succeed
PhD or Master’s in computer science or related field and 5+ years relevant industry experience.
Proven proficiency with Python and developing systems, frameworks and SDKs
Experience with infrastructure and understanding of model serving, training, orchestration, and management of GPU resources
Experience with machine learning and distributed Pytorch
Strong critical thinking, analytical and quantitative problem-solving ability
Excellent communication, relationship skills and a strong teammate
While not required, it’s an added plus if you also have:
Experience with KubeFlow, MLFlow, Ray, SageMaker, or similar
Experience with Nvidia HPC
Experience with Pytorch distributed, MPI, Megatron, Horovod and other AI training frameworks
#FireflyGenAI
Our compensation reflects the cost of labor across several U.S. geographic markets, and we pay differently based on those defined markets. The U.S. pay range for this position is $170,900 -- $325,200 annually. Pay within this range varies by work location and may also depend on job-related knowledge, skills, and experience. Your recruiter can share more about the specific salary range for the job location during the hiring process.At Adobe, for sales roles starting salaries are expressed as total target compensation (TTC = base + commission), and short-term incentives are in the form of sales commission plans. Non-sales roles starting salaries are expressed as base salary and short-term incentives are in the form of the Annual Incentive Plan (AIP).
In addition, certain roles may be eligible for long-term incentives in the form of a new hire equity award.
Adobe will consider qualified applicants with arrest or conviction records for employment in accordance with state and local laws and “fair chance” ordinances.Adobe is proud to be an Equal Employment Opportunity and affirmative action employer. We do not discriminate based on gender, race or color, ethnicity or national origin, age, disability, religion, sexual orientation, gender identity or expression, veteran status, or any other applicable characteristics protected by law. Learn more.
Adobe aims to make Adobe.com accessible to any and all users. If you have a disability or special need that requires accommodation to navigate our website or complete the application process, email accommodations@adobe.com or call (408) 536-3015.
Adobe values a free and open marketplace for all employees and has policies in place to ensure that we do not enter into illegal agreements with other companies to not recruit or hire each other’s employees.
Tags: AWS Computer Science FSDP Generative AI GPU Horovod HPC Kubeflow Kubernetes Machine Learning MLFlow ML infrastructure ML models Model training Open Source PhD Python PyTorch Research SageMaker
Perks/benefits: Equity / stock options
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Research Scientist jobs
- Open Data Science Manager jobs
- Open Data Engineer II jobs
- Open Principal Data Scientist jobs
- Open Business Data Analyst jobs
- Open Data Scientist II jobs
- Open BI Analyst jobs
- Open Sr Data Engineer jobs
- Open Business Intelligence Engineer jobs
- Open Lead Data Analyst jobs
- Open Sr. Data Scientist jobs
- Open Data Science Intern jobs
- Open Senior Business Intelligence Analyst jobs
- Open Software Engineer, Machine Learning jobs
- Open Junior Data Scientist jobs
- Open MLOps Engineer jobs
- Open Azure Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Data Analytics Engineer jobs
- Open Marketing Data Analyst jobs
- Open Data Engineer III jobs
- Open Junior Data Engineer jobs
- Open Data Analyst II jobs
- Open Data Engineering Manager jobs
- Open ETL Developer jobs
- Open Data quality-related jobs
- Open Tableau-related jobs
- Open Privacy-related jobs
- Open Excel-related jobs
- Open ML models-related jobs
- Open Data pipelines-related jobs
- Open APIs-related jobs
- Open PhD-related jobs
- Open PyTorch-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open LLMs-related jobs
- Open Consulting-related jobs
- Open TensorFlow-related jobs
- Open Deep Learning-related jobs
- Open Business Intelligence-related jobs
- Open Generative AI-related jobs
- Open CI/CD-related jobs
- Open NLP-related jobs
- Open Data governance-related jobs
- Open DevOps-related jobs
- Open Kubernetes-related jobs
- Open Git-related jobs
- Open Hadoop-related jobs
- Open Docker-related jobs