Software Engineer - Generative AI, AGI Inference Engine
Boston, Massachusetts, USA
Full Time Mid-level / Intermediate USD 115K - 223K
Amazon.com
Free shipping on millions of items. Get the best of Shopping and Entertainment with Prime. Enjoy low prices and great deals on the largest selection of everyday essentials and other products, including fashion, home, beauty, electronics, Alexa...
Are you interested in advancing Amazon's Generative AI capabilities? Come work with a talented team of engineers and scientists in a highly collaborative and friendly team. We are building state-of-the-art Generative AI technology that will benefit all Amazon businesses and customers.
Key job responsibilities
As a Software Development Engineer, you will be responsible for designing, developing, testing, and deploying high performance inference capabilities, including but not limited to multi-modality, SOTA model architectures, latency, throughput, and cost. You will collaborate closely with a team of engineers and scientists to influence our overall strategy, and define the team’s roadmap. You will drive system architecture, spearhead best practices, and mentor junior engineers.
A day in the life
You will read papers and consult with scientists to get inspiration of emerging techniques, and blend those into our roadmap; You will design and experiment with new algorithms, benchmark the latency and accuracy of your implementations; Most importantly you will implement production grade solutions, and see them through the deployments swiftly; You may need to collaborate with other science and engineering teams to get things done properly; You will hold highest bar in operational excellence and support production systems, and constantly create solutions to minimize the ops load.
About the team
Our mission is to build best-in-class, fast, accurate, and cost-efficient large language model inference solutions and infrastructure that will enable Amazon businesses to deliver more value to their customers.
We are open to hiring candidates to work out of one of the following locations:
Boston, MA, USA | New York, NY, USA
- 3+ years of non-internship professional software development experience
- 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- Experience programming with at least one software programming language
- Bachelor's degree in computer science or equivalent
- Experience with Python, PyTorch, and C++ programming and performance optimization
- Experience with Large Language Model inference
- Experience with Trainium and Inferentia Development
- Experience with GPU programming (TensorRT-LLM)
Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation, please visit https://www.amazon.jobs/en/disability/us.
Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $115,000/year in our lowest geographic market up to $223,600/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit https://www.aboutamazon.com/workplace/employee-benefits. This position will remain posted until filled. Applicants should apply via our internal or external career site.
Key job responsibilities
As a Software Development Engineer, you will be responsible for designing, developing, testing, and deploying high performance inference capabilities, including but not limited to multi-modality, SOTA model architectures, latency, throughput, and cost. You will collaborate closely with a team of engineers and scientists to influence our overall strategy, and define the team’s roadmap. You will drive system architecture, spearhead best practices, and mentor junior engineers.
A day in the life
You will read papers and consult with scientists to get inspiration of emerging techniques, and blend those into our roadmap; You will design and experiment with new algorithms, benchmark the latency and accuracy of your implementations; Most importantly you will implement production grade solutions, and see them through the deployments swiftly; You may need to collaborate with other science and engineering teams to get things done properly; You will hold highest bar in operational excellence and support production systems, and constantly create solutions to minimize the ops load.
About the team
Our mission is to build best-in-class, fast, accurate, and cost-efficient large language model inference solutions and infrastructure that will enable Amazon businesses to deliver more value to their customers.
We are open to hiring candidates to work out of one of the following locations:
Boston, MA, USA | New York, NY, USA
Basic Qualifications
- 3+ years of non-internship professional software development experience
- 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- Experience programming with at least one software programming language
Preferred Qualifications
- 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience- Bachelor's degree in computer science or equivalent
- Experience with Python, PyTorch, and C++ programming and performance optimization
- Experience with Large Language Model inference
- Experience with Trainium and Inferentia Development
- Experience with GPU programming (TensorRT-LLM)
Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation, please visit https://www.amazon.jobs/en/disability/us.
Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $115,000/year in our lowest geographic market up to $223,600/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit https://www.aboutamazon.com/workplace/employee-benefits. This position will remain posted until filled. Applicants should apply via our internal or external career site.
Tags: AGI Architecture Computer Science Engineering Generative AI GPU LLMs Model inference Python PyTorch SDLC TensorRT Testing
Perks/benefits: Career development Equity
Region:
North America
Country:
United States
Job stats:
13
1
0
Categories:
Deep Learning Jobs
Engineering Jobs
Generative AI Jobs
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Data Science Manager jobs
- Open Marketing Data Analyst jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Engineer II jobs
- Open MLOps Engineer jobs
- Open Data Manager jobs
- Open Principal Data Engineer jobs
- Open Power BI Developer jobs
- Open Data Scientist II jobs
- Open Business Intelligence Developer jobs
- Open Junior Data Scientist jobs
- Open Sr Data Engineer jobs
- Open Business Data Analyst jobs
- Open Data Analytics Engineer jobs
- Open Data Analyst Intern jobs
- Open Product Data Analyst jobs
- Open Sr. Data Scientist jobs
- Open Senior Data Architect jobs
- Open Big Data Engineer jobs
- Open Principal Data Scientist jobs
- Open Data Quality Analyst jobs
- Open Manager, Data Engineering jobs
- Open Research Scientist jobs
- Open Azure Data Engineer jobs
- Open Junior Data Engineer jobs
- Open Data quality-related jobs
- Open GCP-related jobs
- Open Java-related jobs
- Open Business Intelligence-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open PhD-related jobs
- Open Deep Learning-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open PyTorch-related jobs
- Open TensorFlow-related jobs
- Open APIs-related jobs
- Open NLP-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open LLMs-related jobs
- Open CI/CD-related jobs
- Open Generative AI-related jobs
- Open Kubernetes-related jobs
- Open Hadoop-related jobs
- Open Data governance-related jobs
- Open Airflow-related jobs
- Open Databricks-related jobs