Software Engineer, Machine Learning Systems

Palo Alto

Lamini

Lamini makes it easy for every enterprise and developer to build customized, private LLM models with superior fine-tuning and generative AI technologies. As the leader in generative AI, Lamini enables faster and higher-performing fine-tuned...

View company page

Lamini AI is at the forefront of bringing LLMs to production. We are on a mission to help every company unlock the power of generative AI by putting its own data to work. Our team is made up of highly committed engineers, researchers, and tech industry veterans. We’re backed by leading VCs as well as computing and technology companies.
We are looking for a machine learning systems expert enthusiastic about engaging with all facets of the ML system stack. We’re looking for someone who is eager to traverse the entire ML system stack, iterate fast on building new ML cloud systems, and is hungry to build and own enormous contributions.

About the role:

  • ML System engineers in our team are responsible for one or more of the following
  • Deployment and management of high-performing compute clusters.
  • Enhancing inference and training performance through optimizations across the system stack, encompassing high-level mechanisms such as queuing and scheduling, medium-level optimizations within inference and training engines, and low-level optimizations targeting GPU kernel efficiency.

Qualifications:

  • Experience building and rapidly prototyping production cloud-based software
  • Demonstrated fluency with data structures, algorithms, architecture, and agile software best practices in any language
  • Experience in Python and C++/Rust
  • Understanding of the latest technologies in LLMs, like LoRa, Mamba, etc.
  • Understanding or willingness to learn about the entire system stack
  • Desire to work in an inclusive and collaborative environment
  • An interest in continually learning from others, teaching others, and digging into new challenges

Nice to have:

  • Desire to create speed of light training and inference systems for next-generation AI
  • Deep technology expertise in machine learning systems, e.g. TinyML, Triton, CUDA, ROCm, Exo, MLIR, Halide, etc
We believe in hiring passionate individuals who believe in the AI revolution to make software accessible to all. If you’re excited about this role but are not sure if your past experience aligns perfectly, we still encourage you to apply and meet with us. 
At Lamini AI, we are committed to providing an environment of mutual respect where equal employment opportunities are available to all applicants without regard to race, color, religion, sex, pregnancy (including childbirth, lactation and related medical conditions), national origin, age, physical and mental disability, marital status, sexual orientation, gender identity, gender expression, genetic information (including characteristics and testing), military and veteran status, and any other characteristic protected by applicable law. Lamini AI believes that diversity and inclusion among our employees is critical to our success as a company, and we seek to recruit, develop and retain the most talented people from a diverse candidate pool. Selection for employment is decided on the basis of qualifications, merit, and business need.
Apply now Apply later
  • Share this job via
  • or

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Tags: Agile Architecture CUDA Generative AI GPU LLMs LoRA Machine Learning Prototyping Python Rust Teaching Testing

Region: North America
Country: United States
Job stats:  7  1  0

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.