Senior Staff Research Scientist, On Device/Efficient AI/LLM

665 Clyde Avenue, Mountain View, CA, USA

Samsung Research America

For more than 70 years, Samsung has been at the forefront of innovation. Our discoveries, inventions and breakthrough products have helped shape the history of the digital revolution. We continue to expand our global reach and open new...

Lab Summary: The Samsung AI Center (SAIC) within Samsung Research America (SRA) is at the forefront of creating intelligent, interactive machines, building on an ecosystem that is user-centric rather than device-centric. The success of AI will depend on how well devices understand their users and how well they empower them. SAIC takes on grand scientific and engineering challenges in machine intelligence and actively contributes to the international research community through publications and presentations at major conferences and in journals, in research areas including computer vision, HCI, contextual/multi-modal modeling, NLP, speech recognition, dialogue, and machine and deep learning. SAIC is a key part of Samsung’s global R&D effort and aims to influence future Samsung products reaching hundreds of millions of users worldwide.

Position Summary: We are looking for candidates with a machine learning and deep learning background in the field of on-device and efficient AI. You will work with a team of research scientists and engineers tackling real-world problems in Samsung’s artificial intelligence initiatives, contributing to promising team projects alongside talented colleagues in a fun and creative environment. The AI research center is a key part of Samsung’s global R&D effort and aims to influence future Samsung products reaching hundreds of millions of users worldwide.

Position Responsibilities:

  • Design, develop, and implement novel, efficient architectures for large foundation models (e.g., LLM, LVM) for applications including language, vision, audio, and sensor data.
  • Develop and implement efficient model training algorithms, such as LoRA and DoRA.
  • Develop and implement efficient inference algorithms, such as speculative decoding and model parallelism.
  • Deploy and optimize efficient models on edge devices (e.g., Samsung phone GPU/NPU).
  • Generate creative solutions (patents) and publish research results in top conferences (papers).

Required Skills:

  • Ph.D. in CS, EE, or an equivalent combination of education, training, and experience.
  • 15+ years of experience in AI-related research.
  • Bilingual Korean proficiency is highly preferred.
  • Expertise in state-of-the-art model architectures, including Transformer, Mamba, and liquid neural networks.
  • Experience in efficient model training algorithms and model design, including LoRA, DoRA, FlashAttention, and FlashAttention-2.
  • Experience in efficient model inference algorithms, including speculative decoding, KV caching, etc.
  • Experience in model compression techniques (e.g., pruning, quantization, knowledge distillation, SVD).
  • Experience in developing and optimizing deep learning models (e.g., Transformer, Mamba, liquid neural networks) on edge devices (e.g., GPU/NPU).
  • Proficiency in on-device neural network libraries (e.g., llama.cpp, MLC LLM).
  • Experience in efficient hardware accelerator design for neural computing on mobile devices.
  • Proven track record of research and publications in machine learning and artificial intelligence (NeurIPS, ICLR, ICML, AAAI, IJCAI, CVPR, ACL, etc.).

Our total rewards programs are designed to motivate and engage exceptional talent. The base pay range for roles at this level is listed below, but may be higher or lower in other states due to geographic differentials in the labor market. Within the base pay range, individual rates depend on a number of factors, including the role’s function and location as well as the individual’s knowledge, skills, experience, education and training. This is part of our comprehensive compensation package with annual bonus eligibility and generous benefits to help you live life well.

Base Pay Range: $188,400 to $282,450 USD

Additional Information

Essential Job Functions

This position will be performed in an office setting. The position will require the incumbent to sit and stand at a desk, communicate in person and by telephone, and frequently operate standard office equipment, such as telephones and computers.

Samsung Research America is committed to complying with all Federal, State and local laws related to the employment of qualified individuals with disabilities. If you are an individual with a disability and would like to request a reasonable accommodation as part of the employment selection process, please contact the recruiter or email sratalent@samsung.com.

Affirmative Action / Equal Opportunity

Samsung Research America is an Affirmative Action and Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, disability, or status as a protected veteran.

For more information regarding protection from discrimination under Federal law for applicants and employees, please refer to the links below.

Know Your Rights  |  Pay Transparency

Region: North America
Country: United States
