Solutions Architect

Somerville, Massachusetts, United States

Apply now Apply later


About Neural Magic

Based in Somerville, Massachusetts, Neural Magic is a series A startup backed by leading investors including Andreessen Horowitz, NEA, NEA, Pillar, VMware, Verizon Ventures, Comcast Ventures, and Amdocs. At Neural Magic we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise on the planet. Neural Magic accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. As a leading developer and maintainer of the vLLM project and inventor of state-of-the-art techniques for model quantization and sparsification, Neural Magic provides a stable platform for enterprises to build, optimize and scale LLM deployments.

Our Mission

Neural Magic is on a mission to bring the power of open-source LLMs and vLLM to every enterprise on the planet.

Your Role

As a technical Solutions Architect at Neural Magic, you will be at the center of delivering technical value to our customers. Working closely with our sales and R&D teams, you will be responsible for owning all technical aspects of the sales cycle and post-sales, delivering solutions that align customer’s needs with the capabilities of our technology.

Qualified candidates will use their engineering and customer engagement skills to create proof of concepts and dynamic demos that spark the imagination of potential customers. You’ll regularly engage with customers in both pre and post sales capacities to evaluate and scope solutions, lead technical evaluations, assist with implementations, and address obstacles that might hinder a customer's success. Using your ability to identify challenges before they happen and quickly learn new technical concepts, you enable customers to understand and experience the value of Neural Magic's product.

This role is hands-on and requires a combination of a strategic solutions mindset, top notch communication skills, technical depth and ability to adapt to changing customer conditions. We are seeking an individual excited to work with state-of-the-art deep models, who will will work cross-functionally with our technical and customer teams to deliver value. If you are someone who wants to contribute to solving challenging technical problems at the forefront of machine learning, this is the role for you.

Responsibilities

  • Translate business needs into technical requirements, work with customers to design solutions that deliver measurable value.
  • Work closely with sales team to qualify opportunities based on fit with our solution
  • Work closely with our product and machine learning teams to provide valuable input into market and customer deployment requirements with vLLM
  • Support customer efforts related to model evaluation, fine-tuning, model optimization, performance benchmarking, and deployment reference architecture

Requirements

  • Work closely with customers to test and deploy their neural network models using industry leading frameworks and Neural Magic’s software
  • Support the design and implementation of solutions for our customers, and partner with Neural Magic’s engineering organization to create and deliver these solutions.
  • Be a trusted advisor and partner, providing analysis of deep learning approaches, helping to define and conduct pilot tests
  • Bachelors or Masters degree in computer science, engineering or another quantitative field, preferably with a focus on machine learning
  • Professional experience in the areas of developing and deploying enterprise software solutions and customer implementations.
  • Proficient with Tensorflow, Pytorch, and other machine learning frameworks
    • (preferred): experience with hugging face
    • (preferred): experience with kubernetes
  • Excellent communication skills, especially written and presentation skills with the ability to tailor technical information for different audiences.

Benefits

  • Competitive compensation and stock option plan
  • Comprehensive health care (medical, dental, vision)
  • Retirement plan (401k, IRA)
  • Generous paid time off (vacation, sick leave, holidays)
  • Family leave (maternity, paternity)
  • Disability coverage
  • Professional development opportunities
  • Flexible work arrangements (remote options)
  • Wellness resources
  • Free food and snacks (in the office)

Neural Magic is an equal-opportunity employer committed to fostering a diverse and inclusive workplace. All applicants will be considered for employment without attention to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status.

Apply now Apply later
  • Share this job via
  • or

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  1  0  0
Category: Architecture Jobs

Tags: Architecture Computer Science Deep Learning Engineering Generative AI Kubernetes LLMs Machine Learning Open Source PyTorch R R&D Spark TensorFlow vLLM

Perks/benefits: 401(k) matching Career development Competitive pay Equity / stock options Flex hours Flex vacation Health care Medical leave Parental leave Startup environment

Region: North America
Country: United States

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.