Senior Full Stack Engineer, Deep Learning Algorithms

UK, Remote

NVIDIA

NVIDIA erfindet den Grafikprozessor und fördert Fortschritte in den Bereichen KI, HPC, Gaming, kreatives Design, autonome Fahrzeuge und Robotik.

View company page

Join the team building software which will be used by the entire world of AI. Work with high class software engineers to implement a large scale toolset that tests deep learning models and frameworks on the most powerful computers. Strong social skills are necessary alongside the capacity to thrive in a fast-paced, multifaceted setting. In this role you will be interacting with internal partners, users, and members of the open source community to implement solutions for building, testing, integrating, and releasing of NVIDIA AI Services and Deep Learning Frameworks on the most powerful, enterprise-grade GPU clusters capable of hundreds of Peta FLOPS. This role spans multiple products such as PyTorch, TensorFlow, JAX, PaddlePaddle. You will work with internal engineering teams to deploy and operationalize AI models and services at scale by driving adoption for end-to-end Machine Learning and Deep Learning solutions in the cloud and on prem.

We are seeking passionate and python developers to help us scale our AI and deep learning services, platforms, models and internal tools. You will be responsible for implementing and maintaining tools, and infrastructure that enable our teams to productize NVIDIA SW stack:  DL Frameworks, DL models, AI services and other tools and technologies. Are you ready for this challenge?

What you’ll be doing:

  • Automating and optimizing testing of Deep Learning models and AI Services from different data domains with focus on inference

  • Developing shared utilities for setting up systems, running tests, recording results and visualization on dashboards.

  • Configuring, maintaining, and building solutions on industry-standard tools (e.g. GitLab, Docker, ElasticSearch etc)

  • Lead best-practices for building, testing, and releasing software including AI Services and DL models

  • Identifying infrastructure needs and translating them into action

  • Building tools for automatic content generation mechanisms that saves dozens of engineering hours

What we need to see:

  • BSc or MS degree in Computer Science, Computer Architecture or related technical field

  • 5+ years of work experience in software development

  • Excellent Python programming skills, and a deep understanding of OOP concepts.

  • Experience in building both back-end services (e.g Flask, FastAPI, Django) services and front-end (e.g. JS, React, Vue, Dash, Streamlit)

  • Understanding of Deep Learning on the level that allows benchmarking DL models 

  • Familiarity with DevOps concepts such as CI/CD, Docker, Jenkins, and automation tools.

  • Strong time-management and organization skills for coordinating multiple initiatives and priorities. 

  • Ability to  utilize new technologies and products in very complex projects.

  • Good communication and documentation habits

Ways to stand out from the crowd:

  • Deep understanding of containerization technologies such as Docker

  • Experience in building monitoring or dashboarding solutions.

  • Hands-on in configuring complex CI pipelines

  • Experience with HPC based compute clusters and scheduling solutions like Slurm or familiarity with orchestration tools such as Kubernetes

  • Hands-on experience working in cloud environment

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most brilliant and forward-thinking people in the world working for us. If you're creative and autonomous, we want to hear from you! We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

#deeplearning
Apply now Apply later
  • Share this job via
  • or

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Tags: Architecture CI/CD Computer Science Deep Learning DevOps Django Docker Elasticsearch Engineering FastAPI Flask GitLab GPU HPC JAX Kubernetes Machine Learning OOP Open Source Pipelines Python PyTorch React Streamlit TensorFlow Testing Vue

Region: Remote/Anywhere
Job stats:  14  3  0

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.