Senior Research Engineer for Large Models
London, UK
DeepMind
Artificial intelligence could be one of humanity’s most useful inventions. We research and build safe artificial intelligence systems. We're committed to solving intelligence, to advance science and benefit humanity.At DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunity regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy, or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know.
Snapshot
We are looking to further grow our large scale machine learning expertise to accelerate projects across the breadth of DeepMind's research program. We are increasingly focused on problems that require crafting and building research infrastructure at the largest of scales!
About Us
We’re a dedicated scientific community, committed to “solving intelligence” and ensuring our technology is used for widespread public benefit. We’ve built a supportive and inclusive environment where collaboration is encouraged and learning is shared freely. We don’t set limits based on what others think is possible or impossible. We drive ourselves and inspire each other to push boundaries and achieve ambitious goals.
The role
In this role, our Senior Research Engineers work on cutting-edge research problems at the largest scales—including projects like Chinchilla (a compute-optimal language model), Sparrow (a human-aligned dialog agent using search), Flamingo (a visual language model operating on multiple modalities) and more.
You'll be building machine learning infrastructure used to make these models scalable, performant, robust, and reusable—enabling future research breakthroughs by continuously using upstream pre-trained model artifacts applied in new and novel ways.
In the research domain, we are interested in approaches such as quantisation, sharding regimes, transformer optimisation and others that make large models more performant and less resource intensive. Engineering challenges we solve help build scalable and reusable tech stacks and APIs for model use across multiple research groups.
You'll work alongside world-class research efforts that are constantly pushing the boundaries in the fields of large models, heterogeneous compute, distributed computation on accelerators, and large model RL training—to name a few.
Your colleagues will be Software and Research Engineers with a diverse set of backgrounds working to accelerate DeepMind's mission and research goals. Our team's solid fundamentals across both engineering and research makes us well suited to make the use of large models maximally permissive internally.
About you
In order to set you up for success as a Senior Research Engineer at DeepMind, we look for the following skills and experience:
- Candidates should have an in-depth knowledge in at least one of the following areas, and will gain familiarity with all of the below through on-the-job learning.
- Training and using large models (>10 billion parameters)
- Transformer architecture
- Using HW accelerators (GPU / TPU)
- Distributed ML system optimisation
- Excellent knowledge of either C++ or Python
In addition, the following would be an advantage:
- Experience implementing, evaluating, and fine-tuning ML algorithms
- Knowledge of Reinforcement Learning
Competitive salary applies
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: APIs Architecture Engineering GPU Machine Learning ML infrastructure Python Reinforcement Learning Research
Perks/benefits: Career development Competitive pay
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open MLOps Engineer jobs
- Open Data Science Manager jobs
- Open Lead Data Analyst jobs
- Open Data Manager jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Engineer II jobs
- Open Power BI Developer jobs
- Open Sr Data Engineer jobs
- Open Principal Data Engineer jobs
- Open Business Intelligence Developer jobs
- Open Junior Data Scientist jobs
- Open Data Analytics Engineer jobs
- Open Product Data Analyst jobs
- Open Data Scientist II jobs
- Open Sr. Data Scientist jobs
- Open Senior Data Architect jobs
- Open Business Data Analyst jobs
- Open Data Analyst Intern jobs
- Open Big Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Azure Data Engineer jobs
- Open Junior Data Engineer jobs
- Open Data Product Manager jobs
- Open Data Quality Analyst jobs
- Open Principal Data Scientist jobs
- Open GCP-related jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open ML models-related jobs
- Open Java-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open Deep Learning-related jobs
- Open PhD-related jobs
- Open APIs-related jobs
- Open TensorFlow-related jobs
- Open PyTorch-related jobs
- Open NLP-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open CI/CD-related jobs
- Open LLMs-related jobs
- Open Kubernetes-related jobs
- Open Generative AI-related jobs
- Open Data governance-related jobs
- Open Hadoop-related jobs
- Open Airflow-related jobs
- Open Docker-related jobs