Senior Research Engineer, AllenNLP

Seattle, WA

The Allen Institute for AI

AI2, founded by Paul Allen and led by Ali Farhadi, conducts high-impact research and engineering to tackle key problems in artificial intelligence.

View company page

Remote hybrid: Persons in these roles are expected to spend part of their time on-site in our Seattle offices and may occasionally work remotely from their home in the Greater Seattle area. On-site requirements vary based on position and team. If you have questions about Hybrid work arrangements for this role, please ask your recruiter.
Compensation Range: $153,040 - $235,680

Senior Research Engineer, AllenNLP

Who We Are: 

Open Language Model (OLMo) is the AI2 LLM framework designed to provide access to data, training code, models, and evaluation code necessary to advance AI through open research to empower academics and researchers to study the science of language models collectively. The goal is to share a high quality and open language model that will provide an avenue for people in the AI research community to work directly on language models for the first time. Following the launch of OLMo (Open Language Model), AI2 is now embarking on furthering research to provide scientific insights and solutions on how to substantially close the gap with the state-of-the-art in adapted models.

Who You Are: 

You are a talented, hands-on engineer who thrives in a fast-paced environment, is self-directed, a team player, and knows how to get things done. You have a deep knowledge of Python, and a strong understanding of modern deep learning, natural language processing, and the inner workings of the transformer architecture. You can translate high-level goals into concrete research and implementation steps, set an approach, follow through, and present results. When it’s time to explain your ideas, you bring clarity to complex technical issues. You use these skills to create real-world benefits for researchers and other practitioners, and you are excited to help advance our effort to create the best-performing open large language model.

Your Next Challenge: 

You will be a part of the core team of research and machine learning engineers working on the infrastructure, architecture, modeling and training of OLMo (Open Language Model). In this role you will be owning the design and implementation of the code that trains the OLMo models. You will be responsible for building scalable machine learning pipelines as we push the boundaries of large language modeling research. You will be collaborating with colleagues inside and outside your own team, but you are responsible for a feature or experiment from start to finish, from conception to implementation.

The essential functions include, but are not limited to the following:

  • Building infrastructure to facilitate the next generation of LLM research
  • Optimizing training and inference for language models
  • Triaging between experiments and executing on the most impactful
  • Supporting and collaborating with an open-source community
  • Bridging the gap between cutting-edge research and a widely adopted product
  • Bringing software engineering best practices to a research environment
  • Releasing your contributions back to the broader community in the form of open source software, model releases, and additions to AI2’s public API and Open Research Corpus

What You’ll Need: 

  • Expertise at building ML infrastructure - having 4+ years of industry experiences building infrastructure that handles data preprocessing/transformation and model training, evaluation, and deployment
  • Deep experience in the complete model development cycle, including data set construction, training, tuning, evaluation, performance profiling, and monitoring
  • Knowledge of modern deep learning and natural language processing techniques
  • Strong software engineering skills, particularly around building performant systems and debugging
  • At-home with on-hands programming – must have experience with Python and PyTorch/Jax/Tensorflow. We expect you to be the kind of engineer who can pick up a new programming language, library, or API as needed without it being a big deal.
  • Familiarity working with cloud compute resources (e.g. AWS) and containerization (e.g. Docker)
  • Strong collaboration and communication skills - our environment is small and collaborative, and we'd like you to thrive while working closely with others

Bonus qualifications:

  • Advanced degree in Data Science/CS/EE/Applied Mathematics/Statistics/ML/NLP or related fields and/or relevant and equivalent engineering experience
  • Contributions to open-source ML or research libraries (e.g. spaCy, AllenNLP, transformers)
  • Experience successfully operating models at scale in a production setting
  • Experience in HPC settings

Education:

  • BS or MS in Computer Science, Statistics, Engineering, Applied Mathematics, or a related quantitative field

Physical Demands and Work Environment:

The physical demands described here are representative of those that must be met by a team member to successfully perform the essential functions of this position. Reasonable accommodations may be made to enable individuals with disabilities to perform the functions.

  • Must be able to remain in a stationary position for long periods of time.
  • The ability to communicate information and ideas so others will understand. Must be able to exchange accurate information in these situations.
  • The ability to observe details at close range.
  • Can work under deadlines.

A Little More About AI2:

The Allen Institute for Artificial Intelligence is a non-profit research institute in Seattle founded by Paul Allen. The core mission of AI2 is to contribute to humanity through high-impact research in artificial intelligence. 

In addition to AI2’s core mission, we also aim to contribute to humanity through our treatment of each member of the AI2 Team. Some highlights are:

  • We are a learning organization – because everything AI2 does is ground-breaking, we are learning every day. Similarly, through weekly AI2 Academy lectures, a wide variety of world-class AI experts as guest speakers, and our commitment to your personal on-going education, AI2 is a place where you will have opportunities to continue learning alongside your coworkers. 
  • We value diversity - We seek to hire, support, and promote people from all genders, ethnicities, and all levels of experience regardless of age. We particularly encourage applications from women, non-binary individuals, people of color, members of the LGBTQA+ community, and people with disabilities of any kind. 
  • We value inclusion - We understand the value that people's individual experiences and perspectives can bring to an organization, and we are building a culture in which all voices are heard, respected and considered.
  • We emphasize a healthy work/life balance – we believe our team members are happiest and most productive when their work/life balance is optimized. While we value powerful research results which drive our mission forward, we also value dinner with family, weekend time, and vacation time. We offer generous paid vacation and sick leave as well as family leave.
  • We are collaborative and transparent – we consider ourselves a team, all moving with a common purpose. We are quick to cheer our successes, and even quicker to share and jointly problem solve our failures.
  • We are in Seattle – and our office is on the water! We have mountains, we have lakes, we have four seasons, we bike to work, we have a vibrant theater scene, and we have so much else. We even have kayaks for you to paddle right outside our front door. We welcome interest from applicants from outside of the United States.
  • We are friendly– chances are you will like every one of the 200+ people who work here. We do. 

AI2 is proud to be an Equal Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. You may view the related Know Your Rights compliance poster and the Pay Transparency Nondiscrimination Provision by clicking on their corresponding links. 

This employer participates in E-Verify and will provide the federal government with your Form I-9 information to confirm that you are authorized to work in the U.S. If E-Verify cannot confirm that you are authorized to work, this employer is required to give you written instructions and an opportunity to contact the Department of Homeland Security (DHS) or Social Security Administration (SSA) so you can begin to resolve the issue before the employer can take any action against you, including terminating your employment. Employers can only use E-Verify once you have accepted a job offer and completed the Form I-9.

We are committed to providing reasonable accommodations to employees and applicants with disabilities to the full extent required by the Americans with Disabilities Act (ADA). If you feel you need a reasonable accommodation pursuant to the ADA, you are encouraged to contact us at recruiting@allenai.org 

Benefits: 

  • Team members and their families are covered by medical, dental, vision, basic life insurance, basic accidental death and dismemberment insurance, short-term disability, long-term disability, and an employee assistance program. 
  • Team members are able to enroll in our voluntary life insurance program, our voluntary accidental death and dismemberment program, our health savings account plan, our healthcare reimbursement arrangement plan, and our health care and dependent care flexible spending account plans. 
  • Team members are able to enroll in our company’s 401k plan. 
  • Team members will receive $125 per month to assist with commuting or internet expenses and will also receive $200 per month for fitness and wellbeing expenses.
  • Team members will also receive up to ten sick days per year, up to seven personal days per year, up to 20 vacation days per year and twelve paid holidays throughout the calendar year.
  • Team members will be able to receive annual bonuses and can participate in the long-term incentive plan.

 

 

 

 

 

Note: This job description in no way states or implies that these are the only duties to be performed by the team members(s) of this position. Team members will be required to follow any other job-related instructions and to perform any other job-related duties requested by any person authorized to give instructions or assignments. All duties and responsibilities are essential functions and requirements and are subject to possible modification to reasonably accommodate individuals with disabilities. To perform this job successfully, the team member(s) will possess the skills, aptitudes, and abilities to perform each duty proficiently. Some requirements may exclude individuals who pose a direct threat or significant risk to the health or safety of themselves or others. The requirements listed in this document are the minimum levels of knowledge, skills, or abilities. This document does not create an employment contract, implied or otherwise, other than an at will relationship.

Apply now Apply later
  • Share this job via
  • or

Tags: APIs Architecture AWS Computer Science Deep Learning Docker Engineering HPC JAX LLMs Machine Learning Mathematics ML infrastructure ML models Model training NLP Open Source Pipelines Python PyTorch Research Security spaCy Statistics TensorFlow Transformers

Perks/benefits: 401(k) matching Career development Fitness / gym Flex hours Flexible spending account Flex vacation Health care Home office stipend Insurance Medical leave Salary bonus Startup environment Transparency

Region: North America
Country: United States
Job stats:  14  1  0

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.