Large Language Model Intern
San Mateo, CA
Brain Technologies
Brain organizes the world's software and make it natural to use.Brain is an early-stage startup with an ambitious vision to build exciting applications using Generative AI. The immense new power of Generative AI will make it so that everyone, not just technical programmers, can build and share exciting and interesting applications with the world. We have an incredible team, with investments led by Laurene Powell Jobs and Goodwater Capital.
As a Large Language Model Intern at Brain, you will build and/or fine-tune custom Large Language Models (LLMs) to supercharge the performance of our Imagica platform on specific application or other domains. Included in that process would be assembling and cleaning data, setting up relevant infrastructure, training and/or fine-tuning models, and evaluating performance on an appropriate data set.
How you’ll contribute to the vision:
- Help identify application or other domains where building custom LLMs would be an advantage over generic LLMs
- Collect data needed for training custom LLMs
- Clean and process the data for training/evaluations
- Train and/or find-tune custom LLMs
- Evaluate the performance of custom LLMs between each other and with respect to established generic LLMs
- Communicate findings and results to team
- Package results into a form readily usable in the Imagica platform
We’re excited about you because you:
- Are genuinely excited about the vast possibilities of Generative AI
- Want to be part of creating useful and innovative user-facing applications with Generative AI
- Experience with training and/or fine-tuning small to medium-sized (1B to 20B parameters) LLMs
- Familiar with several current open source LLMs
- Familiar with HuggingFace and other similar environments
- Experience with creating, processing, and cleaning data sets for model training/tuning/evaluation
- Experience with running evaluation metrics with held-out eval sets
- Are currently pursuing or recently completed a B.S., M.S., or Ph.D in computer science or related field
- You are fearlessly self-driven, yet enjoy helping others meet our goals.
- You enjoy solving problems and helping others meet our goals.
We know you work hard! Below are some benefits with working at Brain:
- We are a small team. You get to see many aspects of building a great product!
- We are also a growing team. We have plans to offer a lot more.
- Be a part of something massive and influence outcomes for our business.
- The ability to make, own and carry out decisions.
What to expect during our interview selection process:
- A written portion where you will discuss what you find exciting about interning at Brain and ideas you have for Imagica and how custom LLMs can help
- A face to face (remote) session where we will review your background, past projects, discuss your written exercise, and talk about how you would improve Generative AI and Imagica in particular with custom LLMs
About Brain:
Brain Technologies is based in San Mateo, California and was founded by serial entrepreneur Jerry Yue in December 2015 after founding multiple successful tech companies in Beijing including billion dollar company Benlai. Brain currently hosts its growing team in San Mateo and Chengdu, China. If you want to build great applications with Generative AI, we want to hear from you!
Tags: Computer Science Generative AI HuggingFace LLMs Model training Open Source
Perks/benefits: Startup environment
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Data Science Manager jobs
- Open Lead Data Analyst jobs
- Open MLOps Engineer jobs
- Open Data Manager jobs
- Open Senior Business Intelligence Analyst jobs
- Open Principal Data Engineer jobs
- Open Data Engineer II jobs
- Open Power BI Developer jobs
- Open Sr Data Engineer jobs
- Open Data Scientist II jobs
- Open Data Analytics Engineer jobs
- Open Product Data Analyst jobs
- Open Business Intelligence Developer jobs
- Open Junior Data Scientist jobs
- Open Business Data Analyst jobs
- Open Sr. Data Scientist jobs
- Open Data Analyst Intern jobs
- Open Senior Data Architect jobs
- Open Big Data Engineer jobs
- Open Principal Data Scientist jobs
- Open Junior Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Data Quality Analyst jobs
- Open Azure Data Engineer jobs
- Open Data Product Manager jobs
- Open Data quality-related jobs
- Open GCP-related jobs
- Open Business Intelligence-related jobs
- Open Java-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open Data visualization-related jobs
- Open Finance-related jobs
- Open Deep Learning-related jobs
- Open PhD-related jobs
- Open PyTorch-related jobs
- Open APIs-related jobs
- Open TensorFlow-related jobs
- Open NLP-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open CI/CD-related jobs
- Open LLMs-related jobs
- Open Generative AI-related jobs
- Open Kubernetes-related jobs
- Open Data governance-related jobs
- Open Hadoop-related jobs
- Open Airflow-related jobs
- Open DevOps-related jobs