Ai2 is seeking talented and motivated Research Interns to join the FlexOlmo team and work on a series of large language models designed for flexible data use, with a focus on Mixture-of-Experts (MoE), long-context language models (LCLMs), and retrieval.
Requirements
- Pursuing a Ph.D. degree in Computer Science or a similar field, with research experience in machine learning, natural language processing, language and vision, or related areas
- Outstanding individual contributor (IC) skills, especially with deep learning frameworks (e.g. PyTorch)
- An outstanding publication record at AI-related venues, such as NeurIPS, ICLR, ICML, COLM, ACL, EMNLP
- Research experience in areas such as large language models, training dynamics, scaling laws, and data curation
- Experience with mixture-of-experts, long-context language models, and retrieval is preferred but not required
Responsibilities
- Define and lead a high-impact research project
- Train and release leading models
- Collaborate with and learn from team members across Ai2
- Build open-source software for the research community
- Author scientific papers for publication in a high-profile conference or journal
Other
- Must be able to remain in a stationary position for long periods of time
- The ability to communicate information and ideas so others will understand
- The ability to observe details at close range
- The ability to work under deadlines
- Located in, or willing to relocate to, Berkeley, CA