Periodic Labs is looking to post-train frontier models to autonomously run various parts of the scientific discovery pipeline, aiming to make novel scientific discoveries.
Requirements
- Creating and scaling RL environments for LLMs
- Creating high-quality evals for frontier models
- Working closely with domain experts to define evaluation criteria, tools, and environments for agents
- Carefully crafting training datasets and reward functions, with LLMs and/or human trainers
- Training frontier LLMs with RL
Responsibilities
- post-train frontier models to autonomously run various parts of the scientific discovery pipeline
- Models you train will generate hypotheses, design experiments that run in an actual lab, operate sophisticated scientific equipment, and more
- work with the world’s leading experts in the physical sciences in order to create high-quality evaluation and training tasks
- scale up RL environments
- design creative reward functions
- run large-scale RL runs
- automating scientific discovery
Other
- Team members are owners who identify and solve problems without boundaries or bureaucracy.
- We eagerly learn new tools and new science to push forward our mission.