OpenAI's Frontier Evals team is looking to push the boundaries of their frontier models in the finance domain by developing evaluations for financial reasoning and related capabilities.
Requirements
- Strong engineering and statistical analysis skills
Responsibilities
- Identify important model capabilities, skills, and behaviors that are crucial to financial workflows, and design methods to quantify performance in these areas
- Own and pursue a research agenda to identify an important model capability (especially as it relates to financial reasoning) and build evals to measure it
- Continuously refine evaluations of frontier AI models to assess the extent of frontier capabilities
Other
- Have prior background / domain expertise in finance, especially investment banking or private equity, and a passion for these problems
- Detail-oriented and thorough
- Team player / willing to do a variety of tasks to move the team forward
- Passionate and knowledgeable about AGI/ASI measurement
- Able to operate effectively in a dynamic and extremely fast-paced research environment as well as scope and deliver projects end-to-end
- An ability to work cross-functionally
- Excellent communication skills