Anthropic is looking to improve the performance and capabilities of its AI system, Claude, on complex tasks and in agentic systems.
Requirements
- Significant ML and software engineering experience
 
- High-level familiarity with the architecture and operation of large language models
 
- Extensive prior experience exploring and testing language model behavior
 
- Experience prompting and/or building products with language models
 
- Experience developing complex agentic systems using LLMs
 
- Experience with large-scale RL on language models
 
- Experience with multi-agent systems
 
Responsibilities
- Fine-tune new capabilities into Claude that maximize Claude’s performance or ease of use on agentic tasks
 
- Ideate, develop, and compare the performance of different tools for agents
 
- Systematically discover and test prompt engineering best practices for agents
 
- Develop automated techniques for designing and evaluating agentic systems
 
- Assist with automated evaluation of Claude models and prompts across the training and product lifecycle
 
- Work with our product org to find solutions to our most vexing challenges in applying agents to our products
 
- Help create and optimize data mixes for model training
 
Other
- Good communication skills and an interest in working with other researchers on difficult tasks
 
- Passion for making powerful technology safe and societally beneficial
 
- Stay up to date by taking an active interest in emerging research and industry trends
 
- Enjoy pair programming
 
- Bachelor's degree in a related field or equivalent experience
 
- Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time