Anthropic is looking to improve the performance and capabilities of their AI system, Claude, in handling complex tasks and agentic systems.
Requirements
- Significant ML and software engineering experience
- High level familiarity with the architecture and operation of large language models
- Extensive prior experience exploring and testing language model behavior
- Experience prompting and/or building products with language models
- Experience with developing complex agentic systems using LLMs
- Experience with large-scale RL on language models
- Experience with multi-agent systems
Responsibilities
- Finetune new capabilities into Claude that maximize Claude’s performance or ease of use on agentic tasks
- Ideate, develop, and compare the performance of different tools for agents
- Systematically discover and test prompt engineering best practices for agents
- Develop automated techniques for designing and evaluating agentic systems
- Assist with automated evaluation of Claude models and prompts across the training and product lifecycle
- Work with our product org to find solutions to our most vexing challenges applying agents to our products
- Help create and optimize data mixes for model training
Other
- Good communication skills and an interest in working with other researchers on difficult tasks
- Passion for making powerful technology safe and societally beneficial
- Stay up-to-date and informed by taking an active interest in emerging research and industry trends
- Enjoy pair programming
- Bachelor's degree in a related field or equivalent experience
- Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time