Anthropic is looking to create reliable, interpretable, and steerable AI systems that are safe and beneficial for users and society as a whole.
Requirements
- 3+ years of experience training or finetuning deep learning models
- Experience designing evals, building RL environments, or contributing to model training pipelines
- Strong technical aptitude to partner with engineers and researchers, including proficiency in at least one programming language (Python preferred)
- Recent experience building production systems with large language models
- Advanced degree in Computer Science, Machine Learning, Artificial Intelligence, Statistics, or a related technical field
- Experience in a customer- or client-facing role
- Ability to navigate and execute amidst ambiguity, and to flex into different domains based on the business problem at hand
Responsibilities
- Design and execute high-quality finetuning projects for critical customers, delivering customized AI solutions with exceptional reliability
- Partner with customers to identify domains where Claude should improve, then collaborate with Research teams to develop evals, RL environments, and training infrastructure that advance model capabilities
- Apply advanced machine learning skills to optimize finetuning strategies, design robust evaluation frameworks, and contribute to novel training approaches
- Collaborate closely with ML researchers to develop and implement cutting-edge finetuning techniques and model improvement methodologies
- Partner with account executives to understand customer requirements and translate them into both immediate solutions and longer-term research opportunities
- Serve as the primary technical advisor for customers on finetuning and model improvement projects, offering guidance on integration, deployment, and best practices
- Stay current with the latest advancements in AI, finetuning techniques, and reinforcement learning for large language models
Other
- At least a Bachelor's degree in a related field or equivalent experience
- Excellent communication and interpersonal skills, able to convey complicated topics in easily understandable terms to a diverse set of external and internal stakeholders
- Ability to travel occasionally to customer sites for workshops, research collaboration, and implementation support
- Location-based hybrid policy: currently, we expect all staff to be in one of our offices at least 25% of the time
- Visa sponsorship: we do sponsor visas, but we aren't able to successfully sponsor visas for every role and every candidate