Anthropic is looking to create reliable, interpretable, and steerable AI systems, and the Horizons team is playing a critical role in advancing these AI systems, specifically in the area of reinforcement learning (RL) for secure coding, vulnerability remediation, and other areas of defensive cybersecurity.
Requirements
- Experience in cybersecurity research
- Experience with machine learning
- Strong software engineering skills
- Familiarity with RL techniques and environments
- Familiarity with LLM training methodologies
- Professional experience in security engineering, fuzzing, detection and response, or other applied defensive work
- Experience participating in or building CTF competitions and cyber ranges
Responsibilities
- Designing and implementing RL environments
- Conducting experiments and evaluations
- Delivering work into production training runs
- Collaborating with other researchers, engineers, and cybersecurity specialists across and outside Anthropic
Other
- At least a Bachelor's degree in a related field or equivalent experience
- Location-based hybrid policy: currently, we expect all staff to be in one of our offices at least 25% of the time
- Visa sponsorship: we do sponsor visas, but we aren't able to successfully sponsor visas for every role and every candidate
- Strong communication skills
- Ability to balance research exploration with engineering implementation