Job Board
LogoLogo

Get Jobs Tailored to Your Resume

Filtr uses AI to scan 1000+ jobs and finds postings that perfectly matches your resume

Anthropic Logo

Research Engineer, Reward Models

Anthropic

$315,000 - $510,000
Aug 13, 2025
San Francisco, CA, US
Apply Now

The company is looking to develop sophisticated techniques for teaching AI systems to understand and embody human values, as well as to push forward AI capabilities.

Requirements

  • Strong engineering background in machine learning, with demonstrable expertise in preference learning, reinforcement learning, deep learning, or related areas
  • Proficient in Python, deep learning frameworks, and distributed computing
  • Familiar with modern LLM architectures and alignment techniques
  • Experience with improving model training pipelines and building data pipelines
  • Proficiency in Python and experience with deep learning frameworks is required for this role
  • Experience with reward models is not required, but experience with LLMs or other large models is a significant plus

Responsibilities

  • Help implement novel reward modeling architectures and techniques
  • Optimize training pipelines
  • Build and optimize data pipelines
  • Collaborate across teams to integrate reward modeling advances into production systems
  • Communicate engineering progress through internal documentation and potential publications

Other

  • At least a Bachelor's degree in a related field or equivalent experience
  • Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time
  • Visa sponsorship: We do sponsor visas, but we aren't able to successfully sponsor visas for every role and every candidate
  • Clear communication of complex technical concepts and research findings
  • Deep interest in AI alignment and safety
  • Strong communication skills