Get Jobs Tailored to Your Resume

Filtr uses AI to scan 1000+ jobs and finds postings that perfectly matches your resume

Associate Director, Reinforcement Learning (ML)

Amgen

Salary not specified

Dec 16, 2025

Thousand Oaks, CA, US

Amgen is seeking to lead the strategy and execution for Reinforcement Learning from Human Feedback (RLHF) and related reinforcement learning approaches across R&D, medical, operations, and commercial use cases to better serve patients.

Requirements

Deep, hands-on expertise in Reinforcement Learning from Human Feedback (RLHF) and/or advanced reinforcement learning, including reward modeling, policy optimization, exploration strategies, and offline/online evaluation.
Demonstrated experience deploying RLHF or RL systems into production for real-world applications (e.g., large language models, recommendation systems, decision support tools, or workflow automation), ideally in healthcare, life sciences, or other regulated domains.
Strong background in modern machine learning and deep learning, with practical experience in Python and frameworks such as PyTorch or TensorFlow, and familiarity with LLM ecosystems and tooling.
Experience working with large-scale data and cloud ecosystems (e.g., Azure, Databricks, Snowflake, or similar), and partnering with data engineering or platform teams to build robust pipelines and experimentation platforms.
Demonstrated understanding of responsible AI, safety, and governance, especially in the context of RLHF and LLMs (e.g., bias, robustness, transparency, and guardrail design).
Familiarity with pharma/biotech, healthcare, or other regulated industries, including an understanding of compliance, privacy, and consent practices related to patient and HCP data.
Certifications on Reinforcement Learning (AWS AI, Azure AI Engineer, Google Cloud ML, etc.) are a plus.

Responsibilities

Lead the design and development of RLHF systems including reward modeling, policy optimization, safety and alignment mechanisms, and evaluation frameworks for large language models and other AI systems.
Drive hands-on technical execution, particularly for high-impact projects, reviewing architectures, experimentation plans, and code, and helping the team navigate scientific and engineering trade-offs.
Establish best-practice pipelines for human feedback, partnering closely with internal customer teams to define feedback protocols, annotation quality standards, and governance for RLHF data.
Define and track success metrics for RLHF systems, balancing offline and online evaluation, A/B tests, safety and robustness criteria, and business or scientific outcomes.
Collaborate across Amgen leaders to ensure RLHF solutions are aligned with strategy, compliant with policy, and integrated into real workflows.
Partner with Data, Platform and Technology teams to ensure that RLHF workloads are supported by scalable data platforms, model hosting, experimentation infrastructure, and MLOps best practices.
Champion responsible and compliant AI, working with Legal, Compliance, and Information Security to implement governance around human feedback, data usage, model behavior, transparency, and risk management in a regulated environment.

Other

Doctorate degree and 3 years of Computer Science, IT or related field experience
Master’s degree and 5 years of Computer Science, IT or related field experience
Bachelor’s degree and 7 years of Computer Science, IT or related field experience
Associate’s degree and 12 years of Computer Science, IT or related field experience
High school diploma / GED and 14 years of Computer Science, IT or related field experience