Get Jobs Tailored to Your Resume

Filtr uses AI to scan 1000+ jobs and finds postings that perfectly matches your resume

Associate Director, Reinforcement Learning (ML)

BioSpace

Salary not specified

Dec 20, 2025

Thousand Oaks, CA, US

Amgen is looking to solve real-world problems in the healthcare industry using Reinforcement Learning from Human Feedback (RLHF) and related reinforcement learning approaches to serve patients better and faster.

Requirements

Deep, hands-on expertise in Reinforcement Learning from Human Feedback (RLHF) and/or advanced reinforcement learning, including reward modeling, policy optimization, exploration strategies, and offline/online evaluation.
Demonstrated experience deploying RLHF or RL systems into production for real-world applications (e.g., large language models, recommendation systems, decision support tools, or workflow automation), ideally in healthcare, life sciences, or other regulated domains.
Strong background in modern machine learning and deep learning, with practical experience in Python and frameworks such as PyTorch or TensorFlow, and familiarity with LLM ecosystems and tooling.
Experience driving sophisticated, cross-functional initiatives, collaborating with non-technical stakeholders (e.g., physicians, scientists, commercial leaders, compliance, legal) and translating needs into impactful AI solutions.
Strong ability to communicate complex technical topics simply, tailoring content to senior executives and non-technical audiences; well-versed in data and model storytelling, including risks, assumptions, and limitations.
Experience working with large-scale data and cloud ecosystems (e.g., Azure, Databricks, Snowflake, or similar), and partnering with data engineering or platform teams to build robust pipelines and experimentation platforms.
Demonstrated understanding of responsible AI, safety, and governance, especially in the context of RLHF and LLMs (e.g., bias, robustness, transparency, and guardrail design).

Responsibilities

Lead the design and development of RLHF systemsincluding reward modeling, policy optimization, safety and alignment mechanisms, and evaluation frameworks for large language models and other AI systems.
Drive hands-on technical execution, particularly for high-impact projects, reviewing architectures, experimentation plans, and code, and helping the team navigate scientific and engineering trade-offs.
Establish best-practice pipelines for human feedback, partnering closely with internal customer teams to define feedback protocols, annotation quality standards, and governance for RLHF data.
Define and track success metricsfor RLHF systems, balancing offline and online evaluation, A/B tests, safety and robustness criteria, and business or scientific outcomes.
Collaborate across Amgenleaders to ensure RLHF solutions are aligned with strategy, compliant with policy, and integrated into real workflows.
Partner with Data, Platform and Technology teamsto ensure that RLHF workloads are supported by scalable data platforms, model hosting, experimentation infrastructure, and MLOps best practices.
Champion responsible and compliant AI, working with Legal, Compliance, and Information Security to implement governance around human feedback, data usage, model behavior, transparency, and risk management in a regulated environment.

Other

Doctorate degree and 3 years of Computer Science, IT or related field experience
Masters degree and 5 years of Computer Science, IT or related field experience
Bachelors degree and 7 years of Computer Science, IT or related field experience
Associates degree and 12 years of Computer Science, IT or related field experience
High school diploma / GED and 14 years of Computer Science, IT or related field experience