Get Jobs Tailored to Your Resume

Filtr uses AI to scan 1000+ jobs and finds postings that perfectly matches your resume

GenAI/LLM Systems Test Engineering Manager

Pearson

$165,000 - $170,000

Oct 29, 2025

Phoenix, AZ, US

Pearson is seeking a technical leader to solve the problem of ensuring accuracy, reliability, scalability, and trust in their GenAI-powered educational products, specifically the Personal Teaching Assistant (PTA), Personal Learning Assistant (PLA), and Open Educational Resource (OER) support.

Requirements

Strong technical expertise in AI/ML product testing, including GenAI workflows, model evaluation, and/or LLM-based solutions.
Hands-on experience with test automation frameworks, API/UI testing, and CI/CD pipelines.
Familiarity with LLM testing challenges (e.g., prompt variation, non-deterministic outputs, evaluation metrics, model drift).
Knowledge of Python or similar languages for building test tools or harnesses.
Experience in performance, scalability, and security testing of distributed systems.
Exposure to prompt engineering, bias testing, or AI ethics frameworks is a plus.
Experience with AI/ML systems, Large Language Models (LLMs), and modern test engineering practices.

Responsibilities

Define and drive AI Test Strategy: Create comprehensive test strategies specifically for GenAI products and LLM-based workflows, including model accuracy, bias, safety, and reliability validation.
Develop and maintain AI testing frameworks, including prompt testing and evaluation of model outputs
Hands-on Technical Leadership: Build and evolve test automation frameworks, pipelines, and evaluation harnesses for AI/ML models, APIs, and integrated systems.
AI-Specific Quality Validation: Design tests around prompt engineering, hallucination detection, model evaluation metrics, and edge-case scenarios.
System & Integration Testing: Validate end-to-end workflows, including multi-agent orchestration, MCP models, APIs, and UI interactions.
Work closely with Performance & Scalability Test team to ensure AI-driven systems perform consistently at scale across different use cases and data sets.
Coordinate testing activities across globally distributed QE teams

Other

6+ years of experience in Quality Engineering, with at least 3 years in a technical leadership or managerial role.
Bachelor’s degree in Computer Science, Engineering, or related field (advanced degree preferred).
Proven ability to work in cross-functional, Agile/Scrum environments
Excellent communication and leadership skills
Ability to work on-site