GenAI/LLM Systems Test Engineering Manager

Pearson

$165,000 - $170,000
Oct 29, 2025
Phoenix, AZ, US

Pearson is seeking a technical leader to ensure accuracy, reliability, scalability, and trust in its GenAI-powered educational products, specifically the Personal Teaching Assistant (PTA), Personal Learning Assistant (PLA), and Open Educational Resource (OER) support.

Requirements

  • Strong technical expertise in AI/ML product testing, including GenAI workflows, model evaluation, and/or LLM-based solutions.
  • Hands-on experience with test automation frameworks, API/UI testing, and CI/CD pipelines.
  • Familiarity with LLM testing challenges (e.g., prompt variation, non-deterministic outputs, evaluation metrics, model drift).
  • Knowledge of Python or similar languages for building test tools or harnesses.
  • Experience in performance, scalability, and security testing of distributed systems.
  • Exposure to prompt engineering, bias testing, or AI ethics frameworks is a plus.
  • Experience with AI/ML systems, Large Language Models (LLMs), and modern test engineering practices.

Responsibilities

  • Define and drive AI Test Strategy: Create comprehensive test strategies specifically for GenAI products and LLM-based workflows, including model accuracy, bias, safety, and reliability validation.
  • Develop and maintain AI testing frameworks, including prompt testing and evaluation of model outputs.
  • Hands-on Technical Leadership: Build and evolve test automation frameworks, pipelines, and evaluation harnesses for AI/ML models, APIs, and integrated systems.
  • AI-Specific Quality Validation: Design tests around prompt engineering, hallucination detection, model evaluation metrics, and edge-case scenarios.
  • System & Integration Testing: Validate end-to-end workflows, including multi-agent orchestration, MCP models, APIs, and UI interactions.
  • Work closely with the Performance & Scalability Test team to ensure AI-driven systems perform consistently at scale across different use cases and data sets.
  • Coordinate testing activities across globally distributed QE teams.

Other

  • 6+ years of experience in Quality Engineering, with at least 3 years in a technical leadership or managerial role.
  • Bachelor’s degree in Computer Science, Engineering, or related field (advanced degree preferred).
  • Proven ability to work in cross-functional, Agile/Scrum environments.
  • Excellent communication and leadership skills.
  • Ability to work on-site.