Job Board
LogoLogo

Get Jobs Tailored to Your Resume

Filtr uses AI to scan 1000+ jobs and finds postings that perfectly matches your resume

Jump - Advisor AI Logo

QA Engineer for Generative AI

Jump - Advisor AI

$75,000 - $90,000
Dec 3, 2025
Salt Lake City, UT, US
Apply Now

Jump is looking for a QA engineer to improve their AI/ML systems by coordinating and running data labeling/annotation campaigns, evaluating production system outputs, and ensuring customers receive accurate transcripts, summaries, and action items.

Requirements

  • Curiosity and aptitude to learn ML/AI evaluation (prompt testing, golden sets, offline evals, safety/guardrails)
  • Familiarity with AI prompts, LLMs, and the Jump product (as a user or employee)
  • Comfortable reading software system logs and finding patterns in messy data
  • Experience with BigQuery or other data warehouses
  • Experience with web API testing
  • Basic familiarity with query languages, relational databases, and other data storage systems
  • Designed and executed AI evaluation workflows (golden datasets, human-in-the-loop scoring, clear rubrics)

Responsibilities

  • Serve as the embedded QA engineer on two pods (Jump’s cross-functional teams), collaborating with product managers to evaluate AI outputs, run exploratory and regression testing, and unblock engineers and PMs.
  • Learn and track AI/ML quality signals, including golden datasets, prompt/regression suites, and metrics such as WER, diarization accuracy, action-item precision/recall, summary faithfulness, hallucination rate, and PII handling.
  • Build dashboards for quality KPIs (defect escape rate, flake rate, regression coverage, MTTD/MTTR, AI eval scores) and drive continuous improvement.
  • Partner with Product and Engineering to ensure requirements are testable, edge cases are captured, and AI evaluation rubrics are clear and repeatable.
  • Foster a no-drama, direct-and-kind culture that moves with high-quality velocity.
  • Designed and executed AI evaluation workflows (golden datasets, human-in-the-loop scoring, clear rubrics)—a plus but not required; candidates with this experience may be considered for higher compensation
  • Created risk-based test plans and lightweight automation that caught regressions early

Other

  • 3+ years in QA or Quality Engineering for SaaS products
  • Strong exploratory testing skills and clear, concise written communication for reproducing issues
  • You don’t need a traditional STEM background to excel here. You’ll thrive if you Get excited about spotting patterns
  • Have a strong grasp of human language and thought processes
  • You might have a background in Editing, Technical writing