Mercor connects elite creative and technical talent with leading AI research labs. In this role, you will collaborate with an AI research lab to benchmark and improve AI model capabilities.
Requirements
- Strong online research and analytical skills
- Ability to synthesize insights from diverse sources and data sets
Responsibilities
- Design prompts and evaluation sets for large language models (LLMs).
- Design and review consulting-style prompts, structured answers, and evaluation criteria.
- Benchmark AI-generated responses against consulting frameworks and real-world standards.
- Provide structured feedback on logic, clarity, and business rigor.
- Conduct online research and synthesize insights from diverse sources to support evaluation.
- Collaborate with AI research teams to refine model outputs and training data.
Other
- 2+ years of experience at McKinsey, Bain, BCG, or a similarly competitive consulting firm
- Excellent written communication and attention to detail
- Independent Contractor, Remote
- Flexible schedule, 20+ hours/week; duration: 4 weeks