Software Engineer - Developer Experience

ExecutivePlacements.com

Salary not specified
Nov 21, 2025
San Francisco, CA, United States of America

The company is building a gamified developer platform to generate high-fidelity datasets for advancing LLM technology. The role is responsible for the technical lifecycle of data pipelines, from defining new data formats to shipping the necessary tooling, environments, documentation, and quality assurance processes to enable these formats at scale.

Requirements

  • Foundational full-stack skills, with experience in React and at least one modern backend language (e.g., Python, Node.js, Go).
  • Experience designing or running evaluations for LLM outputs to measure and track quality, accuracy, or other performance metrics.
  • Familiarity with building tools for other developers, such as CLIs, SDKs, or internal dashboards.
  • Experience with cloud infrastructure (AWS), Docker, and CI/CD pipelines.
  • Ship tooling, sandboxes, CLIs/SDKs, and capture/instrumentation to make contribution flows fast and safe.
  • Implement automated checks, eval harnesses, reviewer workflows, and data quality bars.
  • Design schemas, metadata, and versioning for new task/trajectory formats.

Responsibilities

  • Own projects end-to-end, from initial prototyping to ongoing maintenance, bug fixing, and iteration based on feedback.
  • Own developer experience pipelines end-to-end: prototype tooling for collecting new data formats → productionize the workflow → iterate based on developer experience.
  • Champion DX: Create clear, concise guidelines and documentation to empower our data contributors and ensure high-quality inputs for your projects.
  • Quality & governance: Develop and manage the quality standards for your projects, which includes training and aligning content reviewers to ensure data consistency and accuracy.
  • Implement automated checks, eval harnesses, reviewer workflows, and data quality bars; be hands-on and in the weeds to align with reviewers on standards.
  • Maintain & iterate: Monitor, debug, and continuously improve reliability, latency, and contributor success rates.
  • Define frontier data formats: Co-author specs/RFCs with frontier lab researchers; design schemas, metadata, and versioning for new task/trajectory formats.

Other

  • Excellent written communication skills, with a proven ability to explain complex concepts to a less technical audience.
  • An organized and process-oriented mindset: you enjoy bringing structure to ambiguous problems and are meticulous about quality.
  • Strong technical judgment and a pragmatic mindset: you know how to balance speed with quality, recognizing when a scrappy solution is enough and when to invest in a robust architecture.
  • A deep resourcefulness with AI: you are highly adept at prompt engineering and using AI tools to find the fastest path to a solution.
  • Curiosity, pride in your work, and a desire to push the frontier.