Mercor is looking to support high-impact research collaborations with leading AI labs by improving AI systems through work extending coding benchmarks that reflect real-world development across diverse languages and domains
Requirements
- Strong proficiency in Go
- Experience with debugging, testing, and validating code
Responsibilities
- Develop and validate coding benchmarks in Go by curating issues, solutions, and test suites from real-world repositories
- Ensure benchmark tasks include comprehensive unit and integration tests for solution verification
- Maintain consistency and scalability of benchmark task distribution
- Provide structured feedback on solution quality and clarity
- Debug, optimize, and document benchmark code for reliability and reproducibility
Other
- Degree in Software Engineering, Computer Science, or a related field
- 3-10 years of experience as a backend software engineer, ML engineer, or applied data scientist
- Comfortable with technical writing and attention to detail
- Independent contractor
- Part-time (15-20 hours/week) commitment