Mercor is seeking experienced Python Engineers to support high-impact research collaborations with leading AI laboratories by developing and validating coding benchmarks that mirror real-world development scenarios.
Requirements
- Strong proficiency in the Python programming language.
- Demonstrated experience with debugging, testing, and validating code to ensure reliability and accuracy.
Responsibilities
- Develop and validate coding benchmarks in Python by curating issues, solutions, and comprehensive test suites sourced from real-world repositories.
- Design and implement thorough unit and integration tests to verify the correctness of solutions and benchmark tasks.
- Maintain consistency and scalability across the distribution of benchmark tasks to support diverse research needs.
- Provide structured and constructive feedback on solution quality, clarity, and usability to facilitate continuous improvement.
- Debug, optimize, and document benchmark code to ensure robustness, reproducibility, and ease of use for research teams.
- Collaborate effectively with research teams and other stakeholders to align benchmarks with project objectives and standards.
Other Qualifications
- 3-10 years of professional experience as a backend software engineer, machine learning engineer, or applied data scientist.
- Degree in Software Engineering, Computer Science, or a related technical field.
- Excellent technical writing skills with meticulous attention to detail.
- Ability to work independently in a remote, asynchronous environment with flexible hours.