Advancing model benchmarking, evaluation, and real-world performance analysis in AI research at a cutting-edge level
Requirements
- Strong grasp of ML experimentation, benchmarking, and evaluation methodologies
- PhD or 5+ years of experience in Machine Learning, AI Research, or related fields
Responsibilities
- Design and compile complex ML tasks inspired by real-world applications.
- Validate implementations, assess reproducibility, and identify performance gaps.
- Provide structured, high-quality feedback on model behavior and results.
Other
- PhD or 5+ years of experience in Machine Learning, AI Research, or related fields
- Excellent analytical, research, and communication skills
- Ability to work independently in a remote, flexible environment
- Fully remote, flexible schedule (20–40 hrs/week, extendable)