Large-scale language models are evolving into powerful engines of scientific discovery, but they need high-quality training data to democratize education, keep pace with research, and streamline software development. In this role, you will use your Python expertise to create that training data and power the next generation of AI.
Requirements
- Python coding expertise
- Fluency in Python
- Expertise in algorithms
- Expertise in data structures
- Expertise in software architecture
- Expertise in frontend development
- Expertise in backend development
- Expertise in cloud infrastructure
- Expertise in systems programming
- Experience with asynchronous programming
- Experience with RESTful API integration
- Experience with memory management
- Experience with object-oriented design
- Experience with secure coding practices
- Experience with debugging distributed systems
Responsibilities
- Challenge advanced language models on topics such as asynchronous programming, RESTful API integration, memory management, object-oriented design, secure coding practices, and debugging distributed systems
- Document every failure mode so we can harden model reasoning
- Converse with the model on software engineering tasks and technical scenarios using Python
- Verify logical accuracy and coding fluency
- Assess code quality and clarity
- Capture reproducible error traces
- Suggest improvements to prompt engineering and evaluation metrics
Other
- A bachelor's, master's, or PhD in computer science, software engineering, or a closely related technical field is ideal
- Real-world Python experience, technical writing, or open-source contributions signal fit
- Clear, metacognitive communication—"showing your work"—is essential
- As a contractor, you'll supply a secure computer and high-speed internet; company-sponsored benefits such as health insurance and PTO do not apply