Advance the state of the art in agentic model capabilities: create models and agents that can reliably perform tasks across digital systems on behalf of humans. These agents combine automation, reasoning, and interaction capabilities to execute workflows end-to-end, in both text-based environments (CLI tools, APIs, scripts, MCPs) and visual environments (GUI applications).
Requirements
- At least 1 year of experience with deep learning and large language model training.
- Proven expertise in language model pre-training, post-training, or reinforcement learning.
- Proven publication record at top-tier conferences.
- Demonstrated ability to develop original research ideas and carry out hands-on research in a collaborative, dynamic environment.
Responsibilities
- Developing reinforcement learning approaches for improving logical and mathematical reasoning, tool use, and computer-use agents
- Developing novel training algorithms that improve the efficiency and reliability of reasoning and action-taking
- Exploring synthetic environment creation, multi-agent training and self-play for RL training
- Exploring scaling laws between test-time and training-time compute
- Developing advanced optimization techniques for efficient training of large-scale models
- Improving foundational models that cover a variety of data modalities (language, vision, multi-modal, and structured data) and modern model architectures
- Collaborating with other Research Interns and researchers, presenting findings, and contributing to the vibrant life of the community
Other
- Accepted to or currently enrolled in a PhD program in Computer Science or a related STEM field.
- Research Interns are expected to be physically present at their manager’s Microsoft worksite location for the duration of their internship.
- In addition to the qualifications above, you’ll need to submit at least two reference letters for this position, as well as a cover letter and any relevant work or research samples.
- During the 12-week internship, Research Interns are paired with mentors and expected to collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community.
- Microsoft is an equal opportunity employer.