Microsoft Research is looking for motivated Research Interns to tackle cutting-edge challenges at the intersection of distributed systems, AI systems, and software engineering to improve the reliability of large-scale cloud and AI systems.
Requirements
- Experience of building scalable and reliable systems.
- Ability to think unconventionally to derive creative and innovative solutions.
Responsibilities
- Dive into real-world systems : Work with large-scale codebases, configurations, and deployments powering Microsoft Azure and Office 365.
- Analyze production data : Discover how real cloud systems fail—and design strategies to prevent it.
- Push the boundaries : Apply cutting-edge LLM and Agentic technology to solve reliability challenges in cloud and AI systems.
- Innovate in failure diagnosis and prevention : Build novel tools for monitoring, logging, and troubleshooting at scale.
- Validate your ideas in the wild : Integrate and evaluate your solutions on real Microsoft services and incidents.
Other
- Currently enrolled in a PhD program in Computer Science or a related STEM field.
- Research Interns are expected to be physically located in their manager’s Microsoft worksite location for the duration of their internship.
- In addition to the qualifications below, you’ll need to submit a minimum of two reference letters for this position as well as a cover letter and any relevant work or research samples.
- Demonstrated ability to develop original research agenda.
- Ability to collaborate effectively with other researchers and product development teams.
- Proficient interpersonal skills, cross-group, and cross-culture collaboration.