Microsoft is looking to pioneer advancements in Artificial Intelligence (AI) and Systems, driving the transfer of innovative technologies into their products, establishing Microsoft’s leadership in technical domains and enhancing community engagement. This role specifically aims to improve the reliability and efficiency of cloud systems through research.
Requirements
- Experience with distributed systems, cloud infrastructure and software development.
- Published in one of the following venues: OSDI, SOSP, ASPLOS, NSDI, EuroSys, ATC, SoCC, MLSys, ICSE, FSE, WebConf, etc.
Responsibilities
- improving reliability and observability of Agentic Systems
- workload-aware placement of compute resources
- holistic characterization and modelling of workloads
- fault injection for improving workload reliability
- mining dependency graphs for web-scale systems
Other
- Currently enrolled or accepted in a PhD program in Computer Science, Software Engineering, Electrical Engineering, or a related STEM field.
- At least 1 year of experience in conducting research and authoring peer-reviewed publications.
- Research Interns are expected to be physically located in their manager’s Microsoft worksite location for the duration of their internship.
- In addition to the qualifications below, you’ll need to submit a minimum of two reference letters for this position as well as a cover letter and any relevant work or research samples.
- Proficient written and verbal communication skills.
- Able to work in a cross-functional and multi-disciplinary setting across research and product.