Microsoft 365 Copilot is a groundbreaking productivity tool that leverages the power of large language models, user data, Microsoft Graph, and the web to drive unparalleled creativity and productivity. Our team in Microsoft Search, Assistant, and Intelligence (MSAI) designs and operates the central infrastructure enabling Copilot experiences across Teams, Outlook, Word, PowerPoint, and more. You’ll work on systems that scale to millions of users and deliver AI-driven capabilities that redefine how people work every day.
Requirements
- 2+ years of technical engineering experience coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python.
- 2+ years of experience building distributed, near real-time, high-load systems.
- 2+ years of experience translating requirements into feature implementations.
- Familiarity with modern software design patterns (microservices, containers, caching, queuing).
- Experience with at least one of the following: CUDA kernels, CPU/GPU performance optimization, network latency, or managing large-scale capacity fleets.
Responsibilities
- Design, implement, and optimize core services that make Copilot fast, reliable, and intelligent.
- Work on complex problems in GPU capacity management, LLM inference, and AI efficiency at scale.
- Build distributed systems, improve inference performance, and ensure resiliency for millions of users.
- Work with stakeholders to determine user requirements for a set of features.
- Contribute to design documents and identify dependencies for product areas with minimal oversight.
- Implement and maintain code for services and features, reusing components where applicable.
- Stay current with emerging technologies and patterns to improve reliability, efficiency, and performance at scale.
Other
- Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role.
- Microsoft Cloud Background Check
- Embody our Culture and Values
- Break down larger work items into smaller tasks and provide accurate estimates.
- Act as a Designated Responsible Individual (DRI) during on-call rotations to monitor and restore services for simple issues.