Microsoft Teams is seeking a Principal Software Engineer - Platform Architect to design and build its next-generation programmability and API interfaces, focusing on AI & LLM technologies, to enable Independent Software Vendors (ISVs) and enterprise developers to introduce intelligent AI agents within Teams and shape the infrastructure for secure LLM-driven experiences at hyperscale.
Requirements
- 10+ years of experience focused on advanced software development and architecture, building commercial-grade, high-availability services.
- 10+ years of experience driving software platform architecture projects from both a technical and functional perspective, guiding complex projects from planning through execution and service operations.
- Experience in LLM infrastructure and paradigms: Agent-to-agent communication protocols and orchestration, Model Context Protocol, Integration of third-party tools, APIs, and data sources into LLM pipelines.
- Secure execution of LLMs in cloud environments, multi-tenant isolation, throttle control.
- coding in languages including, but not limited to, C, C++, C-Sharp, Java, JavaScript, or Python
- coding in languages including, but not limited to, C-Sharp, C++, or Rust
Responsibilities
- Define long-term technical strategy: Set vision and direction for Teams API and AI platform initiatives, including agent-to-agent protocols, composable LLM pipelines, and developer extensibility.
- Architect complex, scalable systems: Lead design for end-to-end scenarios involving distributed microservices, multi-tenant routing, LLM orchestration, tool integration, and secure API endpoints.
- Build hyperscale LLM-enabled infrastructure: Craft systems to support developer-deployed AI agents, multi-step workflows, data connectors, and runtime MCP patterns across global Teams environments.
- Design APIs & SDKs for AI applications: Define developer-facing components that allow ISVs to deploy, configure, and manage LLM-powered agents and tools within Teams.
- Drive engineering culture: Champion scalability, reliability, observability, security, operational excellence, and customer obsession.
- Hands-on leadership: Develop, debug, optimize, and maintain platform features end-to-end—from coding to deployment and operational excellence.
- Operational responsibility: Troubleshoot complex infrastructure issues at scale, proactively reduce incidents, and maintain oncall readiness.
Other
- 3 days / week in-office
- Travel 25-50%
- Ability to meet Microsoft, customer and/or government security screening requirements are required for this role.
- This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
- Collaborate broadly: Partner with leadership, cross-functional teams (Research, Security, UX), and ecosystem stakeholders to deliver unified, strategic outcomes.