Microsoft Digital (MSD) is looking to reimagine and transform end-user productivity across Microsoft's global workforce by driving AI transformation and shaping an AI-first organization. The Principal Software Engineer will lead the design and development of the core network management and operations architecture for one of the largest enterprise networks in the world, infusing AI-driven acceleration into every layer of the technology stack and business processes to ensure 99.99% availability, reliability, and performance for mission-critical systems.
Requirements
- 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C, Java, JavaScript, or Python
- 5+ years experience architecting, building, and operating large-scale infrastructure and network management systems, with hands-on expertise in leveraging Infrastructure as Code (IaC) and Network as Code (NaC) principles to automate, orchestrate and optimize network operations at enterprise scale.
- Experienced designing resilient, secure, and highly available network architectures, and implementing automated deployment and configuration management pipelines using tools such as Terraform, Ansible, Azure Resource Manager, AWS CloudFormation, etc.
- Experienced building AI-driven agentic solutions from conception to scale, including work with the latest AI technologies such as large language models (LLMs), generative AI, multi-agent orchestration, MCP and platforms like Azure AI Foundry, GitHub Copilot, Claude, etc.
- Experienced with AI concepts, architecture, and acting as a change agent driving adoption of new technologies and practices across teams.
- Experienced driving operational excellence through AI agentic and code-driven approaches to monitoring, observability, and incident response.
- Experienced delivering solutions to support mission-critical workloads, ensuring compliance, scalability, and reliability in complex, multi-cloud or hybrid environments.
Responsibilities
- Lead the design and development of the core network management and operations architecture for one of the largest enterprise networks in the world.
- Architect and deliver a seamless framework that empowers frontier firm execution for networking solutions, supporting employees, engineers, program managers, and sourcing engineers at a truly global scale.
- Ensuring 99.99% availability, reliability, and performance for mission-critical systems.
- Leveraging existing and industry-leading systems and agentic AI, this role is at the forefront of reimagining how to infuse AI-driven acceleration into every layer of the technology stack and business processes, driving transformative change across the organization.
- Partners with appropriate stakeholders to determine user requirements for a set of scenarios.
- Leads identification of dependencies and the development of design documents for a product, application, service, or platform.
- Defines system-level architecture and works across stakeholders to drive alignment and adoption.
Other
- This position is located at the Redmond campus, with 3 days per week work in the office and 2 days per week work from home.
- Relocation assistance is available.
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
- Ability to innovate at a platform level and drive integrated, end-to-end solutions through the shipping process.
- Skilled at cross divisional relationship building and ability to collaborate with partners; able to pull together technology and expertise across M365, Windows, and Azure to drive clarity across teams, agree on dependencies and common goals, and establish long-lasting, trusted relationships with partner teams.