Job Board
LogoLogo

Get Jobs Tailored to Your Resume

Filtr uses AI to scan 1000+ jobs and finds postings that perfectly matches your resume

Microsoft Logo

Principal Software Engineer - AI Driven Configuration & Experimentation Platform

Microsoft

$139,900 - $274,800
Oct 27, 2025
Redmond, WA, US
Apply Now

Microsoft's ECS platform needs to evolve beyond core experimentation to include next-generation platforms for change inventory intelligence and AI-powered RCA agents, aiming to redefine how engineers troubleshoot, learn from incidents, and continuously improve service reliability across M365, Copilot, and Azure.

Requirements

  • 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C-Sharp, Java, JavaScript, or Python
  • 5+ years experience in designing and building large-scale distributed system, developer platforms, or ML powered backend services.
  • 3+ years deep technical focus in one or more of the following areas: Context modeling and embedding systems (e.g., code understanding, semantic retrieval, telemetry correlation).
  • 3+ years deep technical focus in one or more of the following areas: Intelligent developer or operational assistants (e.g., Copilot, Amazon Q, Claude or similar AI integrated workflows).
  • 3+ years deep technical focus in one or more of the following areas: Change management, deployment safety, and reliability engineering.
  • 3+ years deep technical focus in one or more of the following areas: Deep understanding of cloud infrastructure (Azure, AWS, or equivalent), service orchestration, and CI/CD pipelines at global scale.

Responsibilities

  • Lead the design and evolution of large-scale distributed systems that empower thousands of developers across Microsoft.
  • Collaborate with partner teams, influence long-term strategy, and shape the architecture for high-reliability experimentation, change management, and AI-driven operational quality.
  • Drive company-wide impact by defining technical strategy and standards for experimentation, change inventory, and incident analysis.
  • Partner with leaders across engineering and product to solve systemic challenges in safe rollouts, telemetry, and the automation of root cause analysis (RCA).
  • Mentor engineers and raise the bar for engineering quality, design rigor, and AI-augmented developer experience.
  • Lead technical strategy and architecture for ECS, shaping the future of experimentation, configuration, and change intelligence platforms used across M365, Copilot, and Azure.
  • Holds accountability as a Designated Responsible Individual (DRI), mentoring engineers across products/solutions, working on-call to monitor system/product/service for degradation, downtime, or interruptions.

Other

  • 3 days / week in-office
  • Individual Contributor
  • Full-Time
  • Partners with appropriate stakeholders to determine user requirements for a set of scenarios.
  • Leverages subject-matter knowledge of cross-product features with appropriate stakeholders (e.g., project managers) to drive multiple group's project plans, release plans, and work items.