Job Board
LogoLogo

Get Jobs Tailored to Your Resume

Filtr uses AI to scan 1000+ jobs and finds postings that perfectly matches your resume

Microsoft Logo

Senior Software Engineer - Availability Platform - Azure Compute

Microsoft

$119,800 - $234,700
Sep 24, 2025
Remote, US
Apply Now

The Azure Compute team is looking to solve the problem of ensuring every Azure VM achieves an SLA of 99.99+%, by building a fault-tolerant, distributed system on top of commodity datacenter hardware to deliver infrastructure for hosting cloud applications in virtual machines (VMs).

Requirements

  • Coding in languages including, but not limited to, C, C++, C, Java, JavaScript, or Python
  • Experience designing algorithms and data structures
  • Experience with AI and machine learning to build predictive failure models
  • Experience with generative AI to enhance diagnostics, automate root cause analysis, and accelerate incident resolution
  • Experience with services architecture at hyperscale
  • Experience with cloud infrastructure
  • Experience with cutting-edge AI to redefine cloud infrastructure

Responsibilities

  • Collaborates with appropriate stakeholders to determine user requirements for a scenario.
  • Drives identification of dependencies and the development of design documents for a platform and services.
  • Creates, implements, optimizes, debugs, refactors, and reuses code to establish and improve performance and maintainability, effectiveness, and return on investment (ROI).
  • Leverages subject-matter expertise of product features and partners with appropriate stakeholders (e.g., project managers) to drive a workgroup's project plans, release plans, and work items.
  • Acts as a Designated Responsible Individual (DRI) and guides other engineers by developing and following the playbook, working on call to monitor system/product/service for degradation, downtime, or interruptions, alerting stakeholders about status and initiates actions to restore system/product/service for simple and complex problems when appropriate.
  • Proactively seeks new knowledge and adapts to new trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale.
  • Owns services that monitor the health of millions of Azure machines and the control plane services that make all repair decisions in Azure.

Other

  • Bachelor's Degree in Computer Science or related technical field
  • 6+ years technical engineering experience
  • Ability to meet Microsoft, customer and/or government security screening requirements
  • Ability to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter
  • Equivalent experience