Site Reliability Engineer II - CTJ - Poly

Microsoft

$100,600 - $199,000
Sep 10, 2025
Redmond, WA, USA

Microsoft is looking for an engineer to help deliver high-quality, reliable, and scalable Office 365 government cloud services to its critical customers, meeting the highest expectations for feature quality, security, reliability, availability, and performance.

Requirements

  • Master's Degree in Computer Science, Information Technology, or related field AND 1+ years of technical experience in software engineering, network engineering, or systems administration, OR
  • Bachelor's Degree in Computer Science, Information Technology, or related field AND 2+ years of technical experience in software engineering, network engineering, or systems administration
  • 2+ years of technical experience working with large-scale cloud or distributed systems.
  • Demonstrates expertise in distributed systems design, interactions between cloud technology layers and components, common dependencies at scale, and the code that defines infrastructures.
  • Develops an understanding of the code, features, and operations of specific products at scale as required to contribute to incremental improvements in product availability, reliability, efficiency, observability, and/or performance.
  • Researches and maintains an awareness of industry trends, advances in distributed systems and cloud technologies, new tools, and/or processes for maintaining and improving product availability, reliability, efficiency, observability, and/or performance.
  • Leverages technical expertise in large-scale distributed systems and specific products, as well as objective insights drawn from analyses of production telemetry data, to suggest changes or add-ons to product features or code that improve the availability, reliability, efficiency, observability, and performance of product components or features supported by their team.

Responsibilities

  • Design, develop, and deliver the required software engineering to serve and protect O365 government clouds.
  • Own deployment, availability, reliability, performance and customer escalation targets for sovereign environments.
  • Proactively identify and reduce issues through design, testing, and implementation of software-based solutions.
  • Collaborate with Engineering and Program Management partners to translate customer, business, and technical requirements into architectural designs and feature releases.
  • Drive efficiencies through software improvement and root cause analysis, resulting in improved service delivery, maturity, and scalability.
  • Develop and test basic changes to optimize code and improve the observability, reliability and operability of a defined range of platform, system, or product components or features with direction from other engineers.
  • Independently develop code or scripts that automate repetitive and easily scalable operations processes (e.g., monitoring, alerting, deploying products and updates) across components and features of products operating at scale.

Other

  • Passionate about distributed systems and working with highly scalable services
  • Enjoys new technological challenges and is motivated to solve them
  • Excited about making better software and continuously improving the development, integration, and deployment processes
  • Smart, highly motivated self-starter who thrives in a bottom-up, fast-paced, highly technical environment
  • Effective collaborator, experienced in creating technical partnerships across teams