Job Board
LogoLogo

Get Jobs Tailored to Your Resume

Filtr uses AI to scan 1000+ jobs and finds postings that perfectly matches your resume

Microsoft Logo

Senior Site Reliability Engineering Manager - CTJ - Top Secret

Microsoft

$119,800 - $234,700
Sep 4, 2025
Remote, US
Apply Now

Microsoft is looking to solve the problem of protecting millions of computers from thousands of active attack attempts every month by building and delivering cloud solutions to meet the scale required to support government environments.

Requirements

  • Master's Degree in Computer Science, Information Technology, or related field AND 2+ years technical experience in software engineering, network engineering, or systems administration
  • Bachelor's Degree in Computer Science, Information Technology, or related field AND 4+ years technical experience in software engineering, network engineering, or systems administration
  • Equivalent experience
  • 3+ years technical experience working with large-scale cloud or distributed systems
  • 1+ year(s) people management experience
  • Experience with AI/ML-powered automation
  • Experience with distributed systems and cloud infrastructure

Responsibilities

  • Lead Reliability Strategy: Drive the vision and execution of reliability, performance, and security across critical systems and services. Influence product design and engineering decisions to ensure resilient, scalable infrastructure.
  • Build and Scale Automation: Champion intelligent automation (AI/ML-powered) for monitoring, deployment, and incident response to reduce manual overhead and accelerate safe delivery.
  • Drive Operational Excellence: Use telemetry and service-level data to guide improvements in availability, efficiency, and cost. Lead post-incident reviews and service improvement plans that restore customer trust and drive long-term resilience.
  • Foster Engineering Partnerships: Collaborate deeply with product engineering and security teams from early development through production to align on reliability goals and prevent recurrence of issues.
  • Grow and Empower Teams: Attract, mentor, and develop high-performing SRE talent. Create a culture of inclusion, learning, and accountability that supports career growth and innovation.
  • Shape Technical Direction: Guide architecture and tooling decisions across distributed systems and cloud infrastructure. Promote adoption of best practices and scalable solutions across teams.
  • Operate production services and work closely with other engineering teams to ensure services and systems are highly stable, meet performance SLAs, and meet the expectations of internal and external customers and users.

Other

  • Must have an active U.S. Government Top-Secret Security Clearance
  • Ability to meet Microsoft, customer and/or government security screening requirements
  • U.S. citizenship due to citizenship-based legal restrictions
  • Verification of U.S. citizenship via a valid passport, or other approved documents, or verified US government Clearance
  • Must be able to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter