Site Reliability Engineering Manager - General Motors Insurance

GM Financial

$115,000 - $213,000

Aug 25, 2025

Arlington, TX, US

General Motors Insurance is building an Insurtech business that uses data from GM's connected vehicle fleet to reinvent the auto insurance experience.

Requirements

Strong application development knowledge in C# and JavaScript/TypeScript, with solid foundations in software engineering principles, debugging, and performance optimization.
Deep expertise in Microsoft Azure PaaS (App Services, Functions, Databases, Networking) with working knowledge of AWS equivalents.
Familiarity with container orchestration platforms (Azure Kubernetes Service, Kubernetes, Docker).
Strong understanding of disaster recovery, incident management, and security best practices for business-critical applications.
Knowledge of cloud networking (Azure DNS, Virtual Networks, Application Gateway; AWS Route 53, VPC) and CDN/WAF technologies (Akamai, Azure Front Door, AWS CloudFront).
Infrastructure as Code (IaC) principles and tools: Terraform, ARM, Bicep.
CI/CD practices and tooling: Azure DevOps YAML, GitHub Actions, GitOps.

Responsibilities

Lead and mentor a team of Site Reliability Engineers, fostering a culture of reliability, learning, and continuous improvement.
Stay hands-on when needed for coding, automating, debugging, and contributing to reliability tools and pipelines.
Define and manage Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets, ensuring alignment with business goals.
Own and maintain the PagerDuty implementation, including supporting teams with on-call scheduling, escalation flows, and incident response processes.
Empower and guide engineers to serve as Incident Commanders during critical events, fostering a ‘you build it, you run it’ culture; support them through post-incident reviews and Root Cause Analyses (RCAs) to drive preventive measures and shared learning.
Collaborate with product and development teams early in the lifecycle to design for operability, scalability, and maintainability.
Implement observability solutions using Azure Monitor, Application Insights, and Log Analytics, and integrate CDN/WAF telemetry (Akamai, Azure Front Door; AWS CloudFront as applicable).

Other

Proven track record of leading and growing engineering teams, including performance management, mentoring, and career development.
Strong collaboration skills, with the ability to influence without authority across product, engineering, and operations teams.
Incident leadership and crisis communication under pressure.
Ability to translate technical reliability metrics into clear business impact statements for executives.
Proven capability to provide operational visibility into environment health to Senior Leadership, Technology, and Business partners.