General Motors Insurance is building an Insurtech business that uses data from GM's connected vehicle fleet to reinvent the auto insurance experience.
Requirements
- Strong application development knowledge in C# and JavaScript/TypeScript, with solid foundations in software engineering principles, debugging, and performance optimization.
- Deep expertise in Microsoft Azure PaaS (App Services, Functions, Databases, Networking) with working knowledge of AWS equivalents.
- Familiarity with container orchestration platforms (Azure Kubernetes Service, Kubernetes, Docker).
- Strong understanding of disaster recovery, incident management, and security best practices for business-critical applications.
- Knowledge of cloud networking (Azure DNS, Virtual Networks, Application Gateway; AWS Route 53, VPC) and CDN/WAF technologies (Akamai, Azure Front Door, AWS CloudFront).
- Experience with Infrastructure as Code (IaC) principles and tools: Terraform, ARM templates, Bicep.
- Experience with CI/CD practices and tooling: Azure DevOps YAML pipelines, GitHub Actions, GitOps.
Responsibilities
- Lead and mentor a team of Site Reliability Engineers, fostering a culture of reliability, learning, and continuous improvement.
- Stay hands-on when needed: code, automate, debug, and contribute to reliability tools and pipelines.
- Define and manage Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets, ensuring alignment with business goals.
- Own and maintain the PagerDuty implementation, including supporting teams with on-call scheduling, escalation flows, and incident response processes.
- Empower and guide engineers to serve as Incident Commanders during critical events, fostering a ‘you build it, you run it’ culture; support them through post-incident reviews and Root Cause Analyses (RCAs) to drive preventive measures and shared learning.
- Collaborate with product and development teams early in the lifecycle to design for operability, scalability, and maintainability.
- Implement observability solutions using Azure Monitor, Application Insights, and Log Analytics, and integrate CDN/WAF telemetry (Akamai, Azure Front Door; AWS CloudFront as applicable).
Other
- Proven track record of leading and growing engineering teams, including performance management, mentoring, and career development.
- Strong collaboration skills, with the ability to influence without authority across product, engineering, and operations teams.
- Incident leadership and crisis communication under pressure.
- Ability to translate technical reliability metrics into clear business impact statements for executives.
- Proven ability to give senior leadership, technology, and business partners operational visibility into environment health.