Job Board
LogoLogo

Get Jobs Tailored to Your Resume

Filtr uses AI to scan 1000+ jobs and finds postings that perfectly matches your resume

General Motors Logo

Engineering Manager, Observability

General Motors

$200,000 - $285,000
Oct 24, 2025
Mountain View, CA, US
Apply Now

The AI Cloud and Developer Infrastructure organization at GM is responsible for delivering and maintaining tools and services that engineers use daily. The goal is to enhance the entire development process, ensuring AV engineers and others have world-class tools and a seamless development experience so they can focus on critical problems.

Requirements

  • Deep understanding of core observability pillars: logs, metrics, and traces. Experience with technologies like Prometheus, Grafana, OpenTelemetry , and log management systems is crucial
  • Strong background in designing, developing, and architecting distributed systems, cloud-native applications, and microservices
  • Familiarity with Go, Python, Typescript or similar along with software development practices to inform code reviews and architectural decisions
  • Experience with modern cloud offerings like GCP, AWS, or Azure and technologies like CI/CD pipelines, Kubernetes, and Docker
  • Experience with GCP, AWS, or Azure
  • Familiarity with Kubernetes, Docker, Istio, Terraform, Prometheus, Grafana, TSDBs and observability pipelines ( e.g. either for logging or metrics or tracing)
  • Skilled in defining and instrumenting SLIs and SLOs

Responsibilities

  • Define and execute the technical vision and roadmap for the observability platform, ensuring it provides actionable insights into complex systems.
  • Provide technical guidance on instrumentation, logging, metrics, and tracing to ensure comprehensive visibility across GM’s AV software stack.
  • Ensure the team's tools enable rapid detection, debugging, and resolution of unknown or unforeseen system failures to minimize downtime.
  • Work with other engineering teams—such as those developing AI/ML, firmware, and infrastructure—to implement observability practices and improve system reliability.
  • Lead the development of internal tools and data pipelines to collect, analyze, and visualize telemetry data at a massive scale.
  • Manage relationships and costs associated with third-party observability software and platforms.
  • Identify high ROI investments with minimal guidance.

Other

  • Manage and grow a team of engineers, conducting performance reviews, providing coaching, and supporting career development.
  • 5+ years of experience leading software or site reliability engineering (SRE) teams and balancing the tradeoff between velocity and reliability
  • Excellent interpersonal and communication skills to collaborate effectively with diverse teams and stakeholders
  • 3+ years of experience managing software engineering or site reliability engineering (SRE) teams.
  • This role is categorized as remote or hybrid. This means the successful candidate, if within 50 mile radius of GM location, is expected to report to the office three times per week, at minimum.