Senior Software Engineer - Site Reliability

Abnormal Security

$176,000 - $230,000
Aug 15, 2025
Remote, US

Abnormal Security is looking for a Senior Software Engineer - Site Reliability to join its Infrastructure team. The engineer will be responsible for the reliability, scalability, and operational excellence of the company's systems and services, and will lead initiatives to improve the operational maturity of both SRE-managed services and critical product systems, driving change across the organization in support of stable operations.

Requirements

  • 8+ years of experience in infrastructure, DevOps, or Site Reliability Engineering roles
  • Deep knowledge of production-grade distributed systems and cloud-native architectures
  • Demonstrated experience managing service availability, latency, and incident response in production environments
  • Strong programming skills in Python, Go, or similar languages
  • Experience with Kubernetes, Terraform, and observability tools (e.g., Prometheus, Grafana, Datadog)
  • Proven ability to lead complex, multi-team initiatives and influence system design for reliability
  • Familiarity with AWS and multi-cloud environments (e.g., Azure, GCP)

Responsibilities

  • Own the operational maturity of services in the SRE software stack, driving architectural and tooling improvements
  • Proactively partner with product teams to embed SRE best practices and support services that face operational challenges
  • Independently define and drive quarterly goals for the SRE team with measurable impact on system reliability and developer productivity
  • Design and maintain systems that promote observability, automated recovery, scalability, and resilience
  • Lead incident reviews and root cause analyses; ensure follow-up actions are implemented and shared across teams
  • Collaborate with engineering leadership to shape the team roadmap and contribute to company-wide reliability goals
  • Mentor other engineers and drive adoption of SRE principles throughout the engineering organization

Other

  • Demonstrated experience leading broad technical initiatives across teams and systems
  • Strong communication and mentoring skills, with the ability to influence both within the SRE team and across engineering
  • Product-focused mindset with the ability to translate business needs into reliability goals
  • Prior experience embedding with product engineering teams to support operational goals
  • Experience in regulated environments or with FedRAMP-compliant systems