Job Board



Senior Software Engineer, Site Reliability Engineer (SRE)

Harvey

$200,000 - $260,000
Dec 1, 2025
San Francisco, CA, USA

Harvey is transforming how legal and professional services operate by combining frontier agentic AI, an enterprise-grade platform, and deep domain expertise. The Software Engineer on the Site Reliability team will ensure the reliability, scalability, and performance of this legal AI platform, owning the systems that keep it fast, secure, and always on as the company grows.

Requirements

  • Expertise in infrastructure as code (IaC) tools (Pulumi, Terraform, CloudFormation, etc.)
  • Deep familiarity with observability tools (Datadog, Sentry, etc.) and incident response practices (PagerDuty, IncidentIO, etc.)
  • Proficiency with cloud infrastructure platforms (Azure, GCP, AWS, etc.)
  • Strong programming skills (Python, Bash, Go, or similar languages)
  • Proven track record of diagnosing complex system problems and implementing durable solutions
  • Solid understanding of CI/CD, Kubernetes, containerization, networking, databases, and cloud security principles
  • Excellent problem-solving skills, meticulous attention to detail, and a commitment to operational excellence

Responsibilities

  • Design, implement, and manage monitoring, alerting, and infrastructure resources (compute, storage, networking) across 50+ global regions
  • Lead incident management processes, including postmortems, root cause analyses, and driving actionable improvements
  • Automate operational tasks and workflows, building tools and processes for capacity planning, graceful rollouts, and safe data access to maintain high reliability and reduce manual intervention
  • Collaborate across teams to drive reliability, security, and compliance throughout the software lifecycle
  • Optimize infrastructure costs through strategic capacity planning and build-versus-buy decisions while maintaining system performance, reliability, and functionality

Other

  • 5+ years of experience in Site Reliability Engineering or similar roles supporting production environments
  • This role is based in San Francisco, CA. We use an in-person work model and offer relocation assistance to new employees.
  • We move fast, operate with intensity, and take real ownership of the problems we tackle — from early thinking to long-term outcomes.
  • We stay close to our customers — from leadership to engineers — and work together to solve real problems with urgency and care.
  • If you thrive in ambiguity, push for excellence, and want to help shape the future of work alongside others who raise the bar, we invite you to build with us.