Job Board
LogoLogo

Get Jobs Tailored to Your Resume

Filtr uses AI to scan 1000+ jobs and finds postings that perfectly matches your resume

OutSystems Logo

Site Reliability Engineering Manager

OutSystems

Salary not specified
Sep 24, 2025
Boston, MA, USA • Washington, DC, USA • Philadelphia, PA, USA • New York, NY, USA
Apply Now

OutSystems is looking to solve the business and technical problem of ensuring the reliability, scalability, and availability of their AI-powered low-code development platform, which is crucial for enterprise customers who are increasingly adopting AI and struggling with legacy systems and application backlogs.

Requirements

  • Expertise in AWS, GCP, and/or Azure (with strong preference for AWS).
  • Deep knowledge of Kubernetes (K8s), and service mesh technologies (e.g., Istio, Gloo).
  • Experience with Terraform, Spacelift, CloudFormation.
  • Experience with Jenkins, GitHub Actions, GitLab CI, or ArgoCD.
  • Experience with Prometheus, Grafana, Datadog, ELK/EFK stack, OpenTelemetry.
  • Proficiency in Python, Go, Bash, or similar languages.
  • Solid grasp of DNS, load balancing, TLS, IAM, and security best practices.

Responsibilities

  • Building, scaling, and maintaining highly available, distributed systems.
  • Managing incidents, SLAs, SLOs, and service reliability metrics.
  • Fostering a culture of automation, reliability, and continuous improvement.
  • Designing resilient and fault-tolerant systems.
  • Debugging complex distributed systems.
  • Implementing and managing CI/CD pipelines.
  • Ensuring robust monitoring and observability of systems.

Other

  • EST location required!
  • Strong leadership with the ability to inspire, mentor, and grow high-performing technical teams.
  • Excellent problem-solving skills with a calm, analytical approach under pressure.
  • Effective communicator who can translate complex technical concepts into business language.
  • Skilled collaborator, able to work cross-functionally with engineering, product, and business teams.