SentinelOne is looking to hire a Senior Infrastructure Engineer to design, build, and deploy their observability infrastructure as a service. This role is crucial for monitoring SentinelOne's solutions, improving engineering teams' operational visibility, and speeding up software delivery.
Requirements
- Proven expertise in designing resilient infrastructure and architecting robust cloud solutions.
- Substantial, practical experience in ensuring high site reliability for large-scale SaaS products.
- Demonstrated proficiency with core observability technologies such as Grafana, Prometheus, Thanos/Mimir/Cortex, OTEL, and comparable platforms.
- Hands-on experience with leading container orchestration systems like Kubernetes, including practical application of Helm, Kustomize, and similar tooling.
- Practical multi-cloud experience, with demonstrated expertise in at least one major platform (AWS, GCP).
- A solid understanding of modern CI/CD principles and advanced deployment automation tools (e.g., GitHub Actions).
- Hands-on experience with Infrastructure as Code (IaC) tools like Terraform and Ansible.
Responsibilities
- Drive exceptional operational efficiency for critical observability services you own (based on Grafana, Prometheus, Thanos, OTEL), meticulously balancing unwavering reliability with astute cost-effectiveness, including optimizing cloud resource utilization.
- Champion automation to significantly reduce operational toil and minimize pager burden, freeing up valuable engineering time.
- Ensure comprehensive operational visibility by rigorously implementing Infrastructure as Code (IaC), embedding robust observability, and championing industry best practices.
- Design and implement resilient, scalable systems and platforms that empower SentinelOne engineers to deliver features with unparalleled safety, speed, and reliability.
- Expertly administer and evolve core observability tools, including Grafana, Prometheus, Thanos/Mimir/Cortex, and OTEL collectors/pipelines.
- Operate and innovate across diverse, large-scale environments, spanning Kubernetes clusters (EKS, GKE) and core cloud platforms (AWS, GCP).
- Lead swift and effective resolution of complex technical issues, ensuring continuous system integrity and peak performance.
Other
- Under Federal & FedRAMP regulations hiring for this role is limited to US citizens only.
- FedRAMP Staff may be subject to customer or third party background checks up to and including secret clearance if required by their role at SentinelOne.
- A robust background of 7+ years in IT or a related engineering discipline, marked by consistent achievement and technical growth.
- A strong aptitude for and keen interest in mastering comprehensive observability solutions, coupled with hands-on experience in the field.
- Actively review the technical work of peers, providing insightful and constructive feedback that fosters growth and upholds SentinelOne's engineering standards.