Anduril Industries is looking to build and maintain resilient infrastructure supporting the deployment of mission-critical software across multi-cloud and on-premises environments.
Requirements
- Strong Kubernetes is required
- Extensive experience with SRE principles and practices in Kubernetes environments
- Extensive experience across one or more managed Kubernetes offerings (Rancher, Anthos, etc.)
- Experience with hardening and monitoring Kubernetes clusters
- Deep experience with architecting and securing cloud infrastructure in deployed/production environment
- Highly proficient in multiple modern cloud providers (preferably GCP or AWS)
- Experience with Infrastructure-as-Code (preferably Terraform)
Responsibilities
- Design, deploy, and operate mission-critical Kubernetes clusters that are highly available, resilient, and scalable, without relying on cloud-managed services
- Develop and maintain tooling infrastructure to build and support deployments to the cloud and on-prem
- Troubleshoot and remediate complex issues in Linux-based infrastructure
- Collaborate with software development teams to optimize containerized application deployments (CI/CD)
- Monitor for and remediate security weaknesses and baseline regressions across infrastructure
- Serve as a subject matter expert for teams across Anduril looking to leverage Kubernetes and cloud infrastructure
Other
- Ability to decompose abstract program goals into achievable technical objectives and strategies for measuring progress
- Experience leading and mentoring teams of engineers and working cross-functionally across partner teams
- Possess a desire to work on critical mission software that has a real-world impact
- Currently possesses and is able to maintain an active U.S. Top Secret SCI security clearance
- Willingness to travel up to 10%