The company is seeking to install, configure, maintain, and troubleshoot a large-scale, multi-tenant, on-premises Kubernetes cluster that serves as the foundation for mission-critical applications across multiple tenants.
Requirements
- Experience with Linux CLI
- Experience writing scripts using Bash/Python
- Experience with containerization technologies such as Docker
- Experience administering and monitoring Kubernetes clusters
- Experience with IaC (Infrastructure as Code) principles and with automating infrastructure provisioning and configuration using tools such as Helm and Ansible
- Experience using system monitoring tools such as Prometheus/Grafana
- Experience with Git for source code management, branching strategies, and team collaboration
Responsibilities
- Install, configure, maintain, and troubleshoot a large-scale, multi-tenant, on-premises Kubernetes cluster
- Collaborate closely with DevOps, Security, and Application teams to implement automation, enforce best practices, and support integration of new services within the Kubernetes cluster
- Ensure the reliability, performance, and security of the Kubernetes-based infrastructure
- Troubleshoot and resolve issues related to Kubernetes workloads, networking, ingress, storage, and performance
Other
- Bachelor's Degree in Computer Science or related field and at least eight (8) years of demonstrable experience
- Active TS/SCI with an appropriate polygraph is required to be considered for this role
- Five (5) years of full-time, directly related Computer Science work may be substituted for a degree
- Master's Degree in Computer Science or related field may substitute for two (2) years of experience