Optimize cloud infrastructure strategy, platform reliability, and security operations on Google Cloud Platform (GCP), while driving DevOps best practices and compliance standards across the organization.
Requirements
- Deep expertise in GCP services and operations.
- Proven experience managing production Kubernetes (GKE) environments.
- Strong knowledge of CI/CD pipelines, IaC, and security compliance frameworks.
- Experience with enterprise-level SRE practices.
- Familiarity with AI infrastructure (GPU clusters, model deployment, data pipelines).
- Certifications such as AWS Certified Solutions Architect – Professional, Google Professional Cloud Architect, or Azure Solutions Architect Expert.
Responsibilities
- Oversee GCP cloud operations, including Cloud Logging, Cloud Monitoring, Security Command Center, Secret Manager, IAM, and Cloud Armor.
- Manage containerized deployments using Google Kubernetes Engine (GKE) and containerd.
- Lead CI/CD strategy and infrastructure automation with Terraform and Argo CD.
- Establish and enforce security standards around authentication, authorization, and compliance.
- Drive site reliability engineering (SRE) practices for uptime, performance, and disaster recovery.
- Collaborate with engineering teams to ensure seamless infrastructure support for applications.
- Monitor cloud spend and optimize for cost efficiency.
Other
- 7–10 years of experience in cloud infrastructure and operations, including 3–5 years in a leadership role.
- Excellent problem-solving, leadership, and cross-functional collaboration skills.
- Google Professional Cloud Architect certification or equivalent.
- Healthcare or health-tech industry experience.
- Strong business acumen with the ability to balance innovation, reliability, and cost optimization.