Evinova, a global health tech business and subsidiary of AstraZeneca, is seeking a Senior DevOps Engineer to scale and operate its global SaaS platform. The role ensures the reliability, cost efficiency, and performance of data and AI workloads so that scientists, data engineers, and developers can deliver faster and more securely.
Requirements
- Deep operational experience with AWS (Fargate, EKS, EC2, S3, RDS, Lambda, IAM, CloudWatch, CloudTrail).
- Proficiency with CI/CD tools (ArgoCD, GitHub Actions, Jenkins) and automation scripting (Python, Bash, TypeScript).
- Strong hands-on experience with Kubernetes and containerized workloads.
- Working experience with AI/ML platforms (AWS SageMaker, Kubeflow, MLflow, or equivalent).
- Familiarity with GPU workloads and performance/cost tuning for AI pipelines.
- Knowledge of MongoDB operations and performance optimization.
- Solid understanding of FinOps principles, cost monitoring, and right-sizing in AWS.
Responsibilities
- Lead operations for multi-tenant SaaS workloads running on AWS (Fargate, EKS, S3, RDS, Lambda, etc.).
- Design and implement scalable, highly available, and cost-efficient infrastructure for production and ML workloads.
- Drive incident response, postmortems, and operational runbooks to improve uptime and reduce MTTR.
- Own and enhance CI/CD pipelines (ArgoCD, GitHub Actions, Jenkins) supporting both application and ML model deployment workflows.
- Build automation for environment provisioning, configuration, and lifecycle management using Infrastructure as Code (AWS CDK or Terraform).
- Support and automate ML pipelines for training, testing, and deployment using AWS SageMaker, Kubeflow, or MLflow.
- Develop and maintain dashboards, alerts, and telemetry (Splunk, Grafana, OpenTelemetry, AWS CloudWatch).
Other
- Three days a week in the office.
- Mentor junior engineers on operational best practices, IaC, CI/CD, and observability.
- Collaborate with Data Engineering, AI/ML, and Platform Ops teams to ensure smooth cross-team delivery.
- Participate in global change and incident management processes.
- Enjoy mentoring team members and fostering a collaborative, innovation-driven team culture.