Dentsu is looking to oversee the reliability, scalability, and operational excellence of their global platforms, specifically supporting the Dentsu.Connect product ecosystem and expanding its impact across the broader enterprise.
Requirements
- Running large-scale, complex platforms
- Cloud infrastructure (preferably Azure)
- Observability, monitoring, and cost optimization
- SRE principles (SLOs, SLIs, incident management)
- CI/CD tooling and DevOps practices
- Cloud automation and infrastructure-as-code
- Familiarity with GitHub, GitHub Actions, and Snowflake DevOps practices
Responsibilities
- Provide strategic direction across SRE, DevOps, and infrastructure domains
- Guide architectural decisions and participate in global architecture forums
- Oversee the evolution of internal platforms and cloud environments (Azure)
- Define and maintain reliability and performance standards (SLOs, SLIs)
- Ensure monitoring, alerting, and operational tooling
- Manage platform cost optimization and resource efficiency
- Functional oversight includes: Site Reliability Engineering, DevOps Tooling and Developer Experience, Core Cloud Infrastructure (Azure), Platform Operations, Observability, and Cost Optimization, Global Technical Support
Other
- Lead a global team of ~20 engineers across SRE, DevOps, and infrastructure
- Manage two team leads overseeing SRE/Support and DevOps/Infrastructure
- Foster a culture of collaboration, accountability, and continuous learning
- Be the primary contact for platform reliability and tooling across global leadership
- Collaborate with Product, Engineering, Security, Data, Compliance, and external partners