The technology landscape of Markel, a Fortune 500 insurance company, needs to remain robust, secure, efficient, and compliant within a highly regulated industry.
Requirements
- Extensive experience (10+ years) in technology operations, security, SRE, and quality assurance, ideally within a large, regulated enterprise environment, preferably in the insurance industry.
- Strong expertise in cloud platforms (Azure, AWS) and cloud operations best practices.
- Deep understanding of DevOps tools and automation frameworks (e.g., Jenkins, Terraform, Ansible, Kubernetes, Docker).
- Hands-on experience with Site Reliability Engineering principles, monitoring tools (e.g., Prometheus, Grafana, Datadog), and incident management.
- Experience leading Disaster Recovery planning and execution.
- Strong knowledge of Governance, Risk, and Compliance frameworks (e.g., SOC 2, ISO 27001, HIPAA).
- Solid understanding of QA methodologies and tools.
Responsibilities
- Lead and manage Cloud Operations, DevOps, and GRC activities for all applications and IT resources in US and Bermuda.
- Own and oversee Site Reliability Engineering (SRE) initiatives to ensure high availability, performance, and resilience of applications and infrastructure.
- Manage and support Disaster Recovery (DR) planning, execution, and testing to safeguard business continuity.
- Lead the Quality Assurance (QA) team to ensure high-quality standards for application development, delivery, and operational processes.
- Continuously monitor application and infrastructure health; implement proactive alerting, tracking, and remediation processes.
- Drive automation initiatives across Cloud and DevOps to improve efficiency, reliability, and scalability.
- Develop and implement operational efficiency practices including FinOps to optimize cloud spending and resource utilization.
Other
- US Work Authorization required.
- 10+ years of experience in technology operations, security, SRE, and quality assurance.
- Proven leadership experience managing multi-disciplinary teams (Cloud Operations, DevOps, SRE, QA, GRC).
- Excellent strategic thinking, planning, and communication skills.
- Strong collaboration skills with the ability to work across multiple teams and geographies.