Jellyvision is seeking a Senior Director of Site Reliability Engineering & Software Engineering to lead mission-critical infrastructure and engineering excellence initiatives, elevate the SRE organization, establish industry-leading practices, and drive strategic technology decisions to simplify platforms and improve performance.
Requirements
- Deep expertise in cloud platforms (AWS, GCP, Azure), containerization (Kubernetes, Docker), and Infrastructure as Code
- Strong background in distributed systems, microservices architecture, and database technologies
- Experience with monitoring and observability tools (Dynatrace, DataDog, New Relic, etc.)
- Experience with AI automation and Workflow Optimization tools
- Knowledge of compliance frameworks and security best practices in cloud environments
- Certifications in cloud platforms or SRE methodologies
Responsibilities
- Lead and elevate our existing SRE team to world-class performance standards, advancing career development and technical excellence
- Optimize and mature our established SRE practices, enhancing SLO/SLI frameworks, error budget management, and incident response effectiveness
- Strengthen our culture of reliability and observability, driving higher standards for continuous improvement across all engineering teams
- Refine existing on-call processes, escalation procedures, and post-incident reviews to accelerate learning and prevent recurring issues
- Drive an AI first agenda, leveraging AI tooling to address key pain points and improve speed to market
- Own and execute the core product technology roadmap, balancing feature delivery with reliability and scalability requirements
- Implement comprehensive monitoring, alerting, and observability solutions across all systems
Other
- Directly manage a team of onshore and offshore software engineers
- A strong people leader. You build high-performing, motivated teams. You coach, develop, and inspire others to do their best work and grow with the company.
- Executive savvy, team-first. You can influence at the highest levels while staying grounded in the needs of your team.
- Comfortable with ambiguity. You thrive in fast-moving environments with legacy complexity and new greenfield opportunities.
- Curious and driven. You’re energized by learning, solving real problems, and making things better—for users, customers, and the business.