WebstaurantStore is looking for a Site Reliability Manager to support their web application development and e-commerce teams, contributing to the growth of their business by ensuring the reliability and scalability of their systems.
Requirements
- Leading teams using Linux, Kubernetes, Prometheus, OTEL, and other cloud native technologies.
- A background running infrastructure at scale with a strong working knowledge of web development.
- A solid understanding of both on premise and cloud based (Azure) solutions.
- Project Management Lifecycles (Agile -Scrum/Kanban and Waterfall).
Responsibilities
- Actively contributing to code reviews, architecture decisions and process improvements.
- Leading teams using Linux, Kubernetes, Prometheus, OTEL, and other cloud native technologies.
- Running infrastructure at scale with a strong working knowledge of web development.
- Managing timelines, which includes collaborating with Development, BA, QA and other IT teams.
- Ensuring our fun and open culture is preserved.
Other
- Responsible for hiring and growing the team.
- Mentoring and managing performance of approximately 10 direct reports to facilitate career goals
- Working effectively in a collaborative and innovative team-oriented environment.
- 6+ years of proven expertise in leading and managing teams.
- Access to a reliable and secure high-speed internet connection.
- A dedicated home office space that is noise- and distraction-free.
- The desire and ability to work and communicate with other team members via chat, webcam, etc.