Google is hiring an Engineering Manager for Site Reliability Engineering (SRE) to ensure the reliability, uptime, and fast rate of improvement of its services, including internally critical and externally visible systems.
Requirements
- 8 years of experience with software development in one or more programming languages or designing, analyzing, and troubleshooting distributed systems.
- Experience working in large-scale distributed systems, storage, or data analysis systems.
- Experience with coding, algorithms, complexity analysis and large-scale system design.
- Experience collaborating with executive-level stakeholders.
- Experience in leadership in a distributed team structure.
- Experience managing executive-level and staff-level engineers.
Responsibilities
- Advocate for the reliability of Sawmill, Google's critical business logging service, including the implementation, client migration, and reliable operation of Sawmill Next, a decentralized deployment.
- Ensure the reliability of new Sawmill features for key clients (e.g., machine learning) and disentangle workloads to guarantee critical data delivery even during growth or degrowth environments.
- Partner with key dependencies (e.g., Borg, Colossus) to enable system support and manage the resourcing and safety of the 8 Logs cells that house Sawmill’s data.
- Act as a key member of the SRE leadership group across Sunnyvale and Sydney. Serve as the day-to-day contact and bridge between Sunnyvale Logs development team leads and the Sydney Logs SRE partner team.
- Participate in the team's operational duties, including Tier 1 primary on-call shifts, to maintain system knowledge and identify pain points.
Other
- Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
- 3 years of experience managing people or teams, and leading projects.
- Must be willing to participate in on-call shifts.
- Must be willing to work in a fast-paced environment with changing priorities.
- Must be able to work collaboratively in a team environment.