Waymo's software reliability engineers (SRE) are responsible for the stable operation of Waymo’s fully autonomous systems and supporting infrastructure. As an SRE, you combine software and systems engineering techniques to build and run large-scale, fault-tolerant, reliable systems.
Requirements
- 2+ years of experience writing clean, efficient code in C++, Java and/or Python
- An interest and enthusiasm for massively distributed systems
- Interest in monitoring, debugging, and automation
- Demonstrated ability as an individual contributor engineer, contributing to scalable, high quality software
- 3 years of industry experience
Responsibilities
- Manage end-to-end availability and performance by measuring the entire system and developing automated solutions to improve it
- Involvement in the whole lifecycle of services - from inception and design, through deployment, operation and refinement
- Monitoring of massive batch production pipeline systems
- Writing designs and coding software/automation for global infrastructure
- Participate in a sustainable oncall incident response team and practice blameless retrospectives
- Collaborate with other engineers to build reliable systems that meet internal Waymo business needs
Other
- In this hybrid role you will report to a Tech Lead Manager.
- A Bachelors degree in a relevant field or similar experience
- A Bachelors of Computer Science (or similar)