Google's Site Reliability Development aims to ensure the reliability, uptime, and performance of its large-scale, distributed systems. The role focuses on optimizing existing systems, building infrastructure, and automating processes to maintain and improve the quality of Google's services.
Requirements
- 1 year of experience with software development in one or more programming languages during coursework/projects, research, internships, or practical experience in school, work, or Open Source projects.
- 1 year of experience with data structures or algorithms.
Responsibilities
- Write product or system development code.
- Review code developed by other developers and provide feedback to ensure best practices (e.g., style guidelines, checking code in, accuracy, testability, and efficiency).
- Contribute to existing documentation or educational content and adapt content based on product/program updates and user feedback.
- Triage product or system issues, then debug, track, and resolve them by analyzing their sources and their impact on hardware, network, or service operations and quality.
- Participate in, or lead, design reviews with peers and stakeholders to decide among available technologies.
- Design, develop, test, deploy, maintain, and enhance software solutions.
Other
- Manage project priorities, deadlines, and deliverables.
- Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
- Master's degree in Computer Science or Engineering, or a related field.
- Locations: Waterloo, ON, Canada; New York, NY, USA; Pittsburgh, PA, USA; Raleigh, NC, USA; Durham, NC, USA; Seattle, WA, USA; Sunnyvale, CA, USA