Google's Site Reliability Engineering (SRE) team needs to ensure the reliability, uptime, and performance of Google's massively distributed, fault-tolerant systems and services, both internal and external, by optimizing existing systems, building infrastructure, and eliminating work through automation.
Requirements
- 2 years of experience with software development in one or more programming languages.
- 2 years of experience designing, analyzing, and troubleshooting large-scale distributed systems.
Responsibilities
- Write product or system development code.
- Review code developed by other engineers and provide feedback to ensure best practices (e.g., style guidelines, checking code in, accuracy, testability, and efficiency).
- Contribute to existing documentation or educational content and adapt content based on product/program updates and user feedback.
- Triage product or system issues and debug/track/resolve by analyzing the sources of issues and the impact on hardware, network, or service operations and quality.
- Participate in, or lead design reviews with peers and stakeholders to decide amongst available technologies.
- manage project priorities, deadlines, and deliverables.
- design, develop, test, deploy, maintain, and enhance software solutions.
Other
- Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
- Master's degree in Computer Science or Engineering.
- The US base salary range for this full-time position is $141,000-$202,000 + bonus + equity + benefits.
- Google is proud to be an equal opportunity workplace and is an affirmative action employer.
- We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status.