CCC Intelligent Solutions Inc. (CCC) is looking to ensure the reliability, performance, and availability of applications by hiring a Site Reliability Engineer (SRE) to work closely with the Product Development team.
Requirements
- Hands-on experience in DevOps, Infrastructure, or Site Reliability Engineering roles.
- Ability to analyze and interpret logs, metrics, and monitoring data to troubleshoot issues.
- Familiarity with the software development lifecycle and deployment practices.
- Understanding of microservices architecture and RESTful APIs.
- Comfortable working in Agile/Scrum environments.
- Ability to create and maintain basic automation or scripting (e.g., Python, Bash).
- Experience working with SQL for querying and reporting.
Responsibilities
- Collaborate with engineering teams to promote SRE practices across the organization.
- Create and maintain operational documentation such as runbooks and playbooks.
- Configure and maintain monitoring and alerting tools related to the applications.
- Monitor application and infrastructure health and recommend improvements to performance and reliability.
- Identify manual tasks and contribute to automation efforts.
- Work with developers and QA teams to support non-functional requirements like performance and availability.
- Assist in triaging and resolving production issues and incidents.
Other
- Good communication and collaboration skills.
- 401K Match
- Paid time off
- Annual Incentive Plan Performance Bonus
- Comprehensive health insurance
- Bachelor's degree or equivalent experience (not explicitly mentioned but implied)