Ridgeline is looking to build systems that make reliability a competitive advantage, scaling reliability across its cloud-native platform and delivering high-performance, zero-downtime services with speed, clarity, and confidence.
Requirements
- Proficiency with at least one modern programming language (e.g., Java, Python, JavaScript, Kotlin); experience with scripting and tooling automation is a plus.
- Experience with observability tools (such as Datadog, Prometheus) and fundamentals of monitoring and alerting.
- Familiarity with infrastructure-as-code tools (Terraform, Cloud Development Kit) and basic CI/CD pipelines.
- Comfortable seeking support as needed and communicating clearly within the team.
- Familiarity with AI-assisted tooling or workflows is a plus, but not required.
- Willingness to learn about cutting-edge technologies while cultivating expertise in a business domain/problem space.
- Aptitude for problem solving.
Responsibilities
- Build and maintain reliability systems such as health monitoring tools, observability dashboards, and alerting infrastructure, with support from senior team members.
- Collaborate closely within the SRE team and with adjacent product/infrastructure teams to embed reliability into features and processes.
- Participate in the SRE on-call rotation for incident response, learning blameless triage and contributing to retrospectives.
- Contribute to the development and maintenance of FinOps tools for enhanced cost visibility, usage transparency, and cloud efficiency.
- With support, implement metrics and monitoring to proactively surface operational issues.
- Contribute to and help refine reliability best practices, documentation, and internal wikis.
- Write clear, maintainable, and well-tested code as part of larger joint initiatives.
Other
- Bachelor’s degree in a relevant technical field, or equivalent practical experience.
- At least 2 years’ experience in software engineering or a related technical role, ideally involving cloud infrastructure, DevOps, or SRE.
- Ability to communicate effectively.
- Serious interest in having fun at work.
- You are a practical systems thinker, eager to understand how things work and how they can be improved, even if some ambiguity remains.