Honeycomb is looking to solve problems related to the performance, reliability, and scalability of storing and querying billions of events to power their observability platform.
Requirements
- Technical leadership in storage and querying.
- Led the design of patterns and guidelines that improve performance and reliability.
- Expertise in writing Go along with experience working on high-throughput or complex systems.
- Exposure to Kubernetes is a plus.
- Deep debugging expertise.
- Diagnose and resolve the toughest production issues across the stack, restoring reliability where others may be stuck.
- Influence beyond your team.
Responsibilities
- Design and implement solutions that power Honeycomb’s query and data storage infrastructure.
- Improving query performance, evolving the Retriever Service, or making our storage layer more reliable.
- Breaking down complex storage and querying problems into achievable steps, surfacing trade-offs, and pulling in teammates when needed.
- Take ownership end-to-end—from design to production support—reducing toil and ensuring our systems deliver measurable customer and business impact.
- Participate in on-call rotations, helps define team KPIs and SLOs, and proactively reduces toil by addressing reliability challenges before they reach customers.
- Explore new technologies, data storage techniques, and query optimization strategies to keep performance and reliability strong at scale.
- Pairing, reviewing code, and offering constructive feedback.
Other
- Remote-first company.
- Trust, autonomy, and accountability from Day 1.
- Collaborate with Product Management and other engineering teams.
- Focus on business outcomes.
- Strength in mentorship and teamwork.