Building the intelligent backbone of Splunk Observability Cloud by applying the latest advances in data science and machine learning to extract value from petabyte-scale telemetry (metrics, traces, and logs)
Requirements
- Experience designing and building scalable cloud-based systems (AWS, Azure, or GCP), including container orchestration (e.g., Kubernetes, Docker)
- Experience with API design and frameworks (e.g., OpenAPI, GraphQL, gRPC, REST)
- Prior experience delivering RAG and agentic products to production
- Expertise with vibe coding tools (Claude Code, Codex, Copilot, Windsurf, Cursor) is a must
- Experience developing, deploying, and maintaining applications in an AWS environment with cloud-native solutions
- Experience monitoring and analyzing metric, trace, span, and log data
- Background in observability, generative AI, or model robustness
Responsibilities
- Apply the latest generative AI and agentic AI techniques to build AI features in Splunk Observability
- Collaborate across engineering and product teams to establish robust frameworks for evaluating AI systems’ trustworthiness and resilience
Other
- Master's degree in computer science or related field and 4+ years of software engineering experience, or bachelor's degree with 7+ years of experience
- 10 paid holidays per full calendar year, plus 1 floating holiday for non-exempt employees
- 16 days of paid vacation time per full calendar year, accrued at a rate of 4.92 hours per pay period for full-time employees
- 80 hours of sick time off provided on hire date and each January 1st thereafter, and up to 80 hours of unused sick time carried forward from one calendar year to the next
- Optional 10 paid days per full calendar year to volunteer