The business problem that this job is looking to solve is to leverage the latest advancements in data science and machine learning to unlock unprecedented value from massive volumes of telemetry—metrics, traces, and logs—at petabyte scale for Splunk Observability Cloud.
Requirements
- Experience designing and building scalable cloud-based systems (AWS, Azure, or GCP), including container orchestration (e.g., Kubernetes, Docker).
- Proven experience in technical leadership, architecture design, and end-to-end feature ownership in AI/ML or platform domains.
- Experience with API design and frameworks (e.g. OpenAPI, GraphQL, gRPC, REST, etc.)
- Prior working experience in delivering RAG and Agentic products into production
- Expert at using vibe coding tools (Claude Code, Codex, Copilot, Windsurf, Cursor) is a must
- Up To Date knowledge on the latest Agentic and Generative AI industry trend and framework
Responsibilities
- Apply the latest Generative AI and Agentic AI to enable AI features in Splunk Observability
- Collaborate across engineering and product teams to establish robust frameworks for evaluating AI systems’ trustworthiness and resilience.
- Provide technical leadership and mentorship within the team, establishing leading practices for development, testing, and artifact management.
Other
- Master's degree in computer science or related field and 7+ years of software engineering experience, or bachelor's degree with 10+ years of experience.
- 10 paid holidays per full calendar year, plus 1 floating holiday for non-exempt employees
- 16 days of paid vacation time per full calendar year, accrued at rate of 4.92 hours per pay period for full-time employees
- 80 hours of sick time off provided on hire date and each January 1st thereafter, and up to 80 hours of unused sick time carried forward from one calendar year to the next
- Optional 10 paid days per full calendar year to volunteer