Design, build, automate, and operate cloud-scale network monitoring software that keeps mission-critical services reliable for Oracle's global cloud.
Requirements
- demonstrated experience in distributed systems
- demonstrated experience in cloud services
- demonstrated production-grade debugging, design, and implementation skills
- solid coding experience
- a track record of building and operating distributed services
- bias for automation
- hands-on experience with distributed systems, data processing, and production debugging
Responsibilities
- design, build, automate, and operate cloud-scale network monitoring software
- define and evolve system architectures
- implement high-impact changes across new and existing services
- apply AI-driven operations automation, leveraging modern models and tooling, to improve observability, resiliency, and developer velocity
- design and build core services and automated test suites for our Network Monitoring and Analytics platform
- ingest high-throughput metrics, craft resilient data pipelines, and deliver real-time, online, and batch analytics
- apply modern architectures and AI-driven insights to power performance monitoring, what-if analysis, root-cause analysis, anomaly detection, prediction, and capacity planning
Other
- provide hands-on technical leadership to peers
- work on varied, complex problems requiring independent judgment and end-to-end ownership
- opportunities to lead projects and mentor other developers
- be fully competent in your domain
- communicate clearly across teams