Oracle Cloud Infrastructure (OCI) is looking for a Software Engineer to help shape the future of AI infrastructure and services at Oracle, focusing on critical components of OCI’s AI platform, including large-scale GPU cluster management, self-service ML infrastructure, and end-to-end model lifecycle capabilities.
Requirements
- 12+ years of experience shipping scalable, cloud native distributed systems
- Experience in Go, Java, Python.
- Experience with container orchestration like Kubernetes/Docker Swarm.
- Experienced at building highly available services, possessing knowledge of common service-oriented design patterns and service-to-service communication protocols
- Experience with components of modern infrastructure like containerization, software-defined networking
- Experience with production operations and best practices for putting quality code in production and troubleshoot issues when they arise
- Experience building control plane/data plane solutions for cloud native companies
Responsibilities
- Build cloud service on top of the modern Infrastructure as a Service (IaaS) building blocks at OCI
- Design and build distributed, scalable, fault tolerant software systems
- Participate in the entire software lifecycle – development, testing, CI and production operations
- Design and lead software projects without needing significant guidance and guide/mentor/coach junior engineers
- Balance between product feature development and production operational concerns like writing runbooks, ops automation, structured logging, instrumentation for metrics and events
- Leverage internal tooling at OCI to develop, build, deploy and troubleshoot software
- Participate in on-call for the service with the team
Other
- BS in Computer Science, or equivalent experience
- Able to effectively communicate technical ideas verbally and in writing (technical proposals, design specs, architecture diagrams and presentations)
- MS in Computer Science
- Experience in diagnosing, troubleshooting and resolving performance issues in complex environments
- Deep understanding of Unix-like operating systems