OCI Compute is looking for Systems/Software Developers with a strong Windows OS background to take on the challenge of engineering Compute GPU/HPC infrastructure solutions. You will build an imaging service for large-scale Compute/HPC/AI/ML customer workloads, with a focus on performance and strong availability guarantees for our customers.
Requirements
- Strong grasp of system design fundamentals and distributed systems architectural best practices
- Demonstrated ability to write great code in Java, Python, or similar OO languages
- Experience building highly available services, with knowledge of common service-oriented design patterns and service-to-service communication protocols
- Experience with HPC and GPU compute fundamentals
- Knowledge of core Linux/Windows operating systems, including systems tuning
- Experience with imaging tooling such as Ansible, Packer, and Oracle Image Builder
- Experience with Oracle Cloud Infrastructure (OCI)
Responsibilities
- Design and develop image automation software in Java, Python, and other languages.
- Apply engineering principles for defining robust and maintainable architectures and designs.
- Build cloud services on top of the modern Infrastructure as a Service (IaaS) building blocks at OCI
- Design and build distributed, scalable, fault-tolerant software systems
- Participate in the entire software lifecycle – development, testing, CI/CD and production operations
- Balance product feature development with production operational concerns such as ops automation, structured logging, metrics instrumentation, and on-call participation
- Solve complex Windows and hypervisor-level issues (e.g., QEMU), and build Platform and Custom GPU/HPC images that meet customer workload requirements
Other
- BS or MS degree in computer science, or equivalent
- 6-10+ years of experience developing and shipping enterprise distributed and/or cloud-native systems
- Strong desire to make an impact and thrive in collaborative and energetic environments
- Ability to communicate technical concepts effectively, both verbally and through written designs
- Ability to collaborate broadly across multiple disciplines, from hardware designers to HPC/GPU developers