NVIDIA is looking to expand its enterprise GPU management and monitoring tools by designing and building cloud-native management agents, Kubernetes integrations, and end-to-end integration solutions that combine GPUs with the rest of the datacenter software management ecosystem.
Requirements
- Strong Go and Kubernetes development background
- User space development and debugging expertise in Linux environments
- Experience with APIs and interface design
- Development experience with Rust, Python and/or C, C++.
- Development experience with distributed systems and concurrent applications, especially in a Kubernetes environment
- Experience developing and maintaining enterprise software.
- Experience deploying, managing, and debugging applications in a Kubernetes environment
Responsibilities
- Develop and maintain distributed, robust and scalable Go programs deployed to Kubernetes environments that manage large datacenters
- Develop and maintain user-space applications, containers, Go-bindings, and CLI tools.
- Enable GPU management integration with the state-of-the-art open-source ecosystem, including Kubernetes and Docker.
- Support internal and external users through bug fixes, documentation, and feature improvements.
- Maintain high-quality products through robust test coverage.
Other
- 5+ years of meaningful industry experience
- Outstanding written and verbal interpersonal skills. Business level English
- Strong motivation and commitment to learn new skills
- Ability to execute all aspects of the software development lifecycle. Ability to manage time in a fast, heavily multitasked environment