NVIDIA is looking to expand its enterprise GPU management and monitoring tools by designing and building Linux-based management agents, CLI tools, and end-to-end integration solutions. This involves integrating GPUs with the data center software management ecosystem, maintaining build environments, CI/CD pipelines, and infrastructure, and supporting NVIDIA products across HPC, cloud, and enterprise on both bare metal and virtualized platforms.
Requirements
- 5+ years of meaningful industry experience with a strong C++ development background
- User space development and debugging expertise under Linux environments
- Experience packaging software for Linux package managers (DEB and RPM)
- Experience using Kitware utilities to manage builds (CMake, CPack, CTest)
- Experience with APIs and interface design
- Development experience with python, go, and rust.
- Experience developing CI/CD pipelines using GitLab-CI, GitHub Actions, or Jenkins
Responsibilities
- Develop robust, scalable C++ user space data center management system software under Linux
- Build and maintain user-space libraries, agents, plugins, bindings and CLI tools
- Enable GPU management integration with the OSS ecosystem, including Kubernetes and Docker
- Maintain build and CI/CD processes to deliver our product on CUDA-supported OSes.
- Support internal and external users through bug fixes, documentation and feature improvements
- Maintain high quality products through robust test coverage and smart design
Other
- BS or higher in Computer Science or equivalent experience.
- Outstanding written and verbal interpersonal skills.
- Strong motivation and commitment to learn new skills
- Ability to execute all aspects of the software development lifecycle.
- Ability to manage time in a fast, heavily multitasked environment