NVIDIA's Deep Learning Libraries Group is seeking excellent software engineers to enable the next wave of NVIDIA's highest-performing deep learning libraries. The mission is to design and develop scalable, modular infrastructure that streamlines development, builds, and tests across NVIDIA's diverse set of platforms, from Drive AGX for autonomous vehicles to DGX servers for datacenters and large language models.
Requirements
- Strong programming skills in Python (or similar) and familiarity with C/C++ development
- Experience setting up, maintaining, and automating continuous integration systems (e.g. Jenkins, GitHub Actions, GitLab pipelines, Azure DevOps)
- Experience with HTML5, CSS, Node.js, or React
- Fluency in SCM (e.g. Git, Perforce) and build systems (e.g. Make, CMake, Bazel)
- Background with distributed systems and cluster/cloud computing, especially with Kubernetes
- Prior experience designing and developing automation in Jenkins with Groovy (or similar)
- A strong understanding of unit and integration test frameworks, and experience building them
Responsibilities
- Designing and developing software for testing and analysis of our codebases
- Building scalable automation for build, test, integration, and release processes for publicly distributed deep learning libraries
- Developing throughout the software stack, from the user experience and user interfaces down to the cluster and database layers
- Configuring, maintaining, and building upon deployments of industry-standard tools (e.g. Kubernetes, Jenkins, Docker, CMake, GitLab, Jira)
- Advancing the state of the art in those industry-standard tools
- Developing front-end solutions using HTML, CSS, JavaScript, and related web technologies
Other
- A Master's degree in Computer Science or Computer Engineering, or equivalent experience
- 3+ years of relevant experience
- Are you creative, driven, and autonomous?
- Do you love a challenge?
- NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer.