NVIDIA is seeking to build a worldwide network of fast, efficient, and reliable data transfer systems that supports NVIDIA AI workflows through self-service data movement to GPU clusters around the world.
Requirements
- A strong track record of delivering distributed services in a variety of distributed computing environments, including CSPs such as AWS, Azure, and Google
- History of ownership of product delivery from inception to support
- Prior experience developing distributed systems with Golang and/or scalable web services in Python or similar
- Experience implementing file transfer or synchronization protocols or object storage ecosystem tooling
- Previous work designing complex systems that run in containers orchestrated by Kubernetes
- Experience building high-scale distributed systems such as distributed databases, storage systems, or cloud services
- Proficiency in Golang, Python, or both
Responsibilities
- Design, implement, deploy, and maintain innovative software solutions for data ingestion, movement, and management
- Solve technical problems spanning the areas of orchestration, service modeling, API development, monitoring, and automation
- Build highly reliable distributed systems that our customers can depend on for their own businesses
- Apply a strong understanding of distributed systems development, object storage, network file transfer protocols, and file systems
- Deliver distributed services in a variety of distributed computing environments, including CSPs such as AWS, Azure, and Google
Other
- Bachelor of Science in Computer Science or a related field (or equivalent experience)
- 5+ years of experience
- Applications for this job will be accepted at least until October 18, 2025
- NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer
- You will also be eligible for equity and benefits