NVIDIA is seeking to define the next era of computing through AI by developing efficient, scalable, resilient, and trustworthy systems for training, fine-tuning, and serving ML models across all scales, from personal devices to warehouse-scale data centers.
Requirements
- Recent graduate with a Ph.D. in CS/CE/EE with a strong background in operating systems, distributed systems, inference and training systems, data management systems, networking, cloud computing, and/or computer architecture (or equivalent experience).
- Demonstrated expertise in one specific area, with the ability to become the go-to resource within a team of varied backgrounds.
- Background in experimental research and development.
- Experience with C, C++, Python, and/or scripting languages.
- Experience using AI tools for analysis, design, and code development.
Responsibilities
- Understand and analyze the efficiency, scaling, and resilience challenges in ML systems, algorithms, and applications.
- Develop creative systems solutions (hardware, software, infrastructure) for future ML systems of all scales.
- Contribute to the co-design of next-generation AI/ML algorithms and systems.
- Collaborate with a diverse set of research and product teams across the company, spanning software, hardware, AI, and networking.
- Publish original research and speak at conferences and events.
Other
- A strong publication, patent, and research collaboration history is a huge advantage.
- Applications for this job will be accepted at least until December 20, 2025.
- NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer.