NVIDIA is seeking to advance the capabilities of large language and multimodal models through research and development, aiming to transfer this research into new products and solutions.
Requirements
- Python, C++, CUDA, deep learning frameworks (PyTorch, TensorFlow, JAX, etc.)
- Strong background in research with publications at top conferences.
- Experience with large-scale model training is a plus.
- Transformer architectures
- Knowledge distillation and data synthesis
- Long-context methods
- Model compression and pruning
Responsibilities
- Research and develop novel methods for advancing the capabilities of large language and multimodal models.
- Collaborate with other team members, teams, and/or external researchers.
- Transfer your research to product groups to enable new products or types of products.
- Deliverables include prototypes, patents, products, and/or published original research.
Other
- Must be actively enrolled in a university pursuing a PhD degree in Computer Science, Electrical Engineering, or a related field, for the entire duration of the internship.
- Excellent communication and collaboration skills.