Job Board
LogoLogo

Get Jobs Tailored to Your Resume

Filtr uses AI to scan 1000+ jobs and finds postings that perfectly matches your resume

NVIDIA Logo

Senior Deep Learning Software Engineer, Inference

NVIDIA

$148,000 - $287,500
Oct 15, 2025
Santa Clara, CA, US
Apply Now

NVIDIA seeks to improve the performance and efficiency of deep learning inference for AI applications by designing, building, and optimizing GPU-accelerated software

Requirements

  • Excellent C/C++ programming and software design skills
  • Prior experience with training, deploying or optimizing the inference of DL models in production is a plus
  • Prior background with performance modeling, profiling, debug, and code optimization or architectural knowledge of CPU and GPU is a plus
  • Python experience is a plus
  • Experience with Multi-GPU Communications (NCCL, NVSHMEM)
  • GPU programming experience (CUDA, OAI TRITON or CUTLASS)
  • Experience with deep learning frameworks like PyTorch, vLLM, and SGLang

Responsibilities

  • Performance optimization, analysis, and tuning of DL models in various domains like LLM, Multimodal and Generative AI
  • Scale performance of DL models across different architectures and types of NVIDIA accelerators
  • Contribute features and code to NVIDIA’s inference libraries, vLLM and SGLang, FlashInfer and LLM software solutions
  • Work with cross-collaborative teams across frameworks, NVIDIA libraries and inference optimization innovative solutions
  • Implement the latest algorithms for public release in frameworks like SGLang and vLLM, as well as other DL frameworks
  • Identify and drive performance improvements for state-of-the-art LLM and Generative AI models across NVIDIA accelerators
  • Implement and optimize model serving pipelines using open-source tools and plugins—including CUTLASS, OAI Triton, NCCL, and CUDA kernels

Other

  • Masters or PhD or equivalent experience in relevant field (Computer Engineering, Computer Science, EECS, AI)
  • 5+ years of relevant software development experience
  • SW Agile skills are helpful
  • Experience building and shipping products to enterprise customers
  • Travel requirements not specified