Enhancing the shopping experience on Amazon through the conversational capabilities of large language models, and driving breakthrough innovations in LLM inference efficiency.
Requirements
- Experience programming with at least one software programming language
- Experience with Machine and Deep Learning toolkits such as MXNet, TensorFlow, Caffe and PyTorch
- Experience with CUDA, cuDNN, cuBLAS and other GPU kernel-level optimization techniques
- Experience in CUDA programming and GPU kernel development
- Experience in Neuron hardware (Inferentia and Trainium chips) and NKI kernel optimization
Responsibilities
- architect, design, develop, and optimize high-performance kernel implementations for large language model.
- contribute to creating and optimizing innovative kernels, custom operators, and low-level optimizations that maximize hardware utilization and minimize computational overhead.
- build expertise in kernel development, memory management, and parallel computing that dramatically reduce inference latency and boost throughput for transformer-based models.
- develop kernel fusion techniques, attention mechanism optimizations, and matrix multiplication accelerations at scale, partnering with engineers and scientists in a fast-paced environment to deliver measurable performance gains.
- contribute to our technical roadmap, performance benchmarking, and optimizations focused on kernel-level improvements.
Other
- 3+ years of non-internship professional software development experience
- 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- Bachelor's degree in computer science or equivalent
- Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $129,300/year in our lowest geographic market up to $223,600/year in our highest geographic market.