Amazon's AGI team is looking to solve the problem of advancing Generative Artificial Intelligence (GenAI) models by focusing on pre-training methodologies. They aim to push the boundaries of Large Language Models (LLMs) and multimodal systems to enhance customer experiences.
Requirements
- Experience with neural deep learning methods and machine learning
- Experience programming in Java, C++, Python or related language
- Relevant Generative Artificial Intelligence (GenAI) research experience with LLMs and multi-modalities
- For system researchers, familiarity with deep learning compilers, auto-parallelization, and XLA/MLIR ecosystems
Responsibilities
- Scaling laws
- Hardware-informed efficient model architecture, low-precision training
- Optimization methods, learning objectives, curriculum design
- Deep learning theories on efficient hyperparameter search and self-supervised learning
- Learning objectives and reinforcement learning methods
- Distributed training methods and solutions
- AI-assisted research
Other
- PhD, or Master's degree and 5+ years of applied research experience
- 3+ years of building machine learning models for business application experience
- Experience with patents or publications at top-tier peer-reviewed conferences or journals
- work safely and cooperatively with other employees, supervisors, and staff
- adhere to standards of excellence despite stressful conditions
- communicate effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service
- follow all federal, state, and local laws and Company policies