The Artificial General Intelligence (AGI) team at Amazon is working to advance the state of the art in multi-modal systems, specifically generative AI (GenAI) and multi-modal Large Language Models (LLMs) in Computer Vision, to provide the best possible experience for customers.
Requirements
- Experience programming in Java, C++, Python or related language
- Experience with deep learning libraries such as PyTorch, TensorFlow, or MXNet
- Experience with multi-modal LLMs and GenAI in Computer Vision, in both the image and video domains
- PhD, or Master's degree and 4+ years of CS, CE, ML or related field experience
- 1+ years of experience building models for business applications
- Research publications in computer vision, deep learning or machine learning at peer-reviewed workshops, conferences or journals
- Experience with multi-modal systems
Responsibilities
- Develop algorithms and modeling techniques to advance the state of the art with multi-modal systems
- Leverage Amazon’s large-scale computing resources to accelerate development with multi-modal Large Language Models (LLMs) and GenAI in Computer Vision
- Work with talented peers to develop algorithms and modeling techniques
- Build models for business applications
- Work with multi-modal LLMs and GenAI in Computer Vision, both in the image and video domains
- Develop industry-leading technology with generative AI (GenAI) and multi-modal systems
- Advance the state of the art with multi-modal systems
Other
- Work safely and cooperatively with other employees, supervisors, and staff
- Adhere to standards of excellence despite stressful conditions
- Communicate effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service