Prime Video is looking to solve the problem of creating a best-in-class digital video experience for its customers, by applying advanced machine learning techniques in computer vision, Generative AI, multimedia understanding, and more.
Requirements
- PhD, or Master's degree and 4+ years of CS, CE, ML or related field experience
- 3+ years of building models for business application experience
- Experience programming in Java, C++, Python or related language
- Experience in generative models (diffusion, flow, transformers)
- Hands-on experience with image/video synthesis and editing techniques
- Proficiency in PyTorch and modern DL toolkits (e.g., Hugging Face ecosystem)
- Experience in professional software development
- Publications in top-tier AI/ML/Graphics Conferences (CVPR, ICCV/ECCV, SIGGRAPH, NeurIPS, ICLR)
Responsibilities
- Research and develop generative models for controllable synthesis across images, video, vector graphics, and multimedia
- Innovate in advanced diffusion and flow-based methods to improve efficiency, controllability, and scalability
- Advance visual grounding, depth and 3D estimation, segmentation, and matting for integration into pre-visualization, compositing, VFX, and post-production pipelines
- Design multimodal GenAI workflows including visual-language model tooling, structured prompt orchestration, agentic pipelines
Other
- PhD, or Master's degree and 4+ years of CS, CE, ML or related field experience
- Work safely and cooperatively with other employees, supervisors, and staff
- Adhere to standards of excellence despite stressful conditions
- Communicate effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service
- Follow all federal, state, and local laws and Company policies