Job Board
LogoLogo

Get Jobs Tailored to Your Resume

Filtr uses AI to scan 1000+ jobs and finds postings that perfectly matches your resume

ByteDance Logo

Student Researcher (Doubao (Seed) - Foundation Model - Video Generation) - 2025 Start (PhD)

ByteDance

$57 - $57
Sep 20, 2025
Seattle, WA, US
Apply Now

Doubao Vision team is looking to solve the visual intelligence problem for AI by conducting cutting-edge research on areas like vision and language, large vision models, and generative foundation models, and applying these technologies to their rich application scenarios.

Requirements

  • Research experience in multi-modal understanding, vision and language, such as video captioning, VQA, Text-to-video retrieval, audio/music understanding and generation, and other related topics.
  • Publications in top-tier venues, such as CVPR, ECCV, ICCV, NeurIPS, ICLR, ICML, EMNLP, ACL, COLING, etc.
  • Highly competent in algorithms and programming; Strong coding skills in Python and popular deep learning frameworks.

Responsibilities

  • Conduct cutting-edge research and development in foundation model and multimodal machine learning, especially in the areas of generative AI (e.g. image, video generation).
  • The primary objective is to research cutting-edge video generation technology through innovation.
  • Develop the foundation model to enhance the strategic advantages for ByteDance products
  • Explore new downstream products with artificial intelligence technology at its core.

Other

  • Currently pursuing a PhD in Software Development, Computer Science, Computer Engineering, or a related technical discipline.
  • Must obtain work authorization in country of employment at the time of hire, and maintain ongoing work authorization during employment.
  • Work and collaborate well with team members.
  • Ability to work independently; Strong communication skills.