Job Board
LogoLogo

Get Jobs Tailored to Your Resume

Filtr uses AI to scan 1000+ jobs and finds postings that perfectly matches your resume

ByteDance Logo

Student Researcher [Seed Vision – Multimodal Interaction & World Model Pretraining] – 2026 Start (PhD)

ByteDance

$65 - $65
Sep 12, 2025
San Jose, CA, US
Apply Now

ByteDance's Seed Multimodal Interaction and World Model team is looking to solve the problem of developing models with human-level multimodal understanding and interaction capabilities, and to advance the exploration and development of multimodal assistant products by employing pre-training and simulation technologies.

Requirements

  • Currently pursuing a PhD in Computer Vision, Machine Learning, or a related technical field.
  • Familiarity with multimodal modeling, world models, or foundation model pretraining.
  • Strong coding skills and hands-on experience with PyTorch or JAX.
  • Experience with large-scale distributed training frameworks and GPU/TPU compute stacks.
  • Demonstrated research ability, with publications in top-tier conferences such as CVPR, ICCV, ECCV, NeurIPS, ICML, or ICLR.
  • Experience working with transformer-based architectures, including dense and Mixture-of-Experts (MoE) models.
  • Understanding of scaling behavior in foundation models and how to analyze them.

Responsibilities

  • Contribute to research and engineering to advance world models and multimodal understanding, enhancing models' reasoning and generation capabilities.
  • Design and prototype novel architectures that balance modeling performance, generalization, and efficiency.
  • Help establish scaling laws and conduct systematic ablations to derive transferrable insights across model families and tasks.
  • Employ pre-training and simulation technologies to model various environments of the virtual and physical world, providing foundational capabilities for multimodal interactive exploration.

Other

  • Currently pursuing a PhD in Computer Science, Machine Learning, or a related technical field.
  • PhD Internships at ByteDance aim to provide students with the opportunity to actively contribute to our products and research, and to the organization's future plans and emerging technologies.
  • Our dynamic internship experience blends hands-on learning, enriching community-building and development events, and collaboration with industry experts.
  • Applications will be reviewed on a rolling basis – we encourage you to apply early.
  • Please state your availability clearly in your resume (Start date, End date).