Job Board
LogoLogo

Get Jobs Tailored to Your Resume

Filtr uses AI to scan 1000+ jobs and finds postings that perfectly matches your resume

ByteDance Logo

Student Researcher - Seed LLM Horizon - Multi-turn Tool Use

ByteDance

Salary not specified
Jul 31, 2025
San Jose, CA, USA
Apply Now

The Seed LLM Horizon Team is dedicated to developing the next-generation agent foundation model and building self-evolving, personalized Agent.

Requirements

  • Research experience in one or more of the following: reinforcement learning, LLM agents, memory systems, tool use, or interactive learning.
  • Strong coding skills and proficiency with modern deep learning frameworks.
  • Demonstrated ability to conduct independent research, with publications in top-tier ML/AI conferences such as NeurIPS, ICML, ICLR, ACL, EMNLP etc.
  • Experience with long-horizon reasoning, multi-turn tasks, or asynchronous agent behavior.
  • Familiarity with agent evaluation, personalization, or real-world tool integration.
  • Background in building or analyzing large-scale agent training pipelines.

Responsibilities

  • Enable models to perform deep usage of professional tools (e.g., search, code-interpreter) to solve complex problems.
  • Develop approaches to generalize model abilities to millions of out-of-distribution (OOD) tools and scenarios.
  • Scale up multi-turn tool-use training tasks and explore effective training methods.
  • Address challenges of long-horizon, multi-turn tasks in reinforcement learning.

Other

  • Currently pursuing a PhD in Computer Science, Software Engineering, Machine Learning, or a related field.
  • Ability to collaborate effectively in a fast-paced, research-driven team environment.
  • Please state your availability clearly in your resume (Start date, End date).