Get Jobs Tailored to Your Resume

Filtr uses AI to scan 1000+ jobs and finds postings that perfectly matches your resume

Student Researcher - Seed LLM Horizon - Multi-turn Tool Use

Salary not specified

Jul 31, 2025

San Jose, CA, USA

The Seed LLM Horizon Team is dedicated to developing the next-generation agent foundation model and building self-evolving, personalized Agent.

Research experience in one or more of the following: reinforcement learning, LLM agents, memory systems, tool use, or interactive learning.
Strong coding skills and proficiency with modern deep learning frameworks.
Demonstrated ability to conduct independent research, with publications in top-tier ML/AI conferences such as NeurIPS, ICML, ICLR, ACL, EMNLP etc.
Experience with long-horizon reasoning, multi-turn tasks, or asynchronous agent behavior.
Familiarity with agent evaluation, personalization, or real-world tool integration.
Background in building or analyzing large-scale agent training pipelines.

Enable models to perform deep usage of professional tools (e.g., search, code-interpreter) to solve complex problems.
Develop approaches to generalize model abilities to millions of out-of-distribution (OOD) tools and scenarios.
Scale up multi-turn tool-use training tasks and explore effective training methods.
Address challenges of long-horizon, multi-turn tasks in reinforcement learning.

Currently pursuing a PhD in Computer Science, Software Engineering, Machine Learning, or a related field.
Ability to collaborate effectively in a fast-paced, research-driven team environment.
Please state your availability clearly in your resume (Start date, End date).