Empower content understanding for TikTok's short-video business by developing advanced LLM/MLLM algorithms and applications to improve key business metrics and deliver state-of-the-art research outputs.
Requirements
- Proven experience in multimodal content understanding, with expertise in large language models (LLMs) and familiarity with cutting-edge progress in the field.
- Strong technical foundation in at least one major deep learning framework (e.g., PyTorch, TensorFlow).
- Hands-on experience deploying content understanding solutions in search, advertising, recommendation, or related domains.
Responsibilities
- Lead multimodal algorithm development for TikTok’s short-video business, explore applications of multimodal technologies in recommendation systems and other scenarios to improve key business metrics.
- Conduct cutting-edge research in multimodal and MLLM technologies, design advanced algorithms to solve business requirements while achieving technical breakthroughs.
- Drive engineering deployment and implementation, ensuring model stability, scalability, and efficiency in production environments.
- Focus on key areas including (but not limited to): General AI platform design and development, including few-shot/zero-shot on MLLM, AI-labeling, auto prompting, active-learning, continue pretraining and RL.
- Integration of content understanding with recommendation systems (e.g., UGC ecosystems, cold start, interest exploration, comment understanding).
- Leveraging multimodal techniques to develop next-generation recommendation systems, such as generative models and end-to-end approaches.
Other
- Proactive mindset, strong sense of ownership, excellent communication skills, and ability to collaborate across teams.
- Currently pursuing a Master degree with a background in computer science, machine learning, or similar fields.
- Able to commit to working for 12 weeks during Summer 2026.