Enhance the search experience for hundreds of millions of users globally by developing and applying cutting-edge machine learning technologies in real-time large-scale systems, impacting search requests served daily.
Requirements
- 5 years of related experience in one or more of the following areas: NLP, LLM, RL.
- Proficient coding skills and strong algorithm & data structure foundation.
- Experience in using data-driven methods to enhance the capability of LLMs through various stages of the model development
- Experience in RAG, Prompt Engineering or other inference time methods to enhance the performance of the system
Responsibilities
- Conduct research and develop state-of-the-art algorithms in various stages of the development of LLM, including continued pretraining, SFT, RLHF;
- Investigate and implement robust evaluation methodologies to assess model performance at various stages, unravel the underlying mechanisms and sources of their abilities, and utilize this understanding to drive model improvements.
- Using inference stage techniques such as RAG, CoT, Prompt Engineering to improve the model output
- Improve the performance of AI Search in the TikTok app to provide better search experience for users
- Exploring and developing large-scale language models and optimizing enterprise applications to the extreme;
- Data construction, instruction tuning, preference alignment, and model optimization;
- Implementation of relevant applications, including content generation, summary etc.;
Other
- In order to enhance collaboration and cross-functional partnerships, among other things, at this time, our organization follows a hybrid work schedule that requires employees to work in the office 3 days a week, or as directed by their manager/department.
- This role requires the ability to work with and support systems designed to protect sensitive data and information. As such, this role will be subject to strict national security-related screening.
- Candidates with top-tier conference papers, including ICML, NeurIPS, ICLR, CVPR, ICRA, KDD etc., relevant internship experience or winners of ACM competitions are preferred;