ByteDance is looking to solve the problem of designing, building, and operating a global, intelligent network infrastructure to meet the requirements of high availability, scalability, and high-performance for its hyperscale data-center networking solutions.
Requirements
- Experience with at least one of the following areas: network protocols like TCP and RoCEv2, have experience in network programming based on Socket and verbs APIs;
- Be familiar with Data Center congestion control algorithms, understand their pros and cons;
- Knowledge of scale-up protocols like PCIe, NVLink, UALink, and their differences with scale-out network protocols;
- Be familiar with the latest advances in the area of high-speed network systems, including RDMA, congestion control, AI network optimization and so on;
- Proficiency in one or several mainstream programming languages, including C/C++, Python, Go and so on;
- Have some knowledge of GPU architecture;
- Experience in developing high performance communication frameworks (including NCCL, MPI and RPC libraries) is a plus.
Responsibilities
- Design, optimization, implementation and deployment of high-performance transport protocols to support AI/LLM applications.
- Design, optimization, implementation and deployment of congestion control algorithms to support AI/LLM applications.
- Research and development of high-performance AI communication framework, network protocol stacks, and co-design optimization of host-network-application to improve the scalability, reliability and performance of AI/LLM networks.
- Follow the latest technologies from academia and industry, identify the innovative parts of the system and present in academic papers.
Other
- Currently pursuing a PhD in computer networking or a related technical discipline.
- Please state your availability clearly in your resume (Start date, End date).
- Interns have day one access to health insurance, life insurance, wellbeing benefits and more.
- Interns also receive 10 paid holidays per year and paid sick time (56 hours if hired in first half of year, 40 if hired in second half of year).
- Interns who are not working 100% remote may also be eligible for housing allowance.