TikTok is looking to keep its services performing optimally at a reasonable operational cost, while improving the observability and operability of those services.
Requirements
- Solid foundational knowledge of computer software, including the relevant principles of the Linux operating system, storage, network I/O, etc.
- Familiar with one or more programming languages, such as Python/Go/Java/PHP/C/C++
- Solid understanding of Docker, Kubernetes, or other container orchestration systems
- Experience developing, deploying, and/or maintaining a microservices architecture on Kubernetes
Responsibilities
- Participate in the on-call rotation for the service you support, and work to improve both incident response processes and the observability of the service
- Help build operations and maintenance tools and platforms, and improve the operability of the services
- Ensure the stability of the service by proactively monitoring risks and working with stakeholders to resolve them
- Identify key system risks by analyzing operational data comprehensively to review system health
- Build up best practices in operations, and propose new approaches to improve operations at TikTok
Other
- Bachelor's degree or above in Computer Science or a related field
- Ability to solve problems systematically, good communication skills, and a strong sense of responsibility
- 1-5 years of relevant work experience at a large-scale internet business
- Day-one access to medical, dental, and vision insurance, a 401(k) savings plan with company match, paid parental leave, short-term and long-term disability coverage, life insurance, and wellbeing benefits