TikTok is looking to solve the complex challenges of scale in large, massively distributed, fault-tolerant systems.
Requirements
- Proficient in at least one of the following programming languages: Python, Go, Java, Shell
- Familiar with Linux system internals, networking, and distributed systems
- Experience with technologies such as MySQL, Redis, Kubernetes, Docker, Hadoop, Spark, Flink, and HDFS
- Experience in designing and analyzing large-scale distributed systems
Responsibilities
- Engage in and improve the whole lifecycle of services, from inception and design, through development, capacity planning, and launch reviews, to deployment, operation, and automation
- Design and implement various dashboards and monitoring frameworks for efficient, automated, and intelligent service-oriented architecture (SOA) governance
- Scale systems elastically through mechanisms such as automation; improve system reliability, efficiency, and velocity by driving changes
- Practice efficient customer support, incident response, and blameless postmortems
Other
- Bachelor's degree in Computer Science or a related technical field
- Industry or internship experience at accredited internet or cloud companies
- Strong interpersonal and communication skills
- Must be legally authorized to work in the United States
- Ability to work in the office 3 days a week, or as directed by the department