Bytedance's data infrastructure Site Reliability Engineering (SRE) team is looking for interns to help design, build, and manage large-scale, highly distributed systems, contributing to the next chapter of data infrastructure and shaping the future of technology.
Requirements
- Experience programming in one of the following Languages: C, C++, Java, Python, Go, and Rust
- Knowledge of Unix/Linux system internals, networking, and distributed systems
- Experience in MySQL, Redis, Ngnix, Kubernetes, Docker, OpenStack, Hadoop, Spark, Flink, etc.
Responsibilities
- Participate in and enhance the complete service lifecycle, from inception and design, through development, capacity planning, launch reviews, deployment, operation, and refinement.
- Design and implement software platforms and monitoring frameworks to govern service-oriented architecture (SOA) efficiently, automatically, and intelligently.
- Develop and manage components of cloud-managed data infrastructure, encompassing technologies such as Kubernetes, Redis, MySQL, Flink, and more.
- Establish sustainable mechanisms for scaling systems, such as automation, to drive enhancements in reliability, efficiency, and velocity.
- Provide sustainable user support, manage incident responses, and conduct blameless postmortems as part of our ongoing efforts to improve our systems.
Other
- Must be able to commit to a 12-week full-time work period during Summer or Fall 2026.
- Strong skills in fast learning and communication