Ensure the data, services and infrastructures in one of the largest data platforms in the world that directly supports the TikTok app are reliable, fault-tolerant, efficiently scalable and cost-effective.
Requirements
- Experience writing code in Java, Go, Python or a similar language
- Experience with algorithms ,data structures, complexity analysis and software design
- Hands-on experience managing Clickhouse and/or Apache Doris clusters in production environments
- Knowledge of columnar storage, data compression, and query execution engines
- Experience managing distributed OLAP databases with sharding, replication, and high availability setups
- Hands-on experience with observability tools such as Prometheus & Grafana.
- Experience with schema evolution, data compaction strategies, and partition pruning to optimize storage and query performance.
Responsibilities
- Engage in and improve the whole lifecycle of service, from inception and design, through to deployment, operation and refinement
- Ensure reliable, fault-tolerant, efficiently scalable and cost-effective data, services and infrastructure
- Design and maintain high-availability architecture for Clickhouse & Doris, including replication, sharding, and failover mechanisms.
- Monitor and optimize Clickhouse & Doris clusters for low-latency, high throughput query performance.
- Troubleshoot slow queries, resource contention, and performance bottlenecks in distributed storage engines.
- Plan and execute cluster scaling strategies to handle data growth.
- Design and implement reliable, scalable, robust and extensible big data systems that support core products and business.
Other
- organization follows a hybrid work schedule that requires employees to work in the office 3 days a week, or as directed by their manager/department.
- Candidates for this position must be legally authorized to work in the United States. This position is not eligible for visa sponsorship or support.
- Systematic problem-solving approach, coupled with effective communication skills and a sense of drive
- This role requires the ability to work with and support systems designed to protect sensitive data and information. As such, this role will be subject to strict national security-related screening.