Apple's Cassandra Storage team develops storage systems that are correct, reliable, scalable, and fast. This work requires an innovative spirit and an extraordinary degree of care and rigor in engineering. Team members contribute to all major components of Apache Cassandra, including query coordination and execution, replication and persistence, transactions and consensus, compaction, client and internode messaging, and all other aspects of the database. As a member of this team, you will build and evolve major components of the database.
Requirements
- Fundamentals of system-level hardware and networking components (storage devices and controllers, network interfaces, CPU and memory layout in server-class systems).
- Operating systems concepts (process scheduling, disk and network I/O, performance).
- Datacenter architecture (networking topologies, host placement strategies, and failure modes); design of multi-datacenter systems; failure domains; and wide-area networking.
- Understanding of distributed systems concepts (fallacies of distributed computing, CAP, FLP, etc).
- Understanding of database concepts (consistency models, isolation levels, crash and recovery semantics).
- Advanced concepts such as failure detection, smart clients, load balancing, request pipelining, speculation / retry policies, and operational semantics of high-throughput distributed systems.
- Performance engineering (design concepts, profile-guided optimization).
- Software validation concepts (fault injection, property-based testing and model checking, workload replay, quality metrics).
- Interest and foundations for expertise in developing distributed systems including concepts such as traffic and load balancing; quota and rate limiting; multi-tenant isolation; and security engineering.
- Understanding of the fundamentals of database systems, storage engines, or performance engineering.
- Understanding of key data structures and algorithms in storage and indexing.
- Proficient in modern Java.
Responsibilities
- build and evolve major components of the database
- Traffic and load balancing
- Security and authorization
- Quota and rate limiting
- Tenant isolation
- contribute to all major components of Apache Cassandra, including query coordination and execution, replication and persistence, transactions and consensus, compaction, client and internode messaging, and all other aspects of the database
- develops storage systems that are correct, reliable, scalable, and fast
Other
- excellent communication
- ability to partner with our Site Reliability peers
- a high degree of customer focus when engaging with internal platform customers
- Ability to work effectively with colleagues based in other locations is also essential; experience in this area is a plus
- MS or Ph.D in Computer Science-related fields or 3+ years of equivalent work experience