Databricks aims to enable data teams to solve complex problems by providing the world's best data and AI infrastructure platform, simplifying the data lifecycle from ingestion to ML/AI with a unified platform, and replacing traditional data warehouse architectures with a Lakehouse architecture to address challenges like data staleness, reliability, cost, lock-in, and limited use-case support.
Requirements
- You have implementation skills with a general purpose program language, such as Java, or C++
- You have knowledge of algorithms, data structures, and OOP principles
- A passion for database systems, storage systems, distributed systems, language design, or performance optimization
Responsibilities
- Query compilation and optimization
- Distributed query execution and scheduling
- Vectorized execution engine
- Data security
- Resource management
- Transaction coordination
- Efficient storage structures (encodings, indexes)
Other
- You will graduate in Fall 2025 or Spring 2026 with a degree in Computer Science, Engineering, or related subject
- You have experience managing end-to-end projects