Databricks aims to use GenAI and data technologies to let customers perform complex business tasks with minimal effort. The team is building new products from the ground up in the NYC Engineering office.
Requirements
- 5+ years of production-level experience in Java, Scala, C++, or a similar language.
- Experience building data processing or analytics systems.
- Experience developing large-scale distributed systems.
- Experience working on a SaaS platform or with Service-Oriented Architectures.
- Experience with cloud technologies, e.g. AWS, Azure, GCP, Docker, or Kubernetes.
- Experience with security and systems that handle sensitive data.
- Good knowledge of SQL.
Responsibilities
- Build GenAI evals and post-training infrastructure so that our agents keep learning while they do work for our customers.
- Implement data infrastructure that processes massive amounts of data flexibly at interactive latencies (milliseconds to seconds).
- Create a single, unified infrastructure that supports real-time, micro-batch, and batch data processing.
- Create data processing engines for novel types of analytical workloads on massive datasets, such as time-series analysis of events, set operations (segmentation), and visualization of complex flows.
Other
- A high-velocity mindset: the ability to ship high-quality code quickly.
- Experience with systems that process or analyze customer data, and with related use cases, is a plus.