Waymo is looking to solve the problem of improving access to mobility and saving lives lost to traffic crashes by developing and deploying its autonomous driving technology, the Waymo Driver. This internship role specifically aims to contribute to the underlying computational and hardware optimization aspects of this technology.
Requirements
- GPU, Pallas or other accelerator programming
- Computer architecture fundamentals
- Professional competency in C++ or Python
- Experience with compilers for CPU or GPU or ML accelerators
- Experience with NVIDIA CUTLASS, CuTile, Pallas, or Triton-like Domain Specific Languages
- Experience with MLIR, JAX, ONNX, StableHLO or similar compiler intermediate representations
Responsibilities
- Understand difficult-to-visualize manipulations and computations on 4-5 dimensional tensors, devise ways to express them cleanly and implement them on domain-specific architectures
- Deep dive into the capabilities of unfamiliar hardware to map operations to it
- Quickly create minimally featured software prototypes with light automated testing to explore a design space
Other
- This will be a hybrid onsite internship position.
- We will accept resumes on a rolling basis until the role is filled.
- To be in consideration for multiple roles, you will need to apply to each one individually - please apply to the top 3 roles you are interested in.