The AGI (Artificial General Intelligence) Computing Lab is dedicated to solving the complex system-level challenges posed by the growing demands of future AI/ML workloads. Our team is committed to designing and developing scalable platforms that can effectively handle the computational and memory requirements of these workloads while minimizing energy consumption and maximizing performance.
Requirements
- Research experience with hardware architectures such as CPUs, GPUs, TPUs, and NPUs.
- Published papers in Microarchitecture and Systems conferences preferred.
- Experiences in developing simulators for high-performance computing systems
- Understanding of system level characteristics of LLM, DLRM and CNN and other ML workloads.
- Familiarity with PyTorch, Tensorflow, or JAX.
Responsibilities
- Research novel techniques in collective communications for high-performance deep learning applications.
- Research design space exploration methodology and pruning for efficient architecture exploration.
- Design algorithms and microarchitectural features to optimize data locality to minimize energy consumption.
- Work closely with the compiler team to integrate new ML techniques and algorithms into the compiler.
- Contribute to ML compiler and architecture research and write research papers.
- Stay up-to-date with the latest trends and advancements in the field of ML architectures.
- Complete other responsibilities as assigned.
Other
- Pursuing Masters, or PhD in Computer Science or Electrical Engineering preferred.
- Strong analytical and problem-solving skills
- Excellent communication and interpersonal skills
- Ability to work independently and as part of a team
- Daily onsite presence at our San Jose, CA headquarters in alignment with our Flexible Work policy