ML developers today face significant friction in taking trained models into deployment. They work in a highly fragmented space, with incomplete and patchwork solutions that require significant performance tuning and non-generalizable, model-specific enhancements. At Modular, we are building the next-generation AI platform that will radically improve the way developers build and deploy AI models.
Requirements
- Experience with HPC programming and accelerator languages such as CUDA, OpenCL, SYCL, etc.
- In-depth knowledge of low-level (micro)architectural performance is required.
- 4+ years of experience working on complex code and systems.
- Experience with performance modeling and performance data analysis.
- Understanding of parallelization techniques for ML/HPC acceleration.
- Experience with layout optimization found in libraries such as CUTLASS and CuTe.
- Experience with performance profilers, performance data analysis tools, visualization tools, and debugging, or experience working with embedded systems.
Responsibilities
- Design and optimize high-performance ML numeric and data manipulation kernels/operators.
- Use low-level C/C++/Assembly programming to achieve state-of-the-art performance.
- Potentially introduce new compiler and tools support.
- Work with compiler, framework, runtime and performance teams to deliver end-to-end performance that fully utilizes today’s complex server and mobile systems.
- Collaborate with architects and hardware engineers to co-design future accelerators, including defining the ISA for new hardware features and evolving existing ISAs.
- Collaborate with machine learning researchers to guide system development for future ML trends.
- Craft high-performance kernels for CPUs, GPUs, and emerging hardware architectures.
Other
- Candidates based in the US or Canada are welcome to apply.
- You can work in our office in Los Altos, CA or remotely from home.
- We offer relocation assistance for US-based candidates who will be relocating to work out of our Los Altos, CA office.
- Onboarding for new hires is conducted in-person in our Los Altos, CA office.
- Traveling 2-4 times a year is expected for all roles.