Job Board
LogoLogo

Get Jobs Tailored to Your Resume

Filtr uses AI to scan 1000+ jobs and finds postings that perfectly matches your resume

Modular Logo

Kernel Engineer

Modular

$167,000 - $242,000
Sep 12, 2025
Mountain View, CA, USA •
Apply Now

ML developers today face significant friction in taking trained models into deployment. They work in a highly fragmented space, with incomplete and patchwork solutions that require significant performance tuning and non-generalizable/ model-specific enhancements. At Modular, we are building the next generation AI platform that will radically improve the way developers build and deploy AI models.

Requirements

  • Experience with HPC programming and accelerator languages such as CUDA, OpenCL, SYCL, etc.
  • In-depth knowledge of low-level (micro)architectural performance is required.
  • 4+ years of experience working on complex code and systems.
  • Experience with performance modeling and performance data analysis.
  • Understanding of Parallelization techniques for ML / HPC Acceleration.
  • Experience with layout optimization found in libraries such as CUTLASS and CUTE.
  • Experience with performance profilers, performance data analysis tools, visualization tools, and debugging or experience working with embedded systems

Responsibilities

  • Design and optimize high-performance ML numeric and data manipulation kernels/operators.
  • Utilize low-level C/C++/Assembly programming to achieve state of the art performance.
  • Your work will also entail potentially introducing new novel compiler and tools support.
  • Work with compiler, framework, runtime and performance teams to deliver end-to-end performance that fully utilizes today’s complex server and mobile systems.
  • Collaborate with architects and hardware engineers to co-design future accelerators, including ISA for new hardware features and evolving ISA.
  • Collaborate with machine learning researchers to guide system development for future ML trends.
  • crafting high-performance kernels for CPUs, GPUs, and emerging hardware architectures.

Other

  • Candidates based in the US or Canada are welcome to apply.
  • You can work in our office in Los Altos, CA or remotely from home.
  • We offer relocation assistance for US-based candidates who will be relocating to work out of our Los Altos, CA office.
  • Onboarding for new hires is conducted in-person in our Los Altos, CA office.
  • Traveling 2-4 times a year is expected for all roles.