AI Accelerator Software Engineer - Runtime Library

Ampere

$169,500 - $297,500
Oct 30, 2025
Santa Clara, CA, United States of America

Ampere aims to advance AI capabilities by developing and optimizing AI frameworks and accelerators for high-performance, energy-efficient, and sustainable cloud computing.

Requirements

  • Previous work experience developing a user-mode driver or runtime library for a GPU or deep learning accelerator in a Linux environment.
  • Strong expertise in programming languages such as Python and C/C++, with a strong background in performance tuning.
  • Previous software development experience with a focus on AI frameworks (PyTorch, llama.cpp, ONNX, etc.) is a big plus.
  • Solid understanding of AI and machine learning concepts, including neural networks and data processing frameworks, is also preferred.
  • Experience with high-performance computing systems and cloud-based architectures.

Responsibilities

  • Build a runtime library that enables multiple frameworks and serving platforms for the Ampere deep learning accelerator.
  • Go deep into the entire SW/HW stack to accelerate deep learning, including but not limited to inference serving, framework integration, compiler, runtime library, communication and compute kernel development, and performance tuning.
  • Enable deep learning models, with attention to performance and accuracy, for popular frameworks like PyTorch and llama.cpp and for serving platforms like vLLM and SGLang, positioning you at the forefront of AI innovation.
  • Perform HW/SW co-design to optimize existing AI architectures: enhance computational efficiency, increase throughput, reduce latency, and improve scalability, pushing the boundaries of what's possible in AI technology.
  • Be a key team member in building state-of-the-art software and hardware AI co-processors/accelerators, and contribute to a collaborative and dynamic work environment that supports continuous improvement and excellence.
  • Collaborate with cross-functional teams to integrate AI solutions into Ampere's cloud-native processor platforms and accelerators.

Other

  • BS in Computer Science, Mathematics, or a related technical field and 12 years of related experience; or an MS degree and 8 years; or a PhD and 5 years.
  • Unlimited flextime and 10+ paid holidays so that you can embrace a healthy work-life balance.
  • Ampere is an inclusive and equal opportunity employer and welcomes applicants from all backgrounds.