AI Accelerator Software Engineer - Runtime Library

Ampere Computing

$169,500 - $297,500

Oct 8, 2025

Portland, OR, US

Ampere aims to advance AI capabilities and deliver high-performance, efficient computing solutions that will meet future AI demands by developing and optimizing cutting-edge AI frameworks.

Requirements

Previous work experience developing a user-mode driver or runtime library for a GPU or deep learning accelerator in a Linux environment.
Strong expertise in programming languages such as Python and C/C++, with a strong background in performance tuning.
Previous software development experience with a focus on AI frameworks (PyTorch, llama.cpp, ONNX, etc.) is a big plus.
A solid understanding of AI and machine learning concepts, including neural networks and data processing frameworks, is also preferred.
Experience with high-performance computing systems and cloud-based architectures.

Responsibilities

Build a runtime library accelerator that will enable multiple frameworks and serving platforms for the Ampere deep learning accelerator.
Go deep into the entire SW/HW stack to accelerate deep learning, including but not limited to inference serving, framework integration, compiler, runtime library, communication and compute kernel development, and performance tuning.
Work on enabling deep learning models, with attention to performance and accuracy, for popular frameworks like PyTorch and llama.cpp and for serving platforms like vLLM and SGLang, positioning you at the forefront of AI innovation.
Apply HW/SW co-design to optimize existing AI architectures: enhance computational efficiency, increase throughput, reduce latency, and improve scalability, pushing the boundaries of what's possible in AI technology.
Be a key team member in building state-of-the-art software and hardware AI co-processors/accelerators, and contribute to a collaborative and dynamic work environment that supports continuous improvement and excellence.
Collaborate with cross-functional teams to integrate AI solutions into Ampere's cloud-native processor platforms and accelerators.

Other

BS in Computer Science, Mathematics, or a related technical field and 12 years of related experience; or an MS degree and 8 years; or a PhD and 5 years.
Unlimited Flextime and 10+ paid holidays
Ampere is an inclusive and equal opportunity employer and welcomes applicants from all backgrounds.