AMD is looking to develop a high-performance GPU kernel generator to deliver machine learning tensor operations with support for layer fusion as part of the AMD Radeon Open Ecosystem (ROCm).
Requirements
- Hands on professional experience writing and optimizing GPU code
- Strong background developing applications and libraries in modern C++
- Experience with best practices in software development, including testing, profiling, debugging, documentation, version control, and issue tracking
- Experience with low-level optimization techniques; assembly programming and/or vectorization
Responsibilities
- Contribute to the design and implementation of AMD’S GPU kernel generator used in a variety of high-performance machine learning libraries and frameworks
- Develop modern build and testing systems
- Communicate and collaborate with key technical experts across AMD and with our partners and customers to improve ROCm applications, libraries, and tools, as well as hardware
- Deliver high-quality code and documentation following best practices for open-source software development
- Apply software engineering methodologies to ensure robust, maintainable code
- Develop technical relationships with peers and partners
Other
- Collaborating with teams involved with all aspects of machine learning and math libraries.
- A great candidate will be a strong collaborator who is open to continuous learning and is excited about innovative solutions.
- Bachelor’s or Master’s degree in Applied Mathematics, Computer Science, Computer Engineering, Electrical Engineering, or equivalent
- Austin, Texas
- LI-Hybrid