At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems.
Requirements
- Strong proficiency in C++ and object-oriented programming
- Experience with performance analysis tools or profiling frameworks
- Familiarity with GPU programming models (HIP, OpenCL, or CUDA)
- Familiarity with the ROCm ecosystem and tools (highly desirable)
- Understanding of GPU architecture and system-level performance concepts
- Experience with multithreading and concurrency in modern C++
- Knowledge of Linux development environments; Windows experience is a plus
Responsibilities
- Design, develop, and maintain ROCprofiler-SDK components for GPU performance profiling
- Collaborate with architecture, driver, and runtime teams to enable profiling for next-generation AMD GPUs
- Implement new features and APIs to enhance profiling capabilities for AI and HPC workloads
- Optimize profiling tools for accuracy, scalability, and minimal overhead
- Debug and resolve issues in profiling workflows and improve tool reliability
- Participate in hardware bring-up and ensure profiling support for new ASICs
- Stay current with GPU architecture advancements and integrate them into ROCm profiling tools
Other
- Bachelor's or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent
- Strong problem-solving skills and ability to work independently and in a team
- Ability to communicate effectively across teams
- Detail-oriented and proactive in solving complex technical challenges
- Ability to work in a collaborative environment