Red Hat is seeking to enhance the performance and scalability of its OpenShift platform, particularly for emerging AI and Data/Analytics workloads, to ensure it meets the demands of modern applications and supports Red Hat's AI strategy.
Requirements
- Working knowledge of Kubernetes or OpenShift.
- Strong programming, debugging, and profiling skills in Python and/or Golang.
- Hands-on experience with performance measurement, analysis, and optimization.
- Experience with distributed systems.
- Very strong Linux system administration and system engineering skills.
- Solid scripting skills, particularly with Bash, Python, or Ansible.
- Experience with container technologies like Podman or Docker, and familiarity with building container images.
Responsibilities
- Work closely with management, product owners, developers, and quality engineers to understand product requirements and build test plans that verify the performance and scale of OpenShift features and solutions for running AI workloads. Examples include Kubernetes Dynamic Resource Allocation (DRA), autoscaling, and operators for detecting, configuring, and managing AI accelerators.
- Develop sophisticated tests that simulate user workloads through comprehensive end-to-end automation, leveraging custom-built and state-of-the-art open-source tools and frameworks.
- Investigate performance issues in complex distributed systems to identify their root causes.
- Design and develop tools for monitoring, analyzing, and reporting on performance and scale tests.
- Document your research and results clearly and concisely, and communicate findings both internally and externally.
- Engage in upstream communities to help test performance and scale early and influence design and development decisions.
- Triage, debug, and root-cause customer issues related to OpenShift performance and scale.
Other
- Demonstrable experience in, understanding of, and passion for performance engineering.
- Excellent communication and interpersonal skills.
- Ability to work independently and proactively seek collaboration.
- Master’s degree in Computer Science or a related field with 3+ years of relevant experience, or a Bachelor’s degree in Computer Science or a related field with 5+ years of relevant experience.
- Experience with collaborative software development methodologies, tools, and version control.