NVIDIA is seeking a Senior Test Developer to join their Enterprise Software QA team to contribute to the design, construction, optimization, and testing of large-scale infrastructure for foundational NVIDIA unified cloud services and data center offerings.
Requirements
- Proven experience with AI tools for automation and test plan development directly applied to daily tasks. This expertise is crucial for enhancing performance, developing robust frameworks, and increasing test coverage.
- 4+ years of hands-on experience in cluster management and related tools, including Docker Containers, Slurm, Kubernetes, and Ansible.
- 2+ years strong experience with cloud infrastructure platforms like AWS, Azure, Google, OCI Cloud.
- Proficient in Unix/Linux and shell/python programming skills.
- Hands-on experience with network, storage, security, cluster configuration and debugging, cloud infrastructure management tools like terraform, ansible.
- Expertise in administering, operating, and configuring Kubernetes.
- Experience in CI/CD tools such as Gitlab and Jenkins and the GitOps model.
Responsibilities
- Work with development teams on test plans for all layers of SW stack for cloud infrastructure, execution, reviews, failure analysis and assessing overall quality and risk.
- Lead NVIDIA Cloud and Data Center bring up activities which will involve validation, reporting, working with engineering to debug issues, providing design input at times, adding coverage in different areas.
- Design, develop and maintain CI/CD pipelines for continuous testing in cloud environments when needed.
- Perform performance, scalability, and reliability testing of cloud services.
- Implement and maintain test environments in cloud platforms such as AWS, Azure, or Google Cloud.
- Supervise the infrastructure to alert on significant events, ensuring the highest level of system performance and reliability.
- Work with various different partner teams to ensure availability of clusters to test on and take the lead in resolve all issues.
Other
- A Master's or Ph.D. in Computer Science or a related field, or equivalent experience.
- If you are a dedicated engineer with a deep understanding of cloud infrastructure and distributed systems, and you thrive in an exciting, innovative environment, this could be the flawless role for you.
- Work with customer PMs on software issues including technical feedback from OEMs and CSPs.
- Develop key KPIs to track execution and deploy process improvements to improve efficiency
- NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer.