The Azure High Performance Computing & Artificial Intelligence (HPC & AI) compute platform team needs to define and deliver the hardware roadmap, software, and services that enable users to run technical computing workloads in Azure, including extreme scale AI training and inference, traditional HPC modeling and simulations, remote visualization, and immersive gaming experiences. This role will drive the successful execution of complex Graphics Processing Unit (GPU) buildouts and contribute to large-scale data center infrastructure programs.
Requirements
- 4+ years of experience in product or program management with a focus on Cloud Infrastructure programs.
- 2+ years of proficiency delivering products and software features in a fast-paced online services environment.
- 2+ years of demonstrated track record in cluster buildouts, infrastructure deployment, and product execution at scale.
- 2+ year(s) of data center design/build, hardware lifecycle, vendor management, and inventory/supply chain logistics experience.
- 1+ year(s) exposure to InfiniBand, RDMA, GPU backend networking environments, Capacity delivery (GPU preferred) planning, supply and execution business.
- 3+ year(s) proficiency with infrastructure telemetry, or deployment tooling.
- Interest in automation, process optimization, and scalable frameworks.
Responsibilities
- Drive and manage GPU cluster capacity buildout partnering closely with various engineering and operations across Microsoft.
- Deliver supercomputers to our biggest customers, navigating rapid technology shifts to ensure smooth integration of cutting-edge SKUs into the production fleet.
- Partner with engineering, supply chain, data center operations, and finance—facilitating clear communication, shared understanding, and timely delivery.
- Serve as the strategic liaison with senior leaders within Microsoft and customer to Build trust by communicating roadmap clarity, surfacing blockers transparently.
- Engage directly with customer stakeholders to understand requirements, acceptance criteria, and ensure alignment on deployment expectations.
Other
- Bachelor's Degree AND 5+ years experience in product / service / project / program management or software development. + OR equivalent experience.
- PMP or similar certification is a plus.
- Ability to meet Microsoft, customer and/or government security screening requirements are required for this role.
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.
- 3+ year(s) of ability managing multiple workstreams and adapting to changing priorities.