Microsoft's Azure HPC & AI compute platform team needs to define and deliver the hardware roadmap, software, and services that enable users to run extreme scale AI training and inference, traditional HPC modeling and simulations, remote visualization, and immersive gaming experiences. They are seeking an Infrastructure Senior Product Manager to drive the successful execution of complex GPU buildouts and contribute to large-scale data center infrastructure programs.
Requirements
- 2+ years of demonstrated track record in cluster buildouts, infrastructure deployment, and product execution at scale.
- 2+ year(s) of data center design/build, hardware lifecycle, vendor management, and inventory/supply chain logistics experience.
- 1+ year(s) exposure to InfiniBand, RDMA, GPU backend networking environments, Capacity delivery (GPU preferred) planning, supply and execution business.
- 3+ year(s) proficiency with infrastructure telemetry, or deployment tooling.
- Interest in automation, process optimization, and scalable frameworks.
- 3+ year(s) of ability managing multiple workstreams and adapting to changing priorities.
Responsibilities
- Drive and manage GPU cluster capacity buildout partnering closely with various engineering and operations across Microsoft.
- Partner with engineering, supply chain, data center operations, and finance—facilitating clear communication, shared understanding, and timely delivery.
- Deliver supercomputers to our biggest customers, navigating rapid technology shifts to ensure smooth integration of cutting-edge SKUs into the production fleet.
Other
- Bachelor's Degree AND 5+ years experience in product / service / project / program management or software development. + OR equivalent experience.
- PMP or similar certification is a plus.
- 4+ years of experience in product or program management with a focus on Cloud Infrastructure programs.
- 2+ years of proficiency delivering products and software features in a fast-paced online services environment.
- Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.