Microsoft Azure is looking to expand the capacity and range of supported scenarios for AI and other GPU-based workloads on its Public Cloud platform to support the next 100X growth, ensuring performance, scalability, and infrastructure quality for customer workloads.
Requirements
- Bachelor's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C-Sharp, Java, Powershell, or Python OR equivalent experience.
- Ability to work with HPC or Machine Learning for 2+ years.
- Familiarity with Deep Learning, AI Infrastructure.
- 1+ years experience on Distributed Systems, High Performance Computing / Machine Learning middleware, Co-Designing Hardware-Software, Accelerators, Profiling and Performance Analysis Tools.
- Preferred: Bachelor's Degree in Computer Science OR related technical field AND 5+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C-Sharp, Java, Powershell, OR Python.
- Preferred: Master's Degree in Computer Science or related technical field AND 3 years technical engineering experience with coding in languages including, but not limited to, C, C++, C-Sharp, Java, Powershell or Python.
- Preferred: Equivalent experience to the above.
Responsibilities
- Designing and delivering next generations of AI training, AI inferencing, virtual desktop, video, and gaming infrastructure for Azure.
- Working on defining, deploying, and sustaining hardware and software Azure infrastructure for AI and other GPU-based workloads.
- Engaging in hardware/software interaction, coding, and working with next-gen hardware.
- Performing end-to-end systems engineering across infrastructure layers including fiber networking, switches, GPU differentiation, rack design, and cluster design.
- Producing extensible and maintainable code, optimizing, debugging, refactoring, and reusing code to improve performance and maintainability.
- Driving identification of dependencies and development of design documents for products, applications, services, or platforms.
- Acting as a Designated Responsible Individual (DRI), guiding other engineers, and working on call to monitor and restore system/product/service for issues.
Other
- Bachelor's Degree in Computer Science or related technical field AND 2+ years technical engineering experience OR equivalent experience.
- Ability to meet Microsoft, customer and/or government security screening requirements, including the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.
- Passion for quality, customer success, and getting things done.
- Willingness to dive deeply into any level or layer of a problem and learn emerging technologies.
- Ability to lead by example, produce maintainable code, and apply appropriate coding patterns and best practices.