Azure Specialized is collaboratively working to bring the next generation of workloads to our Public Cloud platform, enabling end-to-end new scenarios for Azure customers by imagining and building differentiating customer features and fundamental building blocks at the heart of the Azure platform.
Requirements
- coding in languages including, but not limited to, C, C++, C-Sharp, OR Java, JavaScript, or Python
- 4+ years of experience in building cloud services and supporting them during production process.
- 2+ years of Deep Learning, AI Infrastructure, Distributed Systems, High Performance Computing / Machine Learning middleware OR Co-Designing Hardware-Software.
- coding in languages including, but not limited to, C, C++, C-Sharp, Java, JavaScript, OR Python
- Deep Learning, AI Infrastructure, Distributed Systems, High Performance Computing / Machine Learning middleware OR Co-Designing Hardware-Software.
Responsibilities
- Designing and delivering the next generations of AI training, AI inferencing, virtual desktop, video and gaming infrastructure for Azure.
- Defining and deliver an end-to-end vertical view, with continuous focus on customer value, quality, performance and automation.
- Defining, deploying and sustaining hardware and software Azure infrastructure for AI and other GPU-based workloads.
- Focuses on hardware/software interaction, coding and playing with next-gen hardware, end-to-end systems engineering anywhere in the infrastructure - from fiber networking, switches, GPU differentiation, rack design, cluster design and more.
- Helps ensure Azure platform is consistent on performance, can scale on-demand, and engineered to withstand the unparalleled computing demand from the customer workloads.
- Creates, implements, optimizes, debugs, refactors, and reuses code to establish and improve performance and maintainability, effectiveness, and return on investment (ROI).
- Acts as a Designated Responsible Individual (DRI) and guides other engineers by developing and following the playbook, working on call to monitor system/product/service for degradation, downtime, or interruptions, alerting stakeholders about status and initiates actions to restore system / product / service for simple and complex problems when appropriate.
Other
- 2+ years of experience with leading customer engagement and support during the different deployment stages.
- Ability to meet Microsoft, customer and/or government security screening requirements are required for this role.
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.
- 4+ years of experience in customer engagement roles, developing tools and services for dealing with customer related issues.
- Maintains communication with customers, and key partners across the Microsoft ecosystem of engineers.
- Ensures alignment with partners' expectations.
- Considers partner teams across organizations and their end goals for products to drive and achieve desirable user experiences and fitting dynamic needs of partners/customers through product development.