Microsoft's CoreAI is building the AI data-plane that powers all LLM inferencing workloads across Microsoft and Azure customers, aiming to serve models at scale reliably, efficiently, and with ultra-low latency.
Requirements
- 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C, Java
- 4+ years of design and problem-solving experience, with a deep understanding of system performance, scalability, and engineering best practices
- Understanding of distributed systems specifically in request serving at scale, including high-performance storage, distributed databases, and networking across global-scale infrastructures
- Experience shipping with a high velocity and iterative approach
- Demonstrated experience in building high-quality, reliable systems at scale
Responsibilities
- Drive innovation and deliver impact on a critical Azure service that underpins Microsoft’s AI vision.
- Design, implement and deliver AI services to support product offerings for large-scale LLM serving.
- Innovate on technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products.
- Ship new product features and improvements at a high velocity where ideas to production is a matter of week(s).
- Innovate on product shape and offers to define how developers leverage cutting edge AI technologies and build world class applications.
- Engage with customers to gather feedback and resolve complex issues.
Other
- Ability to meet Microsoft, customer and/or government security screening requirements are required for this role.
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
- Proven ability to lead complex technical initiatives that span multiple teams and disciplines
- Customer-obsessed approach to problem solving, with empathy and a drive to deliver impactful solutions
- Embody our culture and values