Microsoft is looking to expand the core capabilities of the Ads serving stack that powers ads on several online services, which is a low-latency, high-scale geo-distributed system with multiple moving parts.
Requirements
- 2+ years experience in designing, implementing, and scaling large-scale, distributed online systems with a deep understanding of system architecture and proven ability to profile, analyze, and optimize performance and capacity of native C++ systems in complex, high-throughput environments.
- Proven experience in designing, implementing, and validating distributed systems for real-time online inference.
- Solid expertise in optimizing online service performance for performance-critical workloads.
- coding in languages including, but not limited to, C, C++, C, Java, JavaScript, or Python
Responsibilities
- Develops and maintains a large-scale distributed CPU/GPU ranking platform to support real-time processing for millions of requests per second.
- Implements the features with high efficiency, extensibility, diagnosability, reliability, and maintainability with few defects.
- Maintains operations of live service as issues arise on a rotational, on-call basis.
- Identifies solutions and mitigations to simple and complex issues and escalates as necessary.
- Acts as a Designated Responsible Individual (DRI) working on call to monitor system/product feature/service for degradation, downtime, or interruptions.
- Responds within Service Level Agreement (SLA) timeframe.
- Escalates issues to appropriate owners.
Other
- Starting January 26, 2026, Microsoft AI (MAI) employees who live within a 50-mile commute of a designated Microsoft office in the U.S. or 25-mile commute of a non-U.S., country-specific location are expected to work from the office at least four days per week.
- Ability to meet Microsoft, customer and/or government security screening requirements are required for this role.
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
- Microsoft will accept applications for the role until October 1, 2025.
- Microsoft is an equal opportunity employer.