Azure Databricks is looking to improve service resiliency, debug complex distributed systems, and deliver a world-class experience to customers by simplifying and democratizing data and artificial intelligence.
Requirements
4+ years hands-on experience managing live-site operations, leading incident response efforts for distributed systems, particularly within the Azure ecosystem, driving root cause analysis through detailed post-mortems to improve system reliability and performance tuning.
4+ years of Azure development experience.
5+ years of experience debugging complex systems across the Azure ecosystem, including Azure Resource Manager (ARM), Compute Resource Provider (CRP), Network Resource Provider (NRP), Storage Resource Provider (SRP), MySQL, and Azure Kubernetes Service (AKS), etc.
Responsibilities
Design and implement features and tools that enhance the resiliency, scalability, and reliability of Azure Databricks services.
Debug and solve complex issues across distributed systems—often without a playbook.
Investigate customer-reported problems and proactively identify systemic patterns to eliminate root causes.
Partner closely with teams across Azure to ensure that customer experience consistently exceeds expectations.
Build monitoring, automation, and self-healing capabilities to reduce operational overhead and human intervention.
Collaborate with Databricks Microsoft engineers and internal teams to strengthen our Azure-native integrations.
Other
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role.
Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.
The typical base pay range for this role across the U.S. is USD $119,800 - $234,700 per year.
There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $158,400 - $258,000 per year.
Microsoft will accept applications for the role until September 19, 2025