Azure Service Fabric is a decentralized runtime and platform designed to address some of the most complex challenges in distributed systems, including failure detection, leader election, and replicated state machines, which are core components for building large-scale, reliable, and highly available solutions. It powers essential services such as the Azure Control Plane, Azure SQL Database, and Azure Cosmos DB, and operates across both cloud and on-premises environments. As a Principal Software Engineer on the Azure Service Fabric team, you will design and deliver innovative, cloud-scale distributed systems while deepening your expertise in building software that supports fault tolerance, self-management, and scalable operations.
Requirements
- 6+ years of technical engineering experience coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python.
- 3+ years of experience designing and developing large-scale, distributed cloud services with a focus on reliability, scalability, and performance.
- 10+ years of technical engineering experience coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python.
- 5+ years of experience designing and developing large-scale, distributed cloud services with a focus on reliability, scalability, and performance.
Responsibilities
- Designs and delivers innovative, cloud-scale distributed systems while deepening expertise in building software that supports fault tolerance, self-management, and scalable operations.
- Partners with appropriate stakeholders to determine user requirements for a set of scenarios.
- Leads identification of dependencies and the development of design documents for a product, application, service, or platform.
- Leads by example and mentors others to produce extensible and maintainable code used across products.
- Leverages subject-matter expertise of cross-product features with appropriate stakeholders (e.g., project managers) to drive multiple groups' project plans, release plans, and work items.
- Holds accountability as a Designated Responsible Individual (DRI), mentoring engineers across products/solutions, working on-call to monitor system/product/service for degradation, downtime, or interruptions.
- Proactively seeks new knowledge and adapts to new trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products, drives consistency in monitoring and operations at scale, and shares knowledge with other engineers.
Other
- Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role.
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.
- Microsoft will accept applications for the role until October 22, 2025.