The Azure Storage team builds, manages, and operates the persistent cloud storage for Microsoft Azure. The team is seeking a Principal Software Engineer to provide technical leadership in building next-generation storage solutions, including support for new workloads such as AI training and inference.
Requirements
- 8+ years of experience in software engineering practices and coding, with a solid foundation in data structures and algorithms
- 8+ years of experience in storage or file systems, distributed systems, and/or applications at scale
- 1+ year(s) of experience with storage systems in areas such as erasure coding, high performance, and durability schemes
- Experience with building high-performance storage servers using Data Processing Units (DPUs)
- Coding experience in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- Solid foundation in data structures and algorithms
- Experience with distributed systems and applications at scale
Responsibilities
- Lead the identification of areas for innovation to excel at supporting AI workloads for our leading customers.
- Identify critical efficiency, data availability, and durability improvements using advanced erasure coding algorithms.
- Determine the necessary performance improvement plans and their priorities, and document the work needed, including any dependencies.
- Provide hands-on leadership of the engineering work needed to drive performance and efficiency improvements and to drive innovative new features.
- Develop and document new ideas and prototype them in partnership with others as needed.
- Hold accountability as a Designated Responsible Individual (DRI), mentoring engineers across products/solutions and working on-call to monitor systems/products/services for degradation, downtime, or interruptions.
- Proactively seek new knowledge and adapt to new trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale.
Other
- Bachelor's Degree in Computer Science or related technical field
- Ability to meet Microsoft, customer and/or government security screening requirements
- Ability to work on-call to monitor systems/products/services for degradation, downtime, or interruptions
- Travel up to 25%
- Remote (0 days/week in-office)