Microsoft’s Cloud Operations and Innovation (CO+I) team builds and operates Microsoft datacenters, which in turn power Microsoft’s cloud business. We anticipate and provide capacity for continuous scale. The CO+I Engineering team (CO+IE) delivers services, applications, and automation supporting datacenter planning, construction, and operation.
Requirements
- 4+ years of experience in architectural leadership, driving reliability, telemetry, and operational workflows through safe deployment practices.
- 4+ years of experience working through the full product cycle from initial design to rapid production deployment.
- Proven track record of architecting and delivering distributed cloud services (Azure preferred).
- Experience creating and shipping V1 products using modern development practices.
- Experience using agile methodologies and/or test-driven development (TDD).
- Analytical, problem-solving, testing and debugging skills
Responsibilities
- Own complex, critical services and components end-to-end**.
- Lead design and implementation of scalable, secure, and reliable systems, make architecture decisions, define service boundaries.
- Author design documents, present tradeoffs and decisions to leadership, and provide status/health signals with clarity and accountability
- Lead technical strategy and roadmaps. Translate business objectives into engineering plans, set technical direction for a product area, decompose ambiguous problems into executable work, and drive multi‑release roadmaps with measurable outcomes.
- Establish and champion best practices for code quality, testing, observability, performance, availability, and operational excellence (including (Designated Responsible Individual) DRI/on‑call ownership, SLOs/SLIs, incident response, and post‑mortems)
- Design for resiliency and cost efficiency, leverage Azure and Microsoft services where appropriate; instrument services with telemetry to drive data‑informed iteration
- Write high quality, maintainable, reusable code following SOLID principles.
Other
- This role is located either in one or all hub locations - Atlanta, GA, Washington, D.C., Redmond, WA, San Antonio, TX or Phoenix, AZ.
- Relocation support will be provided, and successful candidates must relocate or reside within 50 miles of the hub office location.
- This role is eligible for hybrid or remote work, up to 50%.
- Ability to meet Microsoft, customer and/or government security screening requirements are required for this role.
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.