The OneDrive Photos and Mobile (PhoMo) team is looking to improve the intelligence behind photo search and memories in OneDrive, building experiences that integrate personal nostalgia and cutting-edge AI. The goal is to make billions of photos searchable, meaningful, and delightful to revisit, touching millions of people daily across every device.
Requirements
- Coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python.
- Demonstrated ability to quickly learn new systems and adapt new methodologies to evolving service requirements.
- Proficiency in software engineering best practices including debugging, algorithm design, and complex problem solving.
- Experience designing and delivering large-scale, highly available, and reliable services, preferably with end-to-end ownership.
- 3+ years of experience designing and developing services on cloud platforms (Azure preferred) and working with distributed system architectures.
- 2+ years of experience building scalable infrastructure for AI products, including secure data pipelines, model deployment systems, and cloud environments, preferably with experience in AzureML, ONNX Runtime, PyTorch, TensorFlow, vector search, or equivalent technologies.
- Proactively seeks new knowledge and adapts to new trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale.
Responsibilities
- Architect and evolve the large-scale services that index, understand, and curate photos for our customers.
- Integrate state-of-the-art AI into production systems.
- Partner closely with mobile and client teams to deliver end-to-end experiences.
- Drive technical decisions that span cloud service design, scalability, and performance.
- Design and implement distributed, cloud-based services on Azure that process, store, and serve petabyte-scale photo and video content with high reliability, security, and performance.
- Create, implement, optimize, debug, refactor, and reuse code to improve performance, maintainability, effectiveness, and return on investment (ROI).
- Act as a Designated Responsible Individual (DRI) for critical services, driving root cause analysis, incident mitigation, and continuous improvements in reliability and scalability.
Other
- Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role.
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
- Experience collaborating in team-based environments and contributing to the growth of early-career engineers through structured mentorship and support.
- Leverage subject-matter expertise in product features and partner with appropriate stakeholders (e.g., project managers) to drive a workgroup's project plans, release plans, and work items.
- Microsoft is an equal opportunity employer.