Microsoft is seeking to reinvent productivity with AI, specifically through the development of AI-powered calendar experiences that help users organize and manage time more effectively.
Requirements
- Experience developing and deploying large language models (LLMs), including agentic systems, supervised fine-tuning, and reinforcement learning from human feedback (RLHF).
- Experience designing, implementing, and optimizing Retrieval-Augmented Generation (RAG) pipelines and advanced context engineering.
- Experience with modern LLM evaluation techniques, including LLM-as-a-Judge, agentic evaluations, and RAG assessments.
- Experience with MLOps practices, including model versioning, automated testing, monitoring, and CI/CD for machine learning.
- Experience publishing at top-tier scientific venues (e.g., NeurIPS, ICML, ICLR, ACL, EMNLP, KDD).
- Ability to translate complex ML concepts into business value and communicate technical insights to non-technical stakeholders.
- Experience with NLP and large language models.
Responsibilities
- Keep abreast of the latest breakthroughs in generative AI and large language models, and translate them into practical, high-impact calendar Copilot solutions that leverage the M365 graph to deliver personalized, context-aware time management.
- Define and iterate on relevance metrics that measure how Copilot features truly serve user intent in M365 Calendar.
- Determine where fine-tuned LLMs, small language models, or other specialized approaches are required, and own their design, training, and deployment.
- Build high-fidelity synthetic and manufactured datasets, along with rigorous evaluation sets and benchmarks that mirror the workflows of enterprise information workers.
- Drive the applied-science strategy for industry-leading calendar agents, partnering closely with engineering and product teams to ship at scale.
- Innovation with LLMs: Stay at the cutting edge of NLP (natural language processing) and large language models.
- Lead AI Feature Development: Lead the design and development of advanced ML/NLP models to power Copilot features in Outlook Calendar and Microsoft Teams.
Other
- Bachelor's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 4+ years of related experience
- Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role.
- Ability to work in a team environment and collaborate with cross-functional teams
- Strong communication and leadership skills
- Ability to translate complex technical concepts into business value