Microsoft is looking to solve business problems by developing next-generation AI systems at scale, specifically designing and deploying advanced AI models and agentic systems for real-world applications, to empower every person and every organization on the planet to achieve more.
Requirements
- 3+ years' experience developing and deploying large language models (LLMs), including agentic systems, supervised fine-tuning, and Reinforcement Learning (RLHF)
- 3+ years' experience designing, implementing, and optimizing Retrieval-Augmented Generation (RAG) pipelines and advanced context engineering.
- Hands-on experience with modern LLM evaluation techniques, including LLM-as-a-Judge, agentic evaluations, and RAG assessments.
- Deep understanding of fundamental ML algorithms (supervised, unsupervised) and modern neural network architectures.
- Experience with MLOps practices, including model versioning, automated testing, monitoring, and CI/CD for machine learning.
- A record of publication in top-tier scientific venues (e.g., NeurIPS, ICML, ICLR, ACL, EMNLP, KDD).
- Experience with Large Language Models, Natural Language Processing, Information Retrieval, and Machine Learning.
Responsibilities
- Lead the design and development of advanced AI models and agentic systems for real-world applications.
- Own and drive end-to-end model training, including data pipeline design, distributed training optimization, and performance evaluation.
- Stay up to date with the latest advancements in LLM, NLP, deep learning, search and AI research.
- Research and develop an understanding of the state-of-the-art tools, technologies, and methods being used in the research community and product groups.
- Collaborate closely with engineering, product, and research teams to productionize models, build scalable, robust pipelines and provide support for in production AI Models/Agents
- Lead end-to-end lifecycle of machine learning models, from prototyping and implementation to evaluation, deployment, and monitoring.
- Conduct applied science experiments, create and validate metrics, develop ML pipeline and modeling algorithm in the area of Large Language Models, Natural Language Processing, Information Retrieval, and Machine Learning.
Other
- Bachelor's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 8+ years related experience
- Ability to meet Microsoft, customer and/or government security screening requirements
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
- 5+ year(s) experience creating publications (e.g., patents, peer-reviewed academic papers).
- A track record of delivering successful, large-scale applied ML projects in an industry setting.