Microsoft is looking to solve the problem of transforming the Windows experience through state-of-the-art language and multimodal models, and is seeking a Principal Researcher to design and lead the implementation of these models.
Requirements
- Experience conducting research as part of a research program in academic or industry settings.
- 3+ years' experience training and/or finetuning large language models, model compression, distillation to small language models.
- 3+ years' experience with Python and deep learning libraries
- Experience in large language/multimodal model training, finetuning and distillation.
- Experience using dataset curation, data generation using prompting state of art LLMs and/or model alignment.
- Experience with reinforcement learning libraries or formal methods and verification is a plus.
- Experience in creating reusable software libraries and packages.
Responsibilities
- Research focusing on foundation models - Designing and training language and multimodal models (and finetunes) with supervised finetuning, distillation, reinforcement learning, low-rank adaptation.
- Research focusing on environment interaction, policy specification and verification and human intent alignment.
- Implementation: Train, distill, and finetune language and vision models in PyTorch, DeepSpeed, AzureML stack.
- Machine learning engineering - Build pipelines to test designs, algorithms and models.
- Data science - Research and develop synthetic data generation strategies.
- Proactively follow state of the art research and share latest work, write papers, attend conferences and share knowledge in the wider team.
Other
- Doctorate in relevant field AND 3+ years related research experience
- OR equivalent experience.
- Ability to meet Microsoft, customer and/or government security screening requirements are required for this role.
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
- Demonstrated ability and passion for incubating new ideas, solving problems, and building working systems.