Copilot Tuning is a new product that aims to fine-tune LLMs (large language model) on tenant data, enabling task-specific agents and solutions. We are a small, nimble team that is advancing the state of the art of models in M365 Copilot. Come join our team and help transform the LLM experience in the enterprise.
Requirements
- 1+ years of experience training/fine tuning AI/ML models, preferably large language models.
- 1+ years of experience building Generative AI pipelines, e.g. with RAG (Retrieval augmented generation).
- 1+ years of experience with Python and/or PyTorch.
- Deep knowledge of transformer models and architectures.
- Experience with reinforcement learning algorithms and applications.
Responsibilities
- Train and deploy Language Models adapted to specific industry needs.
- Create and adapt novel training and fine-tuning algorithms for language models with special focus on reinforcement learning and alignment.
- Identifies, researches, and implements machine learning solutions given product requirements.
- Brings a research project to successful completion yielding new algorithms, prototypes, theories, tools, methods, analyses, insights, or collections of data which solve one or more open research problems.
- Document and share best practices across the organization.
- Supports mentorship by assisting with onboarding of research interns or other early in career team members.
Other
- Ability to meet Microsoft, customer and/or government security screening requirements are required for this role.
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
- Experience working with customers deploying AI solutions.