Microsoft's CoreAI Post-Training team is looking to advance post-training methods for both OpenAI and open-source models, and develop advanced AI technologies that integrate language and multi-modality for a range of Microsoft products.
Requirements
- 5+ years of coding experience in Python and experience with ML frameworks such as PyTorch and Triton
- 3+ years of experience in data curation and synthesis, creating and refining datasets to optimize training outcomes
- 3+ years of proven ability to design and scale training infrastructure and pipelines in production environments
- 3+ years of large-scale model training - especially with LLMs, SLMs, multimodal, or code-specific models
- Prior research publication record with over 3000 citations
- Extensive experience with foundation models, including large-scale training, model inference, reinforcement learning, reasoning models, vision-language integration, and audio-visual modeling
- Hands-on experience with large-scale distributed training or serving, and systems of thinking.
Responsibilities
- Perform large-scale model training - Especially with LLMs, SLMs, multimodal, or code-specific models.
- Perform data curation and synthesis - Creating and refining datasets to optimize training outcomes.
- Hands-on coding- write efficient, production-quality code and debug complex training jobs.
- Work on both proprietary and open-source frameworks - Demonstrated proficiency in training pipelines and architecture.
- Full-stack modeling responsibility - From data ingestion and training to evaluation and inference management.
- Contribute to or build on existing innovations like technical report of the well-known models.
- Develop novel AI solutions that bridge language, vision, and code understanding.
Other
- Doctorate in relevant field AND 5+ years related research experience
- OR equivalent experience
- Ability to meet Microsoft, customer and/or government security screening requirements
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
- Agile, solution-oriented, and able to operate with minimal overhead within a startup style mindset