Microsoft is building a planet-scale multi-modal database from the ground up and needs a Senior Principal Machine Learning Engineer to lead and collaborate with a team of passionate engineers to drive ideas to impactful results in a fast-paced environment.
Requirements
- Proven experience in training large-scale modern Machine Learning (ML) models (e.g., transformers, diffusion models, foundation models).
- Hands-on experience in key model optimization techniques (e.g., mixed precision training, distributed training, fine-tuning, RLHF, LoRA).
- Familiarity with model evaluation, data curation pipelines, and reproducible research practices.
- Deep understanding of modern deep learning frameworks such as PyTorch or TensorFlow, and scalable training infrastructure.
- First-author publication in top-tier machine learning conferences or journals (e.g., NeurIPS, ICML, ICLR, CVPR, ACL)
Responsibilities
- Develop and deploy scalable Machine Learning models.
- Develop robust evaluation frameworks to assess model performance, conduct systematic benchmarking, and address identified weaknesses while ensuring compliance with customer standards.
- Defines the vision and strategy for collaboration efforts between researchers and development teams at the individual product level.
- Brings new technology and approaches into production by applying long-term research efforts to solve immediate product needs.
- Drives high-stakes negotiations across teams to ensure cutting edge technology is being applied to products in a practical way that meets key business objectives.
- Ensures that teams apply an understanding of research approaches used across and outside of the company to leverage (and not re-invent) solutions.
- Represents the organization across the company.
Other
- Bachelor's Degree in Computer Science, Machine Learning or Artificial Intelligence, or related field AND 8+ years related experience
- Master's Degree in Computer Science, Machine Learning or Artificial Intelligence, or related field AND 6+ years related experience
- Doctorate in Computer Science, Machine Learning or Artificial Intelligence, or related field AND 5+ years related experience
- Embody our culture and values
- Travel requirements not specified