The AI Platform organization at Microsoft is looking to build the end-to-end Azure AI stack/PaaS and is core to Azure’s innovation and differentiation, as well as all of Microsoft’s flagship products, from Office to Teams, to Xbox.
Requirements
- Coding in languages including, but not limited to, C, C++, C-Sharp, Java, JavaScript, or Python
- Experience with cloud platforms (e.g., Azure, AWS) and distributed computing (Kubernetes)
- Proficiency in Python and relevant ML libraries (e.g. PyTorch).
- Experience with transformer-based and diffuser-based models preferred(e.g. GPT, Llama, Stable diffusion).
- 5+ years of experience writing production code in building internet scale services and distributed systems.
Responsibilities
- Write clean and concise code with unit tests
- Collaborate with researchers and data scientists to implement model customization techniques, including Finetuning, Reinforcement Finetuning, Distillation.
- Optimize model performance, scalability, and efficiency.
- Conduct experiments to evaluate model performance, robustness, and generalization.
- Explore novel techniques and approaches to enhance model capabilities.
Other
- Bachelor's Degree in Computer Science or related technical field
- 4+ years technical engineering experience
- Travel 0-25%
- Work site: 3 days/week in-office
- Full-Time employment