Advance state-of-the-art language understanding and question answering technologies, while directly impacting the way customers interact with AI-powered systems for Azure CoreAI.
Requirements
- 2+ years of experience in data curation and synthesis for model optimization
- 2+ years of hands-on coding experience in Python and frameworks like PyTorch.
- 2+ years of experience in natural language processing (NLP), with proven experience in training and fine-tuning LLMs/SLMs.
- Experience with both proprietary and open-source frameworks.
- Hands-on experience with agentic model design, prompt optimization, and evaluation methodologies.
- Deep understanding of traditional ML techniques, along with the ability to innovate with hybrid approaches.
- Proven track record of impactful research with published work or real-world product deployments
Responsibilities
- Advance Conversational AI Models – Drive innovation in Conversational Language Understanding (CLU), Conversational Question Answering (CQA), and prompt optimization by training, fine-tuning, and evaluating large language models (LLMs), small language models (SLMs), and agentic models to push state-of-the-art conversational intelligence.
- Deliver Production-Ready Solutions – Translate research into scalable, high-quality code and collaborate with engineers to ship solutions that power Azure AI services and directly impact customers.
- Lead with Science + Strategy – Stay at the forefront of Natural Language Processing (NLP) research and Azure AI advancements, ensuring our services not only meet current customer needs but anticipate and shape future trends in conversational AI.
- Own end-to-end applied science contributions across Conversational Language Understanding (CLU), Conversational Question Answering (CQA), and Prompt Optimization for agentic scenarios.
- Design, train, and fine-tune large language models (LLMs), small language models (SLMs), and agentic models, pushing the boundaries of how conversational systems reason and respond.
- Develop evaluation techniques and metrics that capture both traditional Machine Language (ML) rigor and the nuances of conversational quality.
- Collaborate closely with software engineers and data scientists to deliver production-ready solutions that enhance chatbot and agentic capabilities for enterprise customers.
Other
- 3 days / week in-office
- Ability to meet Microsoft, customer and/or government security screening requirements are required for this role.
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
- Startup-style mindset: agile, solution-oriented, and self-driven
- Embody our Culture and Values.