Microsoft AI is looking to create innovative AI experiences in Copilot by developing new methods to evaluate LLMs, creating new user-facing features with prompt engineering and fine-tuning, or training classifiers to support the Copilot experience.
Requirements
- Experience prompting, evaluating, and working with large language models.
- Experience writing production-quality Python code.
Responsibilities
- Leverage subject matter expertise to improve model quality for interactive and agentive experiences in consumer Copilot.
- Oversee data acquisition or generation efforts, ensuring that the data meets product needs.
- Generalize machine learning (ML) solutions into repeatable frameworks.
- Lead evaluation efforts of models deployed within Copilot.
- Track advances in industry and academia, identify relevant state-of-the-art research, and adapt algorithms and/or techniques to drive innovation and develop new solutions.
- Independently write efficient, readable, extensible code and model pipelines.
- Contribute to defining the model quality roadmap for Copilot, keeping in mind business and product goals.
Other
- Strong communicator and great teammate.
- Takes initiative, is user-centered, and enjoys building world-class consumer experiences and products in a fast-paced environment.
- By applying to this U.S. Mountain View, CA position, you are required to be local to the San Francisco area and in office 3 days a week.
- Starting January 26, 2026, MAI employees are expected to work from a designated Microsoft office at least four days a week if they live within 50 miles (U.S.) or 25 miles (non-U.S., country-specific) of that location.
- Commit to a customer-oriented focus by acknowledging customer needs and perspectives, and building AI products that delight customers.