Microsoft is looking to solve the problem of language understanding, text analytics, and custom question answering by building and optimizing large-scale AI systems that power real-world applications.
Requirements
- Experience in building and deploying large-scale distributed systems.
- Experience with cloud platforms (e.g., Azure, AWS, GCP) and microservices architecture.
- Solid understanding of machine learning or natural language processing concepts.
- Familiarity with AI/ML frameworks such as PyTorch, TensorFlow, or ONNX.
- Experience working with large language models (LLMs), small language models (SLMs), or agent-based systems.
- Coding experience in languages including, but not limited to, C, C++, C-Sharp, Java, JavaScript, or Python
Responsibilities
- Design, build, and maintain scalable AI services for language understanding, text analytics, and custom question answering.
- Collaborate with researchers and engineers to integrate cutting-edge large and small language models into production systems.
- Drive architecture and implementation decisions that ensure performance, reliability, and maintainability of AI services.
- Contribute to the development of agentic infrastructure that enables dynamic and intelligent interactions.
- Mentor engineers and contribute to a culture of technical excellence, inclusion, and continuous learning.
- Participate in code reviews, design discussions, and cross-team collaborations to deliver high-quality software.
Other
- Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience
- Ability to meet Microsoft, customer and/or government security screening requirements
- Master's Degree in Computer Science or related technical field AND 7+ years technical engineering experience (preferred)
- Demonstrated ability to lead technical initiatives and mentor team members
- Passion for innovation and delivering impactful AI solutions at scale