Microsoft is looking to shape the future of language understanding, text analytics, and custom question answering by building and optimizing large-scale AI systems that power real-world applications.
Requirements
- coding in languages including, but not limited to, C, C++, C-Sharp, Java, JavaScript, or Python
- Experience in building and deploying large-scale distributed systems.
- Experience with cloud platforms (e.g., Azure, AWS, GCP) and microservices architecture.
- Solid understanding of machine learning or natural language processing concepts.
- Experience working with large language models (LLMs), small language models (SLMs), or agent-based systems.
- Familiarity with AI/ML frameworks such as PyTorch, TensorFlow, or ONNX.
Responsibilities
- Design, build, and maintain scalable AI services for language understanding, text analytics, and custom question answering.
- Collaborate with researchers and engineers to integrate cutting-edge large and small language models into production systems.
- Drive architecture and implementation decisions that ensure performance, reliability, and maintainability of AI services.
- Contribute to the development of agentic infrastructure that enables dynamic and intelligent interactions.
- Participate in code reviews, design discussions, and cross-team collaborations to deliver high-quality software.
Other
- Ability to meet Microsoft, customer and/or government security screening requirements are required for this role.
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
- Proven problem-solving skills and ability to work independently and collaboratively in a fast-paced environment.
- Demonstrated ability to lead technical initiatives and mentor team members.
- Passion for innovation and delivering impactful AI solutions at scale.