Advancing Artificial Intelligence (AI) capabilities in areas including modeling, algorithms, reasoning, and agentic AI, and evolving pre-training, mid-training, and post-training codebases for models like Phi.
Requirements
- 1+ year(s) experience developing with Python and PyTorch/JAX.
- Familiarity with architecture and optimizations for large language models.
- Hands-on work in debugging and profiling PyTorch distributed code.
- Basic understanding of how CUDA kernels work.
- Familiarity with pre-training, mid-training, and/or post-training pipelines for language and/or multimodal models.
- Foundational understanding of reinforcement learning and key challenges in the field.
- Experience with verl, Ray, Megatron, and/or vLLM is a significant plus.
Responsibilities
- Help develop novel ideas in bleeding-edge reinforcement learning research.
- Help evolve our pre-training, mid-training, and post-training codebases, which produced well-known models such as Phi and established many new records.
- Collaborate with researchers and engineers across many disciplines to help advance the state of the art in reasoning and agentic AI.
- Design, develop, execute, and implement technology research projects in collaboration with other researchers, engineers, and product groups.
- Play a crucial role in developing, improving, and exploring the capabilities of large language models (LLMs), reasoning, and agentic AI.
Other
- Bachelor's Degree in Computer Science or relevant field AND 6+ years related experience
- Master's Degree in Computer Science or related field AND 4+ years related experience
- Doctorate in Computer Science or related field AND 3+ years related experience
- Or equivalent experience.
- Embody our culture and values.