Meta is seeking to build state-of-the-art Large Language Models (LLMs) and is looking for a Research Engineer to join their LLM Research team to conduct focused research and engineering
Requirements
Research experience in machine learning, deep learning, and/or natural language processing
Experience with developing machine learning models at scale from inception to business impact
Programming experience in Python and hands-on experience with frameworks such as PyTorch
Exposure to architectural patterns of large scale software applications
Experience in generative AI and LLM research
Experience with language model evaluation; data processing for pre-training and fine-tuning; responsible LLMs; LLM alignment; reinforcement learning for language model tuning; efficient training and inference; and/or multilingual and multimodal modeling
Responsibilities
Design methods, tools, and infrastructure to push forward the state of the art in large language models
Define research goals informed by practical engineering concerns
Contribute to experiments, including designing experimental details, writing reusable code, running evaluations, and organizing results
Adapt standard machine learning methods to best exploit modern parallel environments (e.g. distributed clusters, multicore SMP, and GPU)
Work with a large and globally distributed team
Contribute to publications and open-sourcing efforts
Other
Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
A PhD in AI, computer science, data science, or related technical fields
Master's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
First author publications at peer-reviewed AI conferences (e.g., NeurIPS, CVPR, ICML, ICLR, ICCV, and ACL)