The Analytics and AI Methods at Scale (AAIMS) group in the National Center for Computational Science (NCCS) is hiring a Research Scientist to advance the frontier of AI for science, including scientific reasoning, federated & collaborative learning, and reinforcement learning (RL) for self-improving models on leadership-class supercomputers.
Requirements
- Demonstrated research in one or more areas of HPC or AI (e.g., large-scale training, scientific reasoning, reinforcement learning, or distributed systems).
- Strong programming skills (Python, C/C++, or equivalent) and experience with ML frameworks (e.g., PyTorch).
- Experience with large-scale experiments on HPC or cloud platforms.
- Familiarity with distributed training frameworks (e.g., DeepSpeed, Megatron-LM, Ray).
- Strong publication record commensurate with career stage.
- Interest in developing open-source tools and contributing to community efforts.
Responsibilities
- Conduct research in AI/ML at scale, working with cutting-edge HPC resources.
- Collaborate with senior researchers and domain scientists on AI methods and scientific applications.
- Contribute to peer-reviewed publications, technical reports, and proposals.
- Engage in collaborative software development and open-source contributions.
- Present research outcomes at conferences, workshops, and internal seminars.
- Contribute to a supportive, inclusive, and collaborative team culture.
Other
- Ph.D. in Computer Science, Computer Engineering, or a field closely related to the job duties of this position.
- Ability to obtain and maintain an HSPD-12 PIV badge.
- For employment at Oak Ridge National Laboratory (ORNL), a Real ID compliant form of identification will be required.
- Must be able to obtain and maintain a federal Personal Identity Verification (PIV) card as mandated by Homeland Security Presidential Directive 12 (HSPD-12) and Department of Energy (DOE) Order 473.1A.
- Must be able to pass a Federal Tier 1 background check investigation.