HHMI is investing $500 million over the next 10 years to support AI-driven projects and to embed AI systems throughout every stage of the scientific process in labs across HHMI, with a focus on integrating evolutionary biology principles into protein language models
Requirements
- Strong programming skills in Python and PyTorch
- Deep expertise in transformer architectures, attention mechanisms, and ideally biological sequence modeling
- Experience with ML model deployment, workflow orchestration, and high-throughput data processing
- Experience working with large biological datasets in GPU-based computing environments
- Domain expertise in sequence or protein structure analysis
- Experience with systematic experiments, ablation studies, and a commitment to reproducible research and open science
- Excellent technical documentation and communication skills
Responsibilities
- Investigate and implement alternative transformer architectures for biological sequence modeling using PyTorch
- Expand models to be multimodal, using a diversity of inputs and outputs
- Design and execute rigorous comparative experiments between model architectures
- Contribute to scientific publications and present findings at conferences
- Apply software engineering best practices, ensuring a maintainable, extensible and well documented codebase allowing seamless reproduction and extension of research results
- Stay up to date with the latest advancements in AI research
- Leverage agentic coding and develop practices that enable safe application of this technology
Other
- Minimum Requirements: Bachelor's degree in Computer Science, Data Science, Statistics, Applied Mathematics, or a related field
- Demonstrated record of impactful research, including publications in ML/AI or computational biology
- Excellent communication skills, with the ability to convey complex data concepts to both technical and non-technical audiences
- A detail-oriented, creative, and organized team player with a collaborative mindset
- Ability to move about workspace and perform physical requirements such as reaching and grasping