Building the world's first biological reasoning model to predict, understand, and program living systems
Requirements
- You have experience pre-training models and are proficient in distributed computing environments
- You are proficient in Python and have expertise in at least one major deep learning framework (PyTorch, TensorFlow, or JAX)
- You have experience with deep learning and generative architectures such as transformers, diffusion models and autoencoders
- You are skilled in working with terra-scale datasets and scaling models to billions of parameters
- You have a strong understanding of machine learning fundamentals, including various model architectures, optimization techniques, and evaluation metrics
Responsibilities
- You will build foundational models for biology capable of reading and writing biology at scale
- You will develop deep generative models for biological applications, exploring innovative architectures to capture the complexities of multi-scale biological systems
- You will work on distributed training systems to scale our models to billions of parameters, optimizing for performance and efficiency across multi-GPU and multi-node setups while handling large-scale biological datasets
- You will engineer efficient data pipelines to manage and process massive biological datasets, addressing challenges in data loading, splitting, and memory optimization
- You will develop and implement robust evaluation frameworks for complex biological models, ensuring data integrity and preventing leakage across dataset splits
Other
- You have a Bachelor's in Computer Science, Machine Learning, or a related technical field
- You have 3+ years of experience in developing and implementing deep generative learning models
- You have excellent problem-solving skills and the ability to quickly adapt to new challenges
- You have excellent communication skills and can clearly articulate complex technical concepts
- You are motivated by making a real impact and are committed to tackling problems of significant consequence with determination and creativity