Job Board
LogoLogo

Get Jobs Tailored to Your Resume

Filtr uses AI to scan 1000+ jobs and finds postings that perfectly matches your resume

Pioneering Intelligence Logo

Machine Learning Scientist: Foundation Models

Pioneering Intelligence

Salary not specified
Sep 2, 2025
Cambridge, MA, US
Apply Now

Cellarity seeks to fundamentally redesign the way drugs are created by shifting the focus from a single target to the underlying cellular dysfunction, unraveling the complexity of disease biology and creating medicines that were never before possible. The Machine Learning Scientist will play a critical role in bringing new medicines to patients by developing and advancing foundation models that capture biological networks driving cellular dysfunction and integrate diverse perturbation modalities to accelerate drug discovery.

Requirements

  • Strong foundation in statistics, deep learning and generative AI.
  • Proficient in Python and scalable ML frameworks (e.g. PyTorch, Hugging Face)
  • Experience with high-dimensional biological data analysis (bulk/single-cell RNA-seq, Gene regulatory networks, PPI networks, multi-omics integration).
  • Experience with chemical / CRISPR perturbation screen data (eg: Perturb-seq).
  • Experience developing deep generative models for representation learning.
  • Experience efficiently training and fine-tuning foundation models on omics data.
  • Experience with cloud computing (AWS/GCP) and MLOps best practices.

Responsibilities

  • Design and implement novel algorithms and architectures for large-scale foundation models (e.g. Diffusion models, Transformers, VAEs) using single-cell RNA-seq and other modalities.
  • Develop mechanistic interpretability methods to infer gene networks and regulatory mechanisms via attention, graph-based, and/or causal representation methods.
  • Build state-of-the-art perturbation models using multi-modal perturbation (small molecules, CRISPR, cytokines) and phenotypic data.
  • Pre-train, post-train and deploy generative AI models using proprietary datasets, and distributed training and inference on cloud platforms.
  • Establish clinically relevant benchmarking and evaluation frameworks to assess context generalization and guide model improvements.
  • Stay current with the latest research in foundation models, representation learning (across biology, NLP, vision, and audio), and perturbation modeling.
  • Collaborate with interdisciplinary scientists from biology, chemistry, and technology teams to translate research questions into cutting-edge ML solutions.

Other

  • PhD in Computer Science, Computational Biology, or related field, OR Master's degree with 3+ years of relevant ML research or industry experience.
  • Excellent communication skills and ability to work in interdisciplinary teams.
  • Ability to work in a fast-paced, multidisciplinary, biotech environment.
  • Communicate technical concepts clearly to diverse scientific audiences.
  • Contribute to strategic roadmap planning and technical decisions.