Prima Mente's goal is to deeply understand the brain, to protect the brain from neurological disease and enhance the brain in health. We do this by generating our own data, building brain foundation models, and translating discovery to real clinical and research impact.
Requirements
- Proficiency in Kubernetes, Docker, Terraform (or equivalent infrastructure automation tools), and cloud services (AWS, GCP, Azure).
- Deep experience with ML workflow orchestration tools (e.g., Kubeflow, Ray, Airflow, Metaflow).
- Excellent programming skills in Python; experience with Bash, Go, or C++ is beneficial.
- Strong understanding of ML frameworks (PyTorch, TensorFlow, JAX) and familiarity with distributed training methods, GPU acceleration, and optimization libraries (e.g., XLA, NCCL).
- Excellent understanding of software development best practices, CI/CD, and automation.
- Familiarity with GPU/TPU acceleration and performance optimization (XLA/NCCL).
- Experience with bioinformatics or biological data handling.
Responsibilities
- Architect, develop, and optimize robust ML training and inference infrastructure capable of supporting large-scale genomic foundation models.
- Design and implement scalable and efficient distributed computing platforms leveraging cloud (AWS/GCP/Azure) and HPC clusters.
- Develop highly automated, reproducible data pipelines and CI/CD workflows that accelerate model development, testing, and deployment.
- Performance-tune infrastructure and models, optimizing resource utilization (GPU/TPU) and significantly improving computation efficiency.
- Collaborate cross-functionally with ML researchers, bioinformaticians, and scientists to translate research needs into scalable engineering solutions.
- Ensure system reliability, robustness, and high availability, proactively implementing comprehensive monitoring, logging, and alerting solutions.
- Champion infrastructure-as-code (IaC) practices, promoting clarity, reproducibility, security, and auditability.
Other
- Ambitious and Impact-Driven: You're inspired by working at the forefront of AI and biology, motivated by challenges that can significantly advance human health.
- Technical Excellence: You thrive in highly technical, complex environments and have a track record of turning cutting-edge research into robust production systems.
- Collaborative & Communicative: You excel at collaborating across disciplines, clearly articulating complex ideas, and driving alignment among research and engineering teams.
- Demonstrated ability to solve complex problems independently, with exceptional troubleshooting and system debugging skills.
- Excellent communication skills and experience collaborating within multidisciplinary teams.