Moffitt Cancer Center seeks two Senior Generative AI Data Scientists to serve as subject-matter experts in generative AI, providing strategic guidance and technical expertise for large projects. The goal is to drive cross-functional collaboration and deliver production-ready solutions for high-impact use cases that advance cancer research and clinical applications.
Requirements
- Extensive expertise in designing and deploying large language model (LLM) architectures, including transformer-based systems such as GPT, Med-PaLM, and LLaMA.
- Experience with agentic AI systems, including building autonomous agents capable of multi-step reasoning, tool use, and dynamic task execution.
- Familiarity with vector databases, including indexing, semantic search, and integration with LLM-based applications.
- Experience with fine-tuning and prompt engineering for LLMs using frameworks like Hugging Face Transformers, OpenAI, or similar.
- Strong proficiency in Python/R and experience working with LLM APIs (e.g., OpenAI, Anthropic, Cohere) and/or open-source models (e.g., LLaMA, Mistral).
- Experience deploying GenAI systems in cloud environments (e.g., Azure, AWS) with MLOps tools and practices.
- Proficiency in embedding techniques and semantic search, including use of sentence transformers and similarity metrics.
Responsibilities
- Lead the end-to-end design, development, and deployment of cutting-edge generative AI solutions that are safe, explainable, and scalable, to advance cancer research and real-world clinical applications.
- Lead the full machine learning lifecycle, including data preprocessing, feature engineering, model development, evaluation, and optimization for scalable deployment in the cloud.
- Mentor and guide junior data scientists, fostering a culture of innovation, technical excellence, best practices, standards of development, and continuous learning within the AI team.
- Drive cross-functional collaboration among Health Data Services, IT architecture, engineering, and clinical domain experts to deliver production-ready solutions for high-impact use cases.
Other
- Master’s degree in Computer Science, Biomedical Informatics, Machine Learning, or a related field.
- Ph.D. preferred.
- Eight (8) years of experience in applied machine learning and large language models, including three (3) years in advanced generative AI solutions.
- Three (3) years of hands-on experience designing and implementing Retrieval-Augmented Generation (RAG) pipelines using frameworks such as LangChain or LlamaIndex.
- Ability to communicate complex technical concepts to non-technical stakeholders and influence strategic decisions.