Moffitt Cancer Center is building the next generation of regulatory-grade AI models in oncology and needs to transform complex clinical data into model-ready datasets to drive predictive modeling and digital biomarker discovery.
Requirements
- Fluent in SQL and Python; 3–5+ years building analytical datasets.
- Comfortable in modern data-warehouse environments (Snowflake, BigQuery, Redshift).
- Familiarity with healthcare, biomedical, or translational datasets (EMR, lab, imaging, omics).
- Three years of hands-on experience designing and querying relational data models or analytical datasets.
Responsibilities
- Query and integrate clinical, lab, imaging, and genomic data into clean, version-controlled datasets.
- Write efficient SQL and Python transformations in Snowflake or similar environments.
- Partner with ML scientists to define features and test new data sources, including LLM-based text extraction.
- Document lineage and reproducibility for regulatory-grade traceability.
Other
- Master’s degree in Computer Science, Data Science, Informatics, or a related quantitative field.
- In lieu of a master’s degree, a bachelor’s degree plus 2 additional years of hands-on experience designing and querying analytical datasets in healthcare, biomedical research, or clinical environments (5 years of related experience in total) may be considered.
- Enjoy hands-on data work, fast iteration, and tangible clinical impact.
- Full Time, Day Shift, M-F, 8am - 5pm