The UPenn Department of Genetics is seeking a Senior Data Scientist to design and implement clinical genomic pipelines, with the goal of improving our understanding of genetic disorders and our ability to diagnose them through large-scale, clinical-grade genomic testing.
Requirements
- implementation skills in at least one of R, Perl, or Python
- comfortable in UNIX/Linux environments
- producing well-documented code
- Multi-year experience (>10 years) with cloud computing or HPC is essential
- practical experience in one of these two areas is essential
- Master of Science and 1 to 2 years of experience, or an equivalent combination of education and experience required
- Ph.D. preferred
Responsibilities
- develop and implement new quantitative strategies to improve our understanding of genetic disorders and our ability to diagnose them through large-scale, clinical-grade genomic testing
- develop, extend, and apply computational data-analysis pipelines to patient cohorts ranging from tens to hundreds of thousands of individuals, executed at scale via distributed software solutions on both local HPC and cloud-based assets
- work with molecular (WGS, panel-sequencing, RNA-Seq, proteomic) or imaging (digital pathology, radiomic) data
- implementation skills in at least one of R, Perl, or Python
- comfortable in UNIX/Linux environments
- producing well-documented code
Other
- strong background in biology to help ensure the right questions are asked
- strong technical and personal communication skills, to help ensure insights are broadly adopted and appropriate analyses are performed
- strong interpersonal skills
- This position is contingent upon grant funding.
- Background checks may be required after a conditional job offer is made.