Wave Life Sciences needs to design and execute sequence-based machine learning projects that generate critical information for drug discovery and target assessment, and create and deploy data and test hypotheses related to human genetics and bioinformatics.
Requirements
- Experience with common human genetics resources and techniques
- Deployment and deployment of interactive application from scripting languages like R and Python (Shiny and streamlit)
- Experience with relational databases; hands on experience with MySQL is a plus.
- Experience in common NGS analyses task such mapping reads, quantification of gene expression, and/or variant calling.
- Experience with common bioinformatics tools like Samtools, GATK, Bowtie, HISAT2, BLAST, htseq-count, DESeq, Salmon, STAR, or Subread
- Knowledge of Illumina sequencing data analysis is necessary and experience with PacBio and/or Nanopore sequencing is a plus
- Familiarity with UNIX computing environment and proficiency in either Python or R scripting. Experience with docker and Nextflow pipeline is also strongly recommended.
Responsibilities
- Lead and oversee large data projects; design and build pipelines that process and analyze related data, store summary results and integrate with the corporate ELN
- Build applications to support bioinformatics and human genetics pipelines in R (shiny) and Python (streamlit)
- Develop, deploy, manage, and run applications using Docker containers
- Contribute to best practices for Wave’s storage-and-compute scalability of large bioinformatics datasets
- Develop pipelines that determine mapping coordinates and homology of oligonucleotides to genomes and transcriptomes
- Espouse best practices for version control for developed software (Git)
Other
- highly motivated and detail-orientated individual
- work within a team
- strong coding and software development background
- willingness to work in an interdisciplinary team of chemists, biologists, and software developers
- Ability to communicate finding of NGS data to a general scientific audience in both oral and written forms.