Axle is seeking to solve complex data challenges that directly impact public health by building and maintaining the foundational infrastructure of the National Clinical Cohort Collaborative (N3C), the nation’s largest and most significant public repository of harmonized electronic health record (EHR) data.
Requirements
- Expert-level proficiency in Python and SQL, with a proven track record of building and maintaining complex, large-scale data pipelines and ETL processes.
- Significant experience with healthcare data is essential, with deep, practical knowledge of common data models (CDMs), particularly OMOP and/or FHIR, and experience with clinical terminologies (e.g., ICD, SNOMED, RxNorm).
- Strong experience with big data technologies (e.g., Apache Spark, Hadoop) and containerization using Docker for creating reproducible and scalable workflows.
- Proficiency with version control (Git) and CI/CD practices for data infrastructure.
- An architectural mindset with the ability to design for scalability, reliability, and security.
- Experience designing and deploying data solutions on cloud platforms (AWS, GCP, Azure)
- Proficiency with modern workflow management systems (e.g., Nextflow, Snakemake, Airflow)
Responsibilities
- Architect and Modernize National-Scale Data Pipelines: Design, develop, and optimize robust, disease-agnostic data acquisition and ingestion pipelines built to handle the complexity and scale of N3C.
- Master Data Integration and Harmonization: Tackle the complex challenge of harmonizing heterogeneous clinical data from countless sources.
- Build the Future with Dynamic Workspaces: Be a key technical player in developing the infrastructure for N3C's new Dynamic Workspaces.
- Champion Data Quality and Governance: Develop and implement sophisticated data quality frameworks, creating dashboards and feedback loops to ensure our data partners and researchers have transparent insight into data completeness, consistency, and quality.
- Innovate with Advanced Technologies: Integrate critical new data sources, including national mortality data and CMS.
- Collaborate and Lead: Work alongside a world-class team of scientists, project managers, and engineers to translate scientific needs into technical solutions.
Other
- Bachelor's or Master's degree in Computer Science, Data Engineering, Bioinformatics, or a related field, with 8+ years of hands-on experience in data engineering (or 5+ years with a Master's).
- A deep passion for using technology to solve meaningful problems in healthcare and medical research.
- Must be able to work in a regulated data environment (e.g., FISMA, HIPAA).
- Must be able to provide equal opportunity in all aspects of employment and will not tolerate any illegal discrimination or harassment based on age, race, gender, religion, national origin, disability, marital status, covered veteran status, sexual orientation, status with respect to public assistance, and other characteristics protected under state, federal, or local law.
- Accessibility: If you need an accommodation as part of the employment process please contact: careers@axleinfo.com