Job Board
LogoLogo

Get Jobs Tailored to Your Resume

Filtr uses AI to scan 1000+ jobs and finds postings that perfectly matches your resume

Axle Logo

Senior Healthcare Data Engineer

Axle

$150,000 - $190,000
Oct 15, 2025
Rockville, MD, US
Apply Now

Axle is seeking a Senior Healthcare Data Engineer to support the National Center for Advancing Translation Sciences (NCATS) at the National Institutes of Health (NIH) in building and maintaining the foundational infrastructure of the National Clinical Cohort Collaborative (N3C). This involves revolutionizing medical research by creating and managing a terabyte-scale public repository of harmonized electronic health record (EHR) data, enabling researchers to make discoveries faster and develop better treatments.

Requirements

  • Expert-level proficiency in Python and SQL, with a proven track record of building and maintaining complex, large-scale data pipelines and ETL processes.
  • Significant experience with healthcare data is essential. You must have deep, practical knowledge of common data models (CDMs), particularly OMOP and/or FHIR, and experience with clinical terminologies (e.g., ICD, SNOMED, RxNorm).
  • Strong experience with big data technologies (e.g., Apache Spark, Hadoop) and containerization using Docker for creating reproducible and scalable workflows.
  • Proficiency with version control (Git) and CI/CD practices for data infrastructure.
  • An architectural mindset with the ability to design for scalability, reliability, and security.
  • Experience designing and deploying data solutions on cloud platforms (AWS, GCP, Azure).
  • Proficiency with modern workflow management systems (e.g., Nextflow, Snakemake, Airflow).

Responsibilities

  • Architect and Modernize National-Scale Data Pipelines: Design, develop, and optimize robust, disease-agnostic data acquisition and ingestion pipelines built to handle the complexity and scale of N3C.
  • Master Data Integration and Harmonization: Tackle the complex challenge of harmonizing heterogeneous clinical data from countless sources. You will maintain and enhance the OMOP harmonization pipeline, improve interoperability between common data models (e.g., OMOP, PCORNet, FHIR), and ensure consistency for critical data like medications and lab values.
  • Build the Future with Dynamic Workspaces: Be a key technical player in developing the infrastructure for N3C's new Dynamic Workspaces. You will help build the systems that provision secure, project-specific analytical environments, giving researchers access to the specific data they need while providing institutions granular control.
  • Champion Data Quality and Governance: Develop and implement sophisticated data quality frameworks, creating dashboards and feedback loops to ensure our data partners and researchers have transparent insight into data completeness, consistency, and quality.
  • Innovate with Advanced Technologies: Integrate critical new data sources, including national mortality data and CMS. You will link datasets and help build the processes for integrating novel data types like geospatial and environmental data.
  • Collaborate and Lead: Work alongside a world-class team of scientists, project managers, and engineers to translate scientific needs into technical solutions. You will provide technical leadership and mentorship, driving best practices in an agile, mission-focused environment.

Other

  • A deep passion for using technology to solve meaningful problems in healthcare and medical research.
  • Bachelor's or Master's degree in Computer Science, Data Engineering, Bioinformatics, or a related field, with 8+ years of hands-on experience in data engineering (or 5+ years with a Master's).
  • Experience with privacy-preserving record linkage (PPRL) techniques and the challenges of working with de-identified patient data.
  • Familiarity with federated data systems and architectures.
  • Experience working in a regulated data environment (e.g., FISMA, HIPAA).