Ariadne Labs is seeking a data analyst to lead data analysis and coordination activities for a multi-state public health research project evaluating the integration of genomic sequencing into newborn screening programs, as well as contribute to other ongoing Lab projects. This includes developing and maintaining REDCap data systems, coordinating with study partners, applying advanced programming skills to manage and clean complex datasets, and conducting analyses in collaboration with research teams.
Requirements
- Expertise in REDCap, including project design, longitudinal data capture, data quality checks, and complex workflows
- Expertise in an analytical programming language (R or Python), with the ability to independently design, optimize, and troubleshoot complex data workflows
- Strong grasp of descriptive and inferential statistics
- Experience with data visualization tools (Tableau, Looker, etc.)
- Experience using version control tools (e.g., Git/GitHub) to manage code
- Familiarity with ETL/ELT processes and workflow automation
- Statistical modeling including regression analysis
Responsibilities
- Design, build, and maintain secure and scalable REDCap databases and data systems to manage complex, multi-source data streams.
- Coordinate with study partners and remote teams to design, implement, and maintain robust data systems that support the setup and ongoing operations of the study.
- Ensure data completeness, consistency, and integrity across multiple sites by developing and automating data quality checks and monitoring reports.
- Apply advanced programming techniques to clean and transform raw data into usable formats for analysis and reporting.
- Develop dashboards, reports, and visualisations to communicate findings to technical and non-technical stakeholders.
- Conduct exploratory and statistical analyses to identify trends, patterns, and correlations.
- Collaborate on other projects across Ariadne Labs by contributing to data-related activities as needed, including data collection, integration, cleaning, analysis, and visualization.
Other
- Serve as the lead data analyst for a large, NIH-funded, multi-site feasibility study examining implementation of genomic newborn screening within existing state public health programs, spanning ~14 implementation sites, ~7 public health laboratories, and up to 30,000 newborns.
- BA/BS required - preferably in Statistics, Data Science, Economics, Computer Science, Public Health, or a related field.
- 3–5 years of experience managing and analyzing complex health research datasets, preferably in multi-site clinical or public health studies.
- Strong critical thinking skills and exceptional attention to detail
- Strong communication and presentation skills and experience working collaboratively with cross-functional teams