The TRRC's mission of improving literacy outcomes requires high-quality data systems and rigorous data management practices. This role will support the preparation, validation, and management of large-scale educational research datasets to ensure data quality and support research and reporting.
Requirements
- Ability to use R and Python to query, clean, and concatenate data.
- Knowledge of SQL queries.
- Knowledge of database systems.
- Knowledge of local installation, testing and recovery tools and mechanisms.
- Ability to create tables, graphs, and other displays of data for different audiences.
- Knowledge of statistical analyses for multilevel data
Responsibilities
- Write code using database languages to create queries and reports in order to clean and standardize highly-variable data received from a variety of sources by identifying duplicate, missing, and/or inconsistent data, eliminating unnecessary data, performing character encodings, and formatting data consistently according to researcher needs.
- Assist PIs/researchers by performing preliminary analysis of data such as descriptive statistics, regression and correlation analyses to inform dataset quality.
- Write SQL queries to extract and migrate data from online data collection systems
- Write Python script to migrate data between databases
- Review existing database structure and provide suggestions to normalize data tables
- Check databases for errors during data collection
- Create databases, forms, and reports
Other
- Bachelor's degree in a relevant field; Mathematics, Computer Science, or a related discipline.
- Onsite
- Resume
- Cover Letter
- List of 3 Professional References