The Vera Institute of Justice's Incarceration and Inequality Project (IIP) aims to document how mass incarceration has impoverished communities and widened racial disparities in income and wealth over the past five decades by creating a national dataset and data products.
Requirements
- Proficiency developing code collaboratively using GitHub
- Experience working with large administrative or survey datasets
- Familiarity with U.S. Census data preferred
- Experience collecting and processing data using Python or R
- Python and/or R
- SQL
- GitHub
- Google Cloud Platform (BigQuery & Google Cloud Storage)
- Command Line
Responsibilities
- Identify and process complex datasets in collaboration with the Senior Data Scientist for incorporation into our public and internal analysis datasets.
- Transform individual-level monthly data into annual, county-level estimate
- Prepare summaries of data quality and completeness, and create documentation of data processing to inform methodologies and limitations of analyses
- Conduct exploratory data analysis and prepare descriptive statistics of core datasets in collaboration with the Senior Data Scientist to support IIP product development
- Identify additional sources for longitudinal, national economics data
- Participate in data science and IIP team meetings, and share updates with the tea
Other
- Currently enrolled in a graduate program in data science, computer science, statistics, economics, public policy, demography, or a related discipline
- Commitment to using research and analysis to end mass incarceration and undo racism and inequity in the criminal legal system
- Lived experience as a person directly impacted by the criminal legal system
- Strong attention to detail and organizational skills
- The intern should be available to work 15-20 hours per week during the Fall.