Patterns in Higher Education Outcome Gaps

Exploratory Analysis and Hypothesis Testing

Summary

This was an open-ended project for a class focused on reproducible data science. Using a federally maintained dataset on US Colleges, we defined “outcome gaps”, looking specifically at the difference in graduation rates between ethnic groups and post-graduation earnings between different household income groups. We looked for patterns in the data. We found a significant relationship between school expenditures per student and the difference in earnings between students from high and low income backgrounds.

Project Outline

Models and Tools

All scripts were written in R, with plots generated by ggplot, report written in Rnw, and slides in Rmd.

Here’s a simple shiny app visualizing this relationship.

The full repo is available here.

Direct Link to the paper.