Education Data Science

Deeper Roots Before the Storm: Utilizing Machine Learning to Alert School Districts of Permanent School Closures

Project Year
2025
Abstract

The increasing rate of permanent school closures in U.S. public school districts presents unprecedented challenges for administrators and communities alike. This study develops an early-warning indicator model to predict mass closure events- defined as a district closing at least 10% of its schools- five years in advance. Leveraging administrative data from the National Center for Education Statistics from 2000-2018, we evaluated a suite of supervised machine learning models- including elastic-net regularized logistic regression, random forests, XGBoost, LSTM neural net works, and SuperLearner ensembles- to determine the degree to which they could predict mass closures using enrollment, financial, and demographic predictors. Comparative analysis based on Area Under the Precision–Recall Curve (AUC-PR), and Recall revealed that XGBoost provided predictive accuracy while effectively handling class imbalance. Our findings demonstrate the technical feasibility of using advanced analytics in educational settings and also offer a glimpse into their potential for generating actionable insights for policy-makers to proactively manage resources and support equitable decision-making in the face of systemic challenges

EDS Students

Michael Chrzan
Michael Chrzan
Class: 2025
Areas of interest: Systems-level design for educational equity and justice, networks, causal methods, and education policy