Project Overview#
This research project aimed to identify and quantify the key factors that influence minority student outcomes in higher education. Through rigorous statistical analysis of institutional data, we developed predictive models to understand success patterns and inform policy decisions.
Key Objectives#
- Identify critical factors affecting minority student retention and graduation rates
- Develop statistical models to predict student success outcomes
- Provide actionable insights for educational policy makers
- Create data visualizations to communicate findings effectively
Methodology#
Data Collection and Preparation#
- Dataset: Multi-institutional database with 15,000+ student records
- Time Period: 5-year longitudinal study (2017-2022)
- Variables: 25+ demographic, academic, and socioeconomic factors
- Data Cleaning: Handled missing values using multiple imputation techniques
Statistical Analysis Approach#
1. Exploratory Data Analysis#
# Key demographic distribution analysis
ggplot(student_data, aes(x = ethnicity, fill = graduation_status)) +
geom_bar(position = "fill") +
theme_minimal() +
labs(title = "Graduation Rates by Ethnicity")
2. Multiple Linear Regression#
- Primary Model: Graduation rate as dependent variable
- Independent Variables: GPA, family income, first-generation status, campus involvement
- Model Selection: Stepwise regression with AIC criteria
3. Logistic Regression#
- Binary Outcome: Student retention (Yes/No)
- Feature Engineering: Created interaction terms for significant variables
- Validation: 10-fold cross-validation
Key Findings#
Critical Success Factors#
Academic Preparedness (β = 0.45, p < 0.001)
- High school GPA strongest predictor
- STEM readiness significantly impacts outcomes
Financial Support (β = 0.32, p < 0.001)
- Scholarship availability reduces dropout risk by 40%
- Work-study participation shows positive correlation
Campus Engagement (β = 0.28, p < 0.01)
- Student organization participation
- Faculty mentorship programs
- Peer support networks
First-Generation Status (β = -0.21, p < 0.05)
- Requires additional support structures
- Navigation assistance crucial for success
Statistical Results#
Factor | Coefficient | P-value | 95% CI |
---|---|---|---|
HS GPA | 0.45 | < 0.001 | [0.38, 0.52] |
Financial Aid | 0.32 | < 0.001 | [0.24, 0.40] |
Campus Involvement | 0.28 | < 0.01 | [0.18, 0.38] |
First-Gen Status | -0.21 | < 0.05 | [-0.35, -0.07] |
Model Performance:
- R² = 0.68 (Multiple Linear Regression)
- AUC = 0.82 (Logistic Regression)
- Accuracy = 78% on validation set
Data Visualizations#
Interactive Dashboard Components#
- Outcome Prediction Tool: Real-time risk assessment
- Demographic Breakdown: Success rates by subgroup
- Intervention Impact: Before/after program analysis
- Geographic Patterns: Regional success variations
Key Visualizations Created#
- Correlation Heatmap: Variable relationships
- Survival Curves: Time-to-graduation analysis
- Box Plots: GPA distributions by ethnicity
- Scatter Plots: Academic vs. social factors
Impact and Recommendations#
Policy Implications#
Early Intervention Programs
- Target students with risk factors in first semester
- Implement predictive modeling for proactive support
Financial Aid Reform
- Increase need-based scholarship programs
- Expand work-study opportunities in relevant fields
Mentorship Enhancement
- Mandatory faculty-student mentorship for first-generation students
- Peer tutoring programs in STEM fields
Campus Culture Initiative
- Inclusive student organization development
- Cultural competency training for faculty and staff
Technologies Used#
R ggplot2 dplyr Statistical Modeling Data Visualization Regression Analysis
R Packages Utilized#
- Data Manipulation: dplyr, tidyr, readr
- Visualization: ggplot2, plotly, shiny
- Statistical Analysis: stats, car, MASS
- Model Validation: caret, pROC, boot
Research Outcome#
This analysis successfully identified actionable factors for improving minority student success in higher education. The statistical models developed have been implemented by three universities to guide their diversity and inclusion initiatives, resulting in measurable improvements in retention rates.
This project demonstrates the power of statistical analysis in addressing real-world educational challenges and creating meaningful change in higher education outcomes.
View All Projects