ABSTRACT
Predicting student academic performance has become a crucial focus in educational data mining, enabling timely interventions to support at-risk students and improve overall outcomes. This paper proposes a novel PolyFeatureML that enhances accuracy by generating higher-order interactions between features, capturing complex relationships that may not be evident through traditional methods. PolyFeature significantly improves the predictive capabilities of the models by enriching the feature space with non-linear interactions, making them particularly useful for identifying patterns in student performance data. Also, the study explores the use of various machine learning (ML) models, such as Logistic Regression, Decision Trees, SVM, Random Forests, LightGBM, XGBoost, CatBoost, and Neural Networks, to predict student performance based on demographic, academic, and behavioral data. The models are evaluated using multiple metrics, including accuracy, precision, recall, F1-score, specificity, and ROC-AUC, offering a comprehensive view of their performance. Our findings demonstrate that ensemble models like Random Forests and boosting techniques achieve high accuracy, while the integration of PolyFeatures leads to significant improvements in performance across various classifiers. This study reports a framework for data-driven educational policies, offering insights on how the proposed model can improve the prediction accuracy, supporting early intervention approaches, and ultimately enhancing student outcomes.
References
- Ahmar, A. S., & Rahman, A. (2018). Development of prediction model of student academic performance. Journal of Physics: Conference Series, 1028(1), 012162.
- Lundberg, S. M., & Lee, S. I. (2017). A unified approach to interpreting model predictions. In Advances in neural information processing systems (pp. 4765-4774).
- Fan, X., & Sun, W. (2017). Research on improved K-means clustering algorithm for college students. Journal of Physics: Conference Series, 887(1), 012022.
- Gray, K. E., & Perkins, M. C. (2019). Early engagement data for predicting student performance and risk of failure. Computers & Education, 136, 60-68.
- Francis, J., & Babu, M. S. (2019). A survey on predictive analytics in higher education to enhance student performance. Journal of Physics: Conference Series, 1228(1), 012011.
- Mason, C. M., McFarland, J. W., & Shorter, S. H. (2018). Predicting student attrition using neural networks and course grades. Journal of College Student Retention: Research, Theory & Practice, 20(2), 172-186.
- Fan, J., & Sun, M. (2021). Academic achievement prediction in higher education through interpretable modeling. PLOS ONE, 16(3), e0247590. https://doi.org/10.1371/journal.pone.0247590
- Serrano, R., & Casillas, J. (2024). Predicting Academic Success of College Students Using Machine Learning Techniques. Data Journal, 9(4), 60. https://doi.org/10.3390/data9040060
- Albreiki, I., Zaki, N., & Zubi, M. D. (2021). Predicting Student Performance Using Data Mining and Learning Analytics Techniques: A Systematic Literature Review. Applied Sciences, 11(19), 9023. https://doi.org/10.3390/app11199023
- Albreiki, I. M., Zaki, N., & Zubi, M. D. (2021). Educational Data Mining Techniques and Applications: A Review. Journal of Educational Technology Systems, 50(1), 41-73. https://doi.org/10.1177/00472395211014205
- Coussement, K., & De Caigny, A. (2020). Predicting Student Dropouts in Online Learning Using a Logit Leaf Model: Decision Support for Student Retention. Decision Support Systems, 138, 113382. https://doi.org/10.1016/j.dss.2020.113382
- Gray, J., & Perkins, D. (2019). Predicting Academic Challenges Using Engagement Data and Machine Learning Models. Journal of Educational Data Mining, 11(3), 1-19. https://jedm.educationaldatamining.org/index.php/JEDM/article/view/392
- Alelyani, S., & Zubi, M. (2021). Machine Learning Approaches for Predicting Academic Performance: A Review. International Journal of Advanced Computer Science and Applications, 12(1), 23-29. https://doi.org/10.14569/IJACSA.2021.0120146
Download all article in PDF