Volume 81, Issue 2, November 2025, Pages 113–119



Achi Harrisson Thiziers1 and Moussa KONE2
1 ISN Training and Research Unit, Virtual University of , Abidjan, Côte d’Ivoi, Côte d’Ivoire
2 UFR des Sciences de la Nature, Pôle de Recherche Environnement et Développement Durable, Université Nangui Abrogoua, Abidjan, Côte d’Ivoire
Original language: English
Copyright © 2025 ISSR Journals. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Student dropout rates are a major challenge for higher education. Both parents and academic institutions are seeking to reduce this phenomenon by investigating its root causes, as it has economic, social and institutional consequences. Based on an academic dataset enriched by SMOTE balancing, one-hot encoding and scaling, this article explores the application of machine learning (Random Forest, Gradient Boosting) and deep learning (MLP and CNN) techniques to predict this phenomenon and consider possible solutions. Exploring various data, such as grades, absences, failures, study time and family support, the models were compared through these metrics: Accuracy, F1-score, AUC-ROC, PR-curve. The results reveal the effectiveness of Gradient Boosting and Random Forest models (with an F1-score close to 1) over those of Multilayer Perceptron (F1-score = 0.84) and Convolutional Neural Networks (F1-score = 0,82). Analysis of the variables confirms the importance of mid-term marks (G2), absences and previous failures as key predictors. The article provides recommendations, including the parameters to be taken into account in early prediction, and opens up prospects for future work.
Author Keywords: student dropout, machine learning, early prediction, random forest, gradient boosting.



Achi Harrisson Thiziers1 and Moussa KONE2
1 ISN Training and Research Unit, Virtual University of , Abidjan, Côte d’Ivoi, Côte d’Ivoire
2 UFR des Sciences de la Nature, Pôle de Recherche Environnement et Développement Durable, Université Nangui Abrogoua, Abidjan, Côte d’Ivoire
Original language: English
Copyright © 2025 ISSR Journals. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Abstract
Student dropout rates are a major challenge for higher education. Both parents and academic institutions are seeking to reduce this phenomenon by investigating its root causes, as it has economic, social and institutional consequences. Based on an academic dataset enriched by SMOTE balancing, one-hot encoding and scaling, this article explores the application of machine learning (Random Forest, Gradient Boosting) and deep learning (MLP and CNN) techniques to predict this phenomenon and consider possible solutions. Exploring various data, such as grades, absences, failures, study time and family support, the models were compared through these metrics: Accuracy, F1-score, AUC-ROC, PR-curve. The results reveal the effectiveness of Gradient Boosting and Random Forest models (with an F1-score close to 1) over those of Multilayer Perceptron (F1-score = 0.84) and Convolutional Neural Networks (F1-score = 0,82). Analysis of the variables confirms the importance of mid-term marks (G2), absences and previous failures as key predictors. The article provides recommendations, including the parameters to be taken into account in early prediction, and opens up prospects for future work.
Author Keywords: student dropout, machine learning, early prediction, random forest, gradient boosting.
How to Cite this Article
Achi Harrisson Thiziers and Moussa KONE, “Comparative analysis of machine learning and deep learning approaches for predicting student dropout in higher education,” International Journal of Innovation and Scientific Research, vol. 81, no. 2, pp. 113–119, November 2025.