TY - GEN
T1 - Predictive modeling based on machine learning strategies to forecast student dropout at a Peruvian university
T2 - 22nd LACCEI International Multi-Conference for Engineering, Education and Technology, LACCEI 2024
AU - Magdalena Aguilar Lopez, Kristelly
AU - Carbajal Ortega, Yuri
AU - Giovanni Martinez Hilario, Daril
AU - Rodriguez, Sol
N1 - Publisher Copyright:
© 2024 Latin American and Caribbean Consortium of Engineering Institutions. All rights reserved.
PY - 2024
Y1 - 2024
N2 - University students experiment different factors that bring as a consequence the abandonment of his professional career. In Perú, the dropout rate becomes a critical point of attention due to its increase since COVID-19. Despite the fact that the institutions join forces to improve student retention, these seem to be insufficient because of the root causes of the problem are not analyzed. Hence, this study aims to analyze the main causes associated to student dropout of a population of students from the academic period 2022-2 of a private university. For this purpose, three predictive models (random forest, logistic regression and decision tree) were designed to identify the main risks associated to abandonment of students. The predictive models were designed with the automatic learning method (Machine Learning) through Google Collab programming, obtaining a comparison of predicted dropout versus real dropouts, performing a model accuracy of 93% for the logistic regression model. Weighting the main risks identified, different retention strategies can be proposed to reduce the desertion rate.
AB - University students experiment different factors that bring as a consequence the abandonment of his professional career. In Perú, the dropout rate becomes a critical point of attention due to its increase since COVID-19. Despite the fact that the institutions join forces to improve student retention, these seem to be insufficient because of the root causes of the problem are not analyzed. Hence, this study aims to analyze the main causes associated to student dropout of a population of students from the academic period 2022-2 of a private university. For this purpose, three predictive models (random forest, logistic regression and decision tree) were designed to identify the main risks associated to abandonment of students. The predictive models were designed with the automatic learning method (Machine Learning) through Google Collab programming, obtaining a comparison of predicted dropout versus real dropouts, performing a model accuracy of 93% for the logistic regression model. Weighting the main risks identified, different retention strategies can be proposed to reduce the desertion rate.
KW - desertion
KW - machine learning
KW - predictive model
KW - University dropout
UR - http://www.scopus.com/inward/record.url?scp=85203795129&partnerID=8YFLogxK
U2 - 10.18687/LACCEI2024.1.1.1316
DO - 10.18687/LACCEI2024.1.1.1316
M3 - Conference contribution
AN - SCOPUS:85203795129
T3 - Proceedings of the LACCEI international Multi-conference for Engineering, Education and Technology
BT - Proceedings of the 22nd LACCEI International Multi-Conference for Engineering, Education and Technology
PB - Latin American and Caribbean Consortium of Engineering Institutions
Y2 - 17 July 2024 through 19 July 2024
ER -