Analisis Prediktif Dropout Mahasiswa Berdasarkan Kinerja Akademik Semester Awal Menggunakan Machine Learning
DOI:
https://doi.org/10.30998/ed38e865Keywords:
dropout, academic performance, machine learning, Random Forest, Gradient BoostingAbstract
Student dropout is a critical issue in higher education. This study aims to develop a predictive model of dropout based on early academic performance using Random Forest and Gradient Boosting algorithms. The dataset, sourced from the UCI Repository, contains 4,424 student records. Key features analyzed include the number of enrolled courses, evaluations, average grades, and enrollment age. Results show that the Gradient Boosting algorithm achieved 70.05% accuracy, while Random Forest reached 70.16%, both performing best in classifying graduates. The model successfully identifies high-risk students, although challenges remain in predicting “enrolled” status. These findings highlight the potential of machine learning for early dropout detection and support more targeted academic interventions.
Downloads
References
Addison, L., & Williams, D. (2023). Predicting student retention in higher education institutions (HEIs). Higher Education, Skills and Work-Based Learning, 13(5), 865–885. https://doi.org/10.1108/HESWBL-12-2022-0257
Alshboul, O., Shehadeh, A., Almasabha, G., & Almuflih, A. S. (2022). Extreme Gradient Boosting-Based Machine Learning Approach for Green Building Cost Prediction. Sustainability, 14(11), 6651. https://doi.org/10.3390/su14116651
Andrianof, H., Gusman, A. P., & Putra, O. A. (2025). Implementasi Algoritma Random Forest untuk Prediksi Kelulusan Mahasiswa Berdasarkan Data Akademik: Studi Kasus di Perguruan Tinggi Indonesia. Jurnal Sains Informatika Terapan (JSIT) E-ISSN, 4(1), 2025.
Colpo, M. P., Thompsen Primo, T., Aguiar, M. S. de, & Cechinel, C. (2024). Educational Data Mining for Dropout Prediction: Trends, Opportunities, and Challenges. Revista Brasileira de Informática Na Educação, 32, 220–256. https://doi.org/10.5753/rbie.2024.3559
Crowther, P., & Briant, S. (2021). Predicting Academic Success: A Longitudinal Study of University Design Students. International Journal of Art & Design Education, 40(1), 20–34. https://doi.org/10.1111/jade.12329
Feng, G., & Fan, M. (2024). Research on learning behavior patterns from the perspective of educational data mining: Evaluation, prediction and visualization. Expert Systems with Applications, 237, 121555. https://doi.org/10.1016/j.eswa.2023.121555
Fitriana, S., Rinianty, & Pratama, S. A. (2024). Prediksi Siswa Putus Sekolah dan Keberhasilan Akademik Menggunakan Machine Learning. The Indonesian Journal of Computer Science, 13(6).
Gabriel, K. F., & Flake, S. M. (2023). Teaching Unprepared Students. Routledge. https://doi.org/10.4324/9781003447450
Herbaut, E. (2021). Overcoming failure in higher education: Social inequalities and compensatory advantage in dropout patterns. Acta Sociologica, 64(4), 383–402. https://doi.org/10.1177/0001699320920916
Kabathova, J., & Drlik, M. (2021). Towards Predicting Student’s Dropout in University Courses Using Different Machine Learning Techniques. Applied Sciences, 11(7), 3130. https://doi.org/10.3390/app11073130
Laksita, A. L., & Sasi, K. (2024). Studi Komparasi Kurikulum Pendidikan Tingkat Menengah di Finlandia dan Norwegia. Jurnal Multidisiplin West Science, 3(10), 1592–1606. https://doi.org/10.58812/jmws.v3i10.1619
Mulyo, H., & Khanif Zyen, A. (2025). Pengaruh Hyperparameter Tuning Gradient Boosting Terhadap Prediksi Pemilihan Program Studi Mahasiswa Baru. BULLETIN OF COMPUTER SCIENCE RESEARCH, 5(2), 131–137. https://doi.org/10.47065/bulletincsr.v5i2.454
Nurmalitasari, Awang Long, Z., & Faizuddin Mohd Noor, M. (2023). Factors Influencing Dropout Students in Higher Education. Education Research International, 2023, 1–13. https://doi.org/10.1155/2023/7704142
Ouadah, A., Zemmouchi-Ghomari, L., & Salhi, N. (2022). Selecting an appropriate supervised machine learning algorithm for predictive maintenance. The International Journal of Advanced Manufacturing Technology, 119(7–8), 4277–4301. https://doi.org/10.1007/s00170-021-08551-9
Salman, H. A., Kalakech, A., & Steiti, A. (2024). Random Forest Algorithm Overview. Babylonian Journal of Machine Learning, 2024, 69–79. https://doi.org/10.58496/BJML/2024/007
Sudarman, E. J., & Budi, S. (2023). Pengembangan Model Kecerdasan Mesin Extreme Gradient Boosting untuk Prediksi Keberhasilan Studi Mahasiswa. Jurnal Strategi, 5(2), 297–314.
Syafii, A., Bahar, B., Shobicah, S., & Muharam, A. (2023). Pengukuran Indeks Mutu Pendidikan Berbasis Standar Nasional. Jurnal Multidisiplin Indonesia, 2(7), 1697–1701. https://doi.org/10.58344/jmi.v2i7.332
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Putu Satya Saputra (Author)

This work is licensed under a Creative Commons Attribution 4.0 International License.





