Analisis Prediktif Dropout Mahasiswa Berdasarkan Kinerja Akademik Semester Awal Menggunakan Machine Learning

Authors

  • Putu Satya Saputra Politeknik Negeri Bali Author

DOI:

https://doi.org/10.30998/ed38e865

Keywords:

dropout, academic performance, machine learning, Random Forest, Gradient Boosting

Abstract

Student dropout is a critical issue in higher education. This study aims to develop a predictive model of dropout based on early academic performance using Random Forest and Gradient Boosting algorithms. The dataset, sourced from the UCI Repository, contains 4,424 student records. Key features analyzed include the number of enrolled courses, evaluations, average grades, and enrollment age. Results show that the Gradient Boosting algorithm achieved 70.05% accuracy, while Random Forest reached 70.16%, both performing best in classifying graduates. The model successfully identifies high-risk students, although challenges remain in predicting “enrolled” status. These findings highlight the potential of machine learning for early dropout detection and support more targeted academic interventions.

Downloads

Download data is not yet available.

References

Addison, L., & Williams, D. (2023). Predicting student retention in higher education institutions (HEIs). Higher Education, Skills and Work-Based Learning, 13(5), 865–885. https://doi.org/10.1108/HESWBL-12-2022-0257

Alshboul, O., Shehadeh, A., Almasabha, G., & Almuflih, A. S. (2022). Extreme Gradient Boosting-Based Machine Learning Approach for Green Building Cost Prediction. Sustainability, 14(11), 6651. https://doi.org/10.3390/su14116651

Andrianof, H., Gusman, A. P., & Putra, O. A. (2025). Implementasi Algoritma Random Forest untuk Prediksi Kelulusan Mahasiswa Berdasarkan Data Akademik: Studi Kasus di Perguruan Tinggi Indonesia. Jurnal Sains Informatika Terapan (JSIT) E-ISSN, 4(1), 2025.

Colpo, M. P., Thompsen Primo, T., Aguiar, M. S. de, & Cechinel, C. (2024). Educational Data Mining for Dropout Prediction: Trends, Opportunities, and Challenges. Revista Brasileira de Informática Na Educação, 32, 220–256. https://doi.org/10.5753/rbie.2024.3559

Crowther, P., & Briant, S. (2021). Predicting Academic Success: A Longitudinal Study of University Design Students. International Journal of Art & Design Education, 40(1), 20–34. https://doi.org/10.1111/jade.12329

Feng, G., & Fan, M. (2024). Research on learning behavior patterns from the perspective of educational data mining: Evaluation, prediction and visualization. Expert Systems with Applications, 237, 121555. https://doi.org/10.1016/j.eswa.2023.121555

Fitriana, S., Rinianty, & Pratama, S. A. (2024). Prediksi Siswa Putus Sekolah dan Keberhasilan Akademik Menggunakan Machine Learning. The Indonesian Journal of Computer Science, 13(6).

Gabriel, K. F., & Flake, S. M. (2023). Teaching Unprepared Students. Routledge. https://doi.org/10.4324/9781003447450

Herbaut, E. (2021). Overcoming failure in higher education: Social inequalities and compensatory advantage in dropout patterns. Acta Sociologica, 64(4), 383–402. https://doi.org/10.1177/0001699320920916

Kabathova, J., & Drlik, M. (2021). Towards Predicting Student’s Dropout in University Courses Using Different Machine Learning Techniques. Applied Sciences, 11(7), 3130. https://doi.org/10.3390/app11073130

Laksita, A. L., & Sasi, K. (2024). Studi Komparasi Kurikulum Pendidikan Tingkat Menengah di Finlandia dan Norwegia. Jurnal Multidisiplin West Science, 3(10), 1592–1606. https://doi.org/10.58812/jmws.v3i10.1619

Mulyo, H., & Khanif Zyen, A. (2025). Pengaruh Hyperparameter Tuning Gradient Boosting Terhadap Prediksi Pemilihan Program Studi Mahasiswa Baru. BULLETIN OF COMPUTER SCIENCE RESEARCH, 5(2), 131–137. https://doi.org/10.47065/bulletincsr.v5i2.454

Nurmalitasari, Awang Long, Z., & Faizuddin Mohd Noor, M. (2023). Factors Influencing Dropout Students in Higher Education. Education Research International, 2023, 1–13. https://doi.org/10.1155/2023/7704142

Ouadah, A., Zemmouchi-Ghomari, L., & Salhi, N. (2022). Selecting an appropriate supervised machine learning algorithm for predictive maintenance. The International Journal of Advanced Manufacturing Technology, 119(7–8), 4277–4301. https://doi.org/10.1007/s00170-021-08551-9

Salman, H. A., Kalakech, A., & Steiti, A. (2024). Random Forest Algorithm Overview. Babylonian Journal of Machine Learning, 2024, 69–79. https://doi.org/10.58496/BJML/2024/007

Sudarman, E. J., & Budi, S. (2023). Pengembangan Model Kecerdasan Mesin Extreme Gradient Boosting untuk Prediksi Keberhasilan Studi Mahasiswa. Jurnal Strategi, 5(2), 297–314.

Syafii, A., Bahar, B., Shobicah, S., & Muharam, A. (2023). Pengukuran Indeks Mutu Pendidikan Berbasis Standar Nasional. Jurnal Multidisiplin Indonesia, 2(7), 1697–1701. https://doi.org/10.58344/jmi.v2i7.332

Downloads

Published

2026-01-15

How to Cite

Putu Satya Saputra. (2026). Analisis Prediktif Dropout Mahasiswa Berdasarkan Kinerja Akademik Semester Awal Menggunakan Machine Learning. Jurnal Riset Dan Aplikasi Mahasiswa Informatika (JRAMI), 7(01), 164-171. https://doi.org/10.30998/ed38e865