Predicting Diabetes Mellitus with Machine Learning Techniques
DOI:
https://doi.org/10.58564/IJSER.4.2.2025.315Keywords:
AES, RSA, ECC, ChaCha20, Hybrid Algorithms and Encryption Algorithms.Abstract
Blood sugar issues are a major health issue worldwide, with their incidence growing rapidly and affecting human health, economic systems, and societal structures. If diabetes remains untreated and undiagnosed, it can cause blood sugar levels to vary significantly, potentially damaging essential organs like the kidneys, eyes, and arteries of the heart in critical cases. As a result, there is an increasing focus on the prevention and early detection of diabetes mellitus within the medical community. Utilizing machine learning algorithms to analyze appropriate datasets for early disease prediction could prove life-saving. The objective of this paper is to examine four algorithms that are proposed to enhance the diagnosis of diabetes. This research analyzes the effectiveness of various machine learning algorithms in processing datasets with minority classes. The evaluation was based on the classification report (including accuracy, precision, recall, and F1-score), the confusion matrix, and the ROC AUC. The Diabetes Prediction Dataset is used to evaluate four machine learning algorithms. The classifier that deserves a singular mention is the Artificial Neural Network (ANN), which achieves a 97% accuracy rate. This demonstrates its capability of classifying instances that are common and less common types. The Random Forest and Decision Tree models also perform well in terms of their ability to deliver strong performance, and the outcome shows some incremental differences, suggesting their ability to manage the dataset is quite high. However, the Support Vector Machine (SVM) model performs worse than all the above models at 96.36% and seems to struggle with the correct classification of less frequent instances. Therefore, it would be problematic to distinguish between classes that are prominent and those that are not. Notably, the ANN, Random Forest, and Decision Tree models effectively identify cases that are more likely to capture rare cases, an important aspect when dealing with datasets that have class imbalance.
References
M. Maniruzzaman, M. J. Rahman, B. Ahammed, and M. M. Abedin, “Classification and prediction of diabetes disease using machine learning paradigm,” Heal. Inf. Sci. Syst., vol. 8, no. 1, 2020, doi: 10.1007/s13755-019-0095-z.
[2] S. Perveen, M. Shahbaz, A. Guergachi, and K. Keshavjee, “Performance Analysis of Data Mining Classification Techniques to Predict Diabetes,” Procedia Comput. Sci., vol. 82, pp. 115–121, 2016, doi: 10.1016/j.procs.2016.04.016.
[3] N. Nai-Arun and R. Moungmai, “Comparison of Classifiers for the Risk of Diabetes Prediction,” Procedia Comput. Sci., vol. 69, pp. 132–142, 2015, doi: 10.1016/j.procs.2015.10.014.
[4] J. P. Kandhasamy and S. Balamurali, “Performance analysis of classifier models to predict diabetes mellitus,” Procedia Comput. Sci., vol. 47, pp. 45–51, 2015, doi: 10.1016/j.procs.2015.03.182.
[5] Á. C. Ribeiro, A. K. Barros, E. Santana, and J. C. Príncipe, “Diabetes classification using a redundancy reduction preprocessor,” Rev. Bras. Eng. Biomed., vol. 31, no. 2, pp. 97–106, 2015, doi: 10.1590/1517-3151.0608.
[6] Y. Hayashi and S. Yukita, “Rule extraction using Recursive-Rule extraction algorithm with J48graft combined with sampling selection techniques for the diagnosis of type 2 diabetes mellitus in the Pima Indian dataset,” Informatics Med. Unlocked, vol. 2, pp. 92–104, 2016, doi: 10.1016/j.imu.2016.02.001.
[7] F. Mercaldo, V. Nardone, and A. Santone, “Diabetes Mellitus Affected Patients Classification and Diagnosis through Machine Learning Techniques,” Procedia Comput. Sci., vol. 112, pp. 2519–2528, 2017, doi: 10.1016/j.procs.2017.08.193.
[8] P. Indoria and Y. K. Rathore, “A Survey : Detection and Prediction of Diabetes Using Machine Learning Techniques,” Int. J. Eng. Res. Technol., vol. 7, no. 03, pp. 287–291, 2018.
[9] M. Nilashi, O. Bin Ibrahim, A. Mardani, A. Ahani, and A. Jusoh, “A soft computing approach for diabetes disease classification,” Health Informatics J., vol. 24, no. 4, pp. 379–393, 2018, doi: 10.1177/1460458216675500.
[10] R. B. Lukmanto, Suharjito, A. Nugroho, and H. Akbar, “Early detection of diabetes mellitus using feature selection and fuzzy support vector machine,” Procedia Comput. Sci., vol. 157, pp. 46–54, 2019, doi: 10.1016/j.procs.2019.08.140.
[11] V. Rawat and Suryakant, “A classification system for diabetic patients with machine learning techniques,” Int. J. Math. Eng. Manag. Sci., vol. 4, no. 3, pp. 729–744, 2019, doi: 10.33889/IJMEMS.2019.4.3-057.
[12] R. Deo and S. Panigrahi, “Performance Assessment of Machine Learning Based Models for Diabetes Prediction,” in 2019 IEEE Healthcare Innovations and Point of Care Technologies, HI-POCT 2019, 2019, no. 11, pp. 147–150, doi: 10.1109/HI-POCT45284.2019.8962811.
[13] H. R. Divakar, D. Ramesh, and B. R. Prakash, “An Ontology Driven System to Predict Diabetes with Machine Learning Techniques,” Int. J. Innov. Technol. Explor. Eng., vol. 9, no. 2, pp. 4005–4011, 2019, doi: 10.35940/ijitee.b7586.129219.
[14] J. Ye, L. Yao, J. Shen, R. Janarthanam, and Y. Luo, “Predicting mortality in critically ill patients with diabetes using machine learning and clinical notes,” BMC Med. Inform. Decis. Mak., vol. 20, no. 11, pp. 1–8, 2020, doi: 10.1186/s12911-020-01318-4.
[15] G. L. A. Kumari, P. Padmaja, and J. S. G, “ENN-Ensemble based Neural Network Method for Diabetes Classification,” Int. J. Eng. Adv. Technol., vol. 9, no. 3, pp. 574–579, 2020, doi: 10.35940/ijeat.c4819.029320.
[16] G. Mainenti, L. Campanile, F. Marulli, C. Ricciardi, and A. S. Valente, “Machine learning approaches for diabetes classification: Perspectives to artificial intelligence methods updating,” in IoTBDS 2020 - Proceedings of the 5th International Conference on Internet of Things, Big Data and Security, 2020, pp. 533–540, doi: 10.5220/0009839405330540.
[17] Z. Bashir and H. Ghous, “A Comparative Analysis on Diabetes Datasets Using Data Mining Techniques,” LC Int. J. STEM (ISSN 2708-7123), 2020.
[18] H. Zhou, R. Myrzashova, and R. Zheng, “Diabetes prediction model based on an enhanced deep neural network,” Eurasip J. Wirel. Commun. Netw., vol. 2020, no. 1, 2020, doi: 10.1186/s13638-020-01765-7.
[19] R. H. Abiyev and H. Altiparmak, “Type-2 Fuzzy Neural System for Diagnosis of Diabetes,” Math. Probl. Eng., vol. 2021, 2021, doi: 10.1155/2021/5854966.
[20] W. L. Alyoubi, M. F. Abulkhair, and W. M. Shalash, “Diabetic retinopathy fundus image classification and lesions localization system using deep learning,” Sensors, vol. 21, no. 11, pp. 1–22, 2021, doi: 10.3390/s21113704.
[21] S. NAHZAT and M. YAĞANOĞLU, “Makine Öğrenimi Sınıflandırma Algoritmalarını Kullanarak Diyabet Tahmini,” Eur. J. Sci. Technol., no. 24, pp. 53–59, 2021, doi: 10.31590/ejosat.899716.
[22] T. Dudkina, I. Meniailov, K. Bazilevych, S. Krivtsov, and A. Tkachenko, “Classification and prediction of diabetes disease using decision tree method,” in CEUR Workshop Proceedings, 2021, vol. 2824, pp. 163–172.
[23] L. Fregoso-Aparicio, J. Noguez, L. Montesinos, and J. A. García-García, “Machine learning and deep learning predictive models for type 2 diabetes: a systematic review,” Diabetology and Metabolic Syndrome, vol. 13, no. 1. BioMed Central, 2021, doi: 10.1186/s13098-021-00767-9.
[24] N. Izzati, A. Kader, U. K. Yusof, and S. Naim, “A Study of Diabetic Retinopathy Classification Using Support Vector Machine A Study of Diabetic Retinopathy Classification Using Support Vector Machine,” 2021.
[25] A. M. Abdulazeez, “Classification Based on Decision Tree Algorithm for Machine Learning,” 2021, doi: 10.38094/jastt20165.
[26] Mustafa T. (2021). Diabetes Prediction Dataset. Kaggle. Retrieved from https://www.kaggle.com/datasets/iammustafatz/diabetes-prediction-dataset
[27] Abdalkareem, T. A., Zidan, K. A., & Albahri, A. S. (2024). A Systematic Review of Adversarial Machine Learning and Deep Learning Applications. Al-Iraqia Journal for Scientific Engineering Research, 3(4), 14-40.

Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Heba Ahmed Jassim, Omar R. Kadhim, Zahraa Khduair Taha, Johnny Koh Siaw Paw, Yaw Chong Tak, Tiong Sieh Kiong

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.