Hennebelle Alain, Ismail Leila, Materwala Huned, Al Kaabi Juma, Ranjan Priya, Janardhanan Rajiv
School of Computing and Information Systems, The University of Melbourne, Australia.
Intelligent Distributed Computing and Systems Lab, Department of Computer Science and Software Engineering, College of Information Technology, United Arab Emirates University, United Arab Emirates.
Comput Struct Biotechnol J. 2023 Nov 23;23:212-233. doi: 10.1016/j.csbj.2023.11.038. eCollection 2024 Dec.
Diabetes Mellitus, one of the leading causes of death worldwide, has no cure to date and can lead to severe health complications, such as retinopathy, limb amputation, cardiovascular diseases, and neuronal disease, if left untreated. Consequently, it becomes crucial to be able to monitor and predict the incidence of diabetes. Machine learning approaches have been proposed and evaluated in the literature for diabetes prediction. This paper proposes an IoT-edge-Artificial Intelligence (AI)-blockchain system for diabetes prediction based on risk factors. The proposed system is underpinned by blockchain to obtain a cohesive view of the risk factors data from patients across different hospitals and ensure security and privacy of the user's data. We provide a comparative analysis of different medical sensors, devices, and methods to measure and collect the risk factors values in the system. Numerical experiments and comparative analysis were carried out within our proposed system, using the most accurate random forest (RF) model, and the two most used state-of-the-art machine learning approaches, Logistic Regression (LR) and Support Vector Machine (SVM), using three real-life diabetes datasets. The results show that the proposed system predicts diabetes using RF with 4.57% more accuracy on average in comparison with the other models LR and SVM, with 2.87 times more execution time. Data balancing without feature selection does not show significant improvement. When using feature selection, the performance is improved by 1.14% for PIMA Indian and 0.02% for Sylhet datasets, while it is reduced by 0.89% for MIMIC III.
糖尿病是全球主要死因之一,迄今为止无法治愈,如果不加以治疗,可能会导致严重的健康并发症,如视网膜病变、肢体截肢、心血管疾病和神经疾病。因此,能够监测和预测糖尿病的发病率变得至关重要。机器学习方法已在文献中被提出并用于糖尿病预测评估。本文提出了一种基于风险因素的物联网边缘人工智能(AI)区块链系统用于糖尿病预测。所提出的系统以区块链为基础,以获得来自不同医院患者的风险因素数据的连贯视图,并确保用户数据的安全性和隐私性。我们对系统中测量和收集风险因素值的不同医疗传感器、设备和方法进行了比较分析。在我们提出的系统内进行了数值实验和比较分析,使用最准确的随机森林(RF)模型,以及两种最常用的先进机器学习方法,逻辑回归(LR)和支持向量机(SVM),使用了三个真实的糖尿病数据集。结果表明,与其他模型LR和SVM相比,所提出的系统使用RF预测糖尿病的平均准确率提高了4.57%,执行时间多出2.87倍。无特征选择的数据平衡没有显示出显著改善。使用特征选择时,对于皮马印第安人数据集性能提高了1.14%,对于锡尔赫特数据集性能提高了0.02%,而对于MIMIC III数据集性能降低了0.89%。