University of York, York, YO10 5GH, UK.
Bradford Teaching Hospitals NHS Foundation Trust, Bradford BD9 6RJ, UK.
Stud Health Technol Inform. 2023 May 18;302:38-42. doi: 10.3233/SHTI230060.
Type 2 diabetes is a life-long health condition, and as it progresses, A range of comorbidities can develop. The prevalence of diabetes has increased gradually, and it is expected that 642 million adults will be living with diabetes by 2040. Early and proper interventions for managing diabetes-related comorbidities are important. In this study, we propose a Machine Learning (ML) model for predicting the risk of developing hypertension for patients who already have Type 2 diabetes. We used the Connected Bradford dataset, consisting of 1.4 million patients, as our main dataset for data analysis and model building. As a result of data analysis, we found that hypertension is the most frequent observation among patients having Type 2 diabetes. Since hypertension is very important to predict clinically poor outcomes such as risk of heart, brain, kidney, and other diseases, it is crucial to make early and accurate predictions of the risk of having hypertension for Type 2 diabetic patients. We used Naïve Bayes (NB), Neural Network (NN), Random Forest (RF), and Support Vector Machine (SVM) to train our model. Then we ensembled these models to see the potential performance improvement. The ensemble method gave the best classification performance values of accuracy and kappa values of 0.9525 and 0.2183, respectively. We concluded that predicting the risk of developing hypertension for Type 2 diabetic patients using ML provides a promising stepping stone for preventing the Type 2 diabetes progression.
2 型糖尿病是一种终身的健康状况,随着病情的发展,可能会出现一系列合并症。糖尿病的患病率逐渐增加,预计到 2040 年,将有 6.42 亿成年人患有糖尿病。早期和适当的干预措施对于管理与糖尿病相关的合并症非常重要。在这项研究中,我们提出了一种用于预测已经患有 2 型糖尿病的患者发生高血压风险的机器学习 (ML) 模型。我们使用了由 140 万患者组成的 Connected Bradford 数据集作为我们的主要数据集进行数据分析和模型构建。通过数据分析,我们发现高血压是 2 型糖尿病患者中最常见的观察结果。由于高血压对于预测心脏、大脑、肾脏和其他疾病等临床不良结局非常重要,因此对于 2 型糖尿病患者发生高血压的风险进行早期和准确的预测至关重要。我们使用朴素贝叶斯 (NB)、神经网络 (NN)、随机森林 (RF) 和支持向量机 (SVM) 来训练我们的模型。然后,我们对这些模型进行集成,以观察潜在的性能改进。集成方法给出了最佳的分类性能值,准确性和kappa 值分别为 0.9525 和 0.2183。我们得出结论,使用 ML 预测 2 型糖尿病患者发生高血压的风险为预防 2 型糖尿病的进展提供了一个有前途的起点。