Suppr超能文献

使用机器学习方法检测糖尿病的高危因素和早期诊断。

Detecting High-Risk Factors and Early Diagnosis of Diabetes Using Machine Learning Methods.

机构信息

Department of Information Systems, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia.

Department of Computer Sciences, College of Computer and Information Sciences, Princess Nourah Bint Abdulrahman University, Riyadh, Saudi Arabia.

出版信息

Comput Intell Neurosci. 2022 Sep 29;2022:2557795. doi: 10.1155/2022/2557795. eCollection 2022.

Abstract

Diabetes is a chronic disease that can cause several forms of chronic damage to the human body, including heart problems, kidney failure, depression, eye damage, and nerve damage. There are several risk factors involved in causing this disease, with some of the most common being obesity, age, insulin resistance, and hypertension. Therefore, early detection of these risk factors is vital in helping patients reverse diabetes from the early stage to live healthy lives. Machine learning (ML) is a useful tool that can easily detect diabetes from several risk factors and, based on the findings, provide a decision-based model that can help in diagnosing the disease. This study aims to detect the risk factors of diabetes using ML methods and to provide a decision support system for medical practitioners that can help them in diagnosing diabetes. Moreover, besides various other preprocessing steps, this study has used the synthetic minority over-sampling technique integrated with the edited nearest neighbor (SMOTE-ENN) method for balancing the BRFSS dataset. The SMOTE-ENN is a more powerful method than the individual SMOTE method. Several ML methods were applied to the processed BRFSS dataset and built prediction models for detecting the risk factors that can help in diagnosing diabetes patients in the early stage. The prediction models were evaluated using various measures that show the high performance of the models. The experimental results show the reliability of the proposed models, demonstrating that k-nearest neighbor (KNN) outperformed other methods with an accuracy of 98.38%, sensitivity, specificity, and ROC/AUC score of 98%. Moreover, compared with the existing state-of-the-art methods, the results confirm the efficacy of the proposed models in terms of accuracy and other evaluation measures. The use of SMOTE-ENN is more beneficial for balancing the dataset to build more accurate prediction models. This was the main reason it was possible to achieve models more accurate than the existing ones.

摘要

糖尿病是一种慢性病,可对人体造成多种形式的慢性损害,包括心脏问题、肾衰竭、抑郁、眼部损伤和神经损伤。导致这种疾病的风险因素有几个,其中最常见的是肥胖、年龄、胰岛素抵抗和高血压。因此,早期发现这些风险因素对于帮助患者从早期阶段逆转糖尿病、过上健康的生活至关重要。机器学习 (ML) 是一种有用的工具,可以很容易地从几个风险因素中检测出糖尿病,并根据发现结果提供一个基于决策的模型,帮助诊断疾病。本研究旨在使用 ML 方法检测糖尿病的风险因素,并为医疗从业者提供决策支持系统,帮助他们诊断糖尿病。此外,除了各种其他预处理步骤外,本研究还使用了集成编辑最近邻 (SMOTE-ENN) 方法的合成少数过采样技术 (SMOTE-ENN) 来平衡 BRFSS 数据集。SMOTE-ENN 是一种比单独的 SMOTE 方法更强大的方法。将几种 ML 方法应用于处理后的 BRFSS 数据集,并构建用于检测有助于早期诊断糖尿病患者的风险因素的预测模型。使用各种度量标准评估预测模型,这些度量标准表明模型的性能很高。实验结果表明了所提出模型的可靠性,表明 K-最近邻 (KNN) 以 98.38%的准确率、敏感性、特异性和 ROC/AUC 分数超过了其他方法。此外,与现有的最先进方法相比,结果证实了所提出模型在准确性和其他评估指标方面的有效性。使用 SMOTE-ENN 更有利于平衡数据集以构建更准确的预测模型。这是实现比现有模型更准确模型的主要原因。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/450b/9536939/961ecd1338e9/CIN2022-2557795.001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验