Suppr超能文献

比较机器学习与传统统计学方法在未诊断糖尿病预测模型中的应用。

Comparisons of the prediction models for undiagnosed diabetes between machine learning versus traditional statistical methods.

机构信息

Department of Sports Industry Studies, Yonsei University, Seoul, Republic of Korea.

Frontier Research Institute of Convergence Sports Science, Yonsei University, Seoul, Republic of Korea.

出版信息

Sci Rep. 2023 Aug 11;13(1):13101. doi: 10.1038/s41598-023-40170-0.

Abstract

We compared the prediction performance of machine learning-based undiagnosed diabetes prediction models with that of traditional statistics-based prediction models. We used the 2014-2020 Korean National Health and Nutrition Examination Survey (KNHANES) (N = 32,827). The KNHANES 2014-2018 data were used as training and internal validation sets and the 2019-2020 data as external validation sets. The receiver operating characteristic curve area under the curve (AUC) was used to compare the prediction performance of the machine learning-based and the traditional statistics-based prediction models. Using sex, age, resting heart rate, and waist circumference as features, the machine learning-based model showed a higher AUC (0.788 vs. 0.740) than that of the traditional statistical-based prediction model. Using sex, age, waist circumference, family history of diabetes, hypertension, alcohol consumption, and smoking status as features, the machine learning-based prediction model showed a higher AUC (0.802 vs. 0.759) than the traditional statistical-based prediction model. The machine learning-based prediction model using features for maximum prediction performance showed a higher AUC (0.819 vs. 0.765) than the traditional statistical-based prediction model. Machine learning-based prediction models using anthropometric and lifestyle measurements may outperform the traditional statistics-based prediction models in predicting undiagnosed diabetes.

摘要

我们比较了基于机器学习的未诊断糖尿病预测模型与基于传统统计学的预测模型的预测性能。我们使用了 2014-2020 年韩国国家健康和营养检查调查(KNHANES)(N=32827)的数据。KNHANES 2014-2018 年的数据用于训练和内部验证集,2019-2020 年的数据用于外部验证集。我们使用接收者操作特征曲线下的曲线面积(AUC)来比较基于机器学习和基于传统统计学的预测模型的预测性能。使用性别、年龄、静息心率和腰围作为特征,基于机器学习的模型显示出更高的 AUC(0.788 比 0.740)。使用性别、年龄、腰围、糖尿病家族史、高血压、饮酒和吸烟状况作为特征,基于机器学习的预测模型显示出更高的 AUC(0.802 比 0.759)。使用最大预测性能的特征的基于机器学习的预测模型显示出更高的 AUC(0.819 比 0.765)。使用人体测量和生活方式测量的基于机器学习的预测模型可能比基于传统统计学的预测模型在预测未诊断的糖尿病方面表现更好。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb3a/10421881/e705b1576043/41598_2023_40170_Fig1_HTML.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验