• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

比较四种机器学习模型预测中国人群2型糖尿病发病的准确性:一项回顾性研究。

Comparing the accuracy of four machine learning models in predicting type 2 diabetes onset within the Chinese population: a retrospective study.

作者信息

Liu Hongzhou, Dong Song, Yang Hua, Wang Linlin, Liu Jia, Du Yangfan, Liu Jing, Lyu Zhaohui, Wang Yuhan, Jiang Li, Yu Shasha, Fu Xiaomin

机构信息

Department of Endocrinology, Aerospace Center Hospital, Beijing, China.

Department of Endocrinology, First Hospital of Handan City, Handan, China.

出版信息

J Int Med Res. 2024 Jun;52(6):3000605241253786. doi: 10.1177/03000605241253786.

DOI:10.1177/03000605241253786
PMID:38870271
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11179491/
Abstract

OBJECTIVE

To evaluate the effectiveness of machine learning (ML) models in predicting 5-year type 2 diabetes mellitus (T2DM) risk within the Chinese population by retrospectively analyzing annual health checkup records.

METHODS

We included 46,247 patients (32,372 and 13,875 in training and validation sets, respectively) from a national health checkup center database. Univariate and multivariate Cox analyses were performed to identify factors influencing T2DM risk. Extreme Gradient Boosting (XGBoost), support vector machine (SVM), logistic regression (LR), and random forest (RF) models were trained to predict 5-year T2DM risk. Model performances were analyzed using receiver operating characteristic (ROC) curves for discrimination and calibration plots for prediction accuracy.

RESULTS

Key variables included fasting plasma glucose, age, and sedentary time. The LR model showed good accuracy with respective areas under the ROC (AUCs) of 0.914 and 0.913 in training and validation sets; the RF model exhibited favorable AUCs of 0.998 and 0.838. In calibration analysis, the LR model displayed good fit for low-risk patients; the RF model exhibited satisfactory fit for low- and high-risk patients.

CONCLUSIONS

LR and RF models can effectively predict T2DM risk in the Chinese population. These models may help identify high-risk patients and guide interventions to prevent complications and disabilities.

摘要

目的

通过回顾性分析年度健康体检记录,评估机器学习(ML)模型在中国人群中预测5年2型糖尿病(T2DM)风险的有效性。

方法

我们纳入了来自国家健康体检中心数据库的46247例患者(训练集和验证集分别为32372例和13875例)。进行单因素和多因素Cox分析以确定影响T2DM风险的因素。训练极端梯度提升(XGBoost)、支持向量机(SVM)、逻辑回归(LR)和随机森林(RF)模型以预测5年T2DM风险。使用受试者工作特征(ROC)曲线分析模型性能以进行鉴别,并使用校准图分析预测准确性。

结果

关键变量包括空腹血糖、年龄和久坐时间。LR模型显示出良好的准确性,训练集和验证集的ROC曲线下面积(AUC)分别为0.914和0.913;RF模型的AUC分别为0.998和0.838。在校准分析中,LR模型对低风险患者显示出良好的拟合度;RF模型对低风险和高风险患者均显示出令人满意的拟合度。

结论

LR和RF模型可以有效预测中国人群的T2DM风险。这些模型可能有助于识别高危患者,并指导采取干预措施以预防并发症和残疾。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e824/11179491/b1cba08d32da/10.1177_03000605241253786-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e824/11179491/72af7c15fe14/10.1177_03000605241253786-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e824/11179491/465c09967665/10.1177_03000605241253786-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e824/11179491/2606ec4db216/10.1177_03000605241253786-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e824/11179491/a434ec6baba8/10.1177_03000605241253786-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e824/11179491/b1cba08d32da/10.1177_03000605241253786-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e824/11179491/72af7c15fe14/10.1177_03000605241253786-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e824/11179491/465c09967665/10.1177_03000605241253786-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e824/11179491/2606ec4db216/10.1177_03000605241253786-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e824/11179491/a434ec6baba8/10.1177_03000605241253786-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e824/11179491/b1cba08d32da/10.1177_03000605241253786-fig5.jpg

相似文献

1
Comparing the accuracy of four machine learning models in predicting type 2 diabetes onset within the Chinese population: a retrospective study.比较四种机器学习模型预测中国人群2型糖尿病发病的准确性:一项回顾性研究。
J Int Med Res. 2024 Jun;52(6):3000605241253786. doi: 10.1177/03000605241253786.
2
Machine learning for characterizing risk of type 2 diabetes mellitus in a rural Chinese population: the Henan Rural Cohort Study.基于中国农村人群的机器学习特征分析 2 型糖尿病风险:河南农村队列研究。
Sci Rep. 2020 Mar 10;10(1):4406. doi: 10.1038/s41598-020-61123-x.
3
Interpretable machine learning for allergic rhinitis prediction among preschool children in Urumqi, China.中国乌鲁木齐学龄前儿童变应性鼻炎预测的可解释机器学习。
Sci Rep. 2024 Sep 27;14(1):22281. doi: 10.1038/s41598-024-73733-w.
4
A Risk Prediction Model for Physical Restraints Among Older Chinese Adults in Long-term Care Facilities: Machine Learning Study.长期护理机构中老年人身体约束的风险预测模型:机器学习研究。
J Med Internet Res. 2023 Apr 6;25:e43815. doi: 10.2196/43815.
5
A retrospective study differentiating nontuberculous mycobacterial pulmonary disease from pulmonary tuberculosis on computed tomography using radiomics and machine learning algorithms.一项使用放射组学和机器学习算法在计算机断层扫描上区分非结核分枝杆菌肺病与肺结核的回顾性研究。
Ann Med. 2024 Dec;56(1):2401613. doi: 10.1080/07853890.2024.2401613. Epub 2024 Sep 16.
6
[Constructing a predictive model for the death risk of patients with septic shock based on supervised machine learning algorithms].基于监督机器学习算法构建脓毒症休克患者死亡风险预测模型
Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2024 Apr;36(4):345-352. doi: 10.3760/cma.j.cn121430-20230930-00832.
7
Predictive etiological classification of acute ischemic stroke through interpretable machine learning algorithms: a multicenter, prospective cohort study.通过可解释的机器学习算法对急性缺血性脑卒中进行预测病因分类:一项多中心前瞻性队列研究。
BMC Med Res Methodol. 2024 Sep 10;24(1):199. doi: 10.1186/s12874-024-02331-1.
8
Prediction of poststroke independent walking using machine learning: a retrospective study.基于机器学习的脑卒中后独立行走预测:一项回顾性研究。
BMC Neurol. 2024 Sep 10;24(1):332. doi: 10.1186/s12883-024-03849-z.
9
Machine Learning Models in Type 2 Diabetes Risk Prediction: Results from a Cross-sectional Retrospective Study in Chinese Adults.机器学习模型在 2 型糖尿病风险预测中的应用:一项中国成年人横断面回顾性研究的结果。
Curr Med Sci. 2019 Aug;39(4):582-588. doi: 10.1007/s11596-019-2077-4. Epub 2019 Jul 25.
10
Application of machine learning model in predicting the likelihood of blood transfusion after hip fracture surgery.机器学习模型在预测髋部骨折手术后输血可能性中的应用。
Aging Clin Exp Res. 2023 Nov;35(11):2643-2656. doi: 10.1007/s40520-023-02550-4. Epub 2023 Sep 21.

引用本文的文献

1
Type 2 Diabetes Prediction Model in China: A Five-Year Systematic Review.中国2型糖尿病预测模型:一项为期五年的系统综述。
Healthcare (Basel). 2025 Aug 15;13(16):2007. doi: 10.3390/healthcare13162007.

本文引用的文献

1
2. Diagnosis and Classification of Diabetes: Standards of Care in Diabetes-2024.2. 糖尿病的诊断与分类:《2024年糖尿病医疗护理标准》
Diabetes Care. 2024 Jan 1;47(Suppl 1):S20-S42. doi: 10.2337/dc24-S002.
2
Generalizability challenges of mortality risk prediction models: A retrospective analysis on a multi-center database.死亡率风险预测模型的可推广性挑战:对多中心数据库的回顾性分析
PLOS Digit Health. 2022 Apr 5;1(4):e0000023. doi: 10.1371/journal.pdig.0000023. eCollection 2022 Apr.
3
Accuracy of Machine Learning Classification Models for the Prediction of Type 2 Diabetes Mellitus: A Systematic Survey and Meta-Analysis Approach.
机器学习分类模型预测 2 型糖尿病的准确性:系统调查和荟萃分析方法。
Int J Environ Res Public Health. 2022 Nov 1;19(21):14280. doi: 10.3390/ijerph192114280.
4
Micro Learning Support Vector Machine for Pattern Classification: A High-Speed Algorithm.微学习支持向量机模式分类:一种高速算法。
Comput Intell Neurosci. 2022 Aug 3;2022:4707637. doi: 10.1155/2022/4707637. eCollection 2022.
5
Impact of Income and Industry on New-Onset Diabetes among Employees: A Retrospective Cohort Study.收入和行业对员工新发糖尿病的影响:一项回顾性队列研究。
Int J Environ Res Public Health. 2022 Jan 19;19(3):1090. doi: 10.3390/ijerph19031090.
6
Machine learning and deep learning predictive models for type 2 diabetes: a systematic review.用于2型糖尿病的机器学习和深度学习预测模型:一项系统综述
Diabetol Metab Syndr. 2021 Dec 20;13(1):148. doi: 10.1186/s13098-021-00767-9.
7
Artificial Intelligence in Current Diabetes Management and Prediction.人工智能在当前糖尿病管理和预测中的应用。
Curr Diab Rep. 2021 Dec 13;21(12):61. doi: 10.1007/s11892-021-01423-2.
8
Secular incidence trends and effect of population aging on mortality due to type 1 and type 2 diabetes mellitus in China from 1990 to 2019: findings from the Global Burden of Disease Study 2019.1990 年至 2019 年中国 1 型和 2 型糖尿病死亡率的世俗发病趋势及人口老龄化的影响:来自 2019 年全球疾病负担研究的发现。
BMJ Open Diabetes Res Care. 2021 Nov;9(2). doi: 10.1136/bmjdrc-2021-002529.
9
Model-Based ROC Curve: Examining the Effect of Case Mix and Model Calibration on the ROC Plot.基于模型的 ROC 曲线:检验病例组合和模型校准对 ROC 图的影响。
Med Decis Making. 2022 May;42(4):487-499. doi: 10.1177/0272989X211050909. Epub 2021 Oct 16.
10
Triglyceride glucose-body mass index and the risk of diabetes: a general population-based cohort study.甘油三酯-葡萄糖-体重指数与糖尿病风险:一项基于一般人群的队列研究。
Lipids Health Dis. 2021 Sep 6;20(1):99. doi: 10.1186/s12944-021-01532-7.