• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习的遗传风险评分提高了 2 型糖尿病预测模型的判别效率:队列研究。

Genetic Risk Score Increased Discriminant Efficiency of Predictive Models for Type 2 Diabetes Mellitus Using Machine Learning: Cohort Study.

机构信息

Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou, China.

School of Information Engineering, Zhengzhou University, Zhengzhou, China.

出版信息

Front Public Health. 2021 Feb 17;9:606711. doi: 10.3389/fpubh.2021.606711. eCollection 2021.

DOI:10.3389/fpubh.2021.606711
PMID:33681127
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7925839/
Abstract

Previous studies have constructed prediction models for type 2 diabetes mellitus (T2DM), but machine learning was rarely used and few focused on genetic prediction. This study aimed to establish an effective T2DM prediction tool and to further explore the potential of genetic risk scores (GRS) via various classifiers among rural adults. In this prospective study, the GRS for a total of 5,712 participants from the Henan Rural Cohort Study was calculated. Cox proportional hazards (CPH) regression was used to analyze the associations between GRS and T2DM. CPH, artificial neural network (ANN), random forest (RF), and gradient boosting machine (GBM) were used to establish prediction models, respectively. The area under the receiver operating characteristic curve (AUC) and net reclassification index (NRI) were used to assess the discrimination ability of the models. The decision curve was plotted to determine the clinical-utility for prediction models. Compared with the individuals in the lowest quintile of the GRS, the HR (95% CI) was 2.06 (1.40 to 3.03) for those with the highest quintile of GRS ( < 0.05). Based on conventional predictors, the AUCs of the prediction model were 0.815, 0.816, 0.843, and 0.851 via CPH, ANN, RF, and GBM, respectively. Changes with the integration of GRS for CPH, ANN, RF, and GBM were 0.001, 0.002, 0.018, and 0.033, respectively. The reclassifications were significantly improved for all classifiers when adding GRS (NRI: 41.2% for CPH; 41.0% for ANN; 46.4% for ANN; 45.1% for GBM). Decision curve analysis indicated the clinical benefits of model combined GRS. The prediction model combined with GRS may provide incremental predictions of performance beyond conventional factors for T2DM, which demonstrated the potential clinical use of genetic markers to screen vulnerable populations. The Henan Rural Cohort Study is registered in the Chinese Clinical Trial Register (Registration number: ChiCTR-OOC-15006699). http://www.chictr.org.cn/showproj.aspx?proj=11375.

摘要

先前的研究已经构建了 2 型糖尿病(T2DM)的预测模型,但很少使用机器学习,且很少关注遗传预测。本研究旨在建立一种有效的 T2DM 预测工具,并通过农村成年人中的各种分类器进一步探索遗传风险评分(GRS)的潜力。

在这项前瞻性研究中,我们计算了来自河南农村队列研究的 5712 名参与者的 GRS。Cox 比例风险(CPH)回归用于分析 GRS 与 T2DM 之间的关联。分别使用 CPH、人工神经网络(ANN)、随机森林(RF)和梯度提升机(GBM)建立预测模型。使用接受者操作特征曲线下面积(AUC)和净重新分类指数(NRI)评估模型的区分能力。绘制决策曲线以确定预测模型的临床实用性。

与 GRS 最低五分位数的个体相比,GRS 最高五分位数的个体的 HR(95%CI)为 2.06(1.40 至 3.03)(<0.05)。基于常规预测因子,CPH、ANN、RF 和 GBM 预测模型的 AUC 分别为 0.815、0.816、0.843 和 0.851。CPH、ANN、RF 和 GBM 纳入 GRS 后,AUC 分别变化 0.001、0.002、0.018 和 0.033。添加 GRS 后,所有分类器的重新分类均显著改善(NRI:CPH 为 41.2%;ANN 为 41.0%;RF 为 46.4%;GBM 为 45.1%)。决策曲线分析表明,模型结合 GRS 具有临床获益。

综上所述,与传统因素相结合的 GRS 预测模型可能为 T2DM 提供额外的预测性能,这表明遗传标志物在筛选高危人群方面具有潜在的临床应用价值。

河南农村队列研究在中国临床试验注册中心(注册号:ChiCTR-OOC-15006699)注册。http://www.chictr.org.cn/showproj.aspx?proj=11375。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47da/7925839/8ccab305f105/fpubh-09-606711-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47da/7925839/68780274c9a1/fpubh-09-606711-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47da/7925839/e080339e2054/fpubh-09-606711-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47da/7925839/8ccab305f105/fpubh-09-606711-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47da/7925839/68780274c9a1/fpubh-09-606711-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47da/7925839/e080339e2054/fpubh-09-606711-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47da/7925839/8ccab305f105/fpubh-09-606711-g0003.jpg

相似文献

1
Genetic Risk Score Increased Discriminant Efficiency of Predictive Models for Type 2 Diabetes Mellitus Using Machine Learning: Cohort Study.基于机器学习的遗传风险评分提高了 2 型糖尿病预测模型的判别效率:队列研究。
Front Public Health. 2021 Feb 17;9:606711. doi: 10.3389/fpubh.2021.606711. eCollection 2021.
2
Genetic factors increase the identification efficiency of predictive models for dyslipidaemia: a prospective cohort study.遗传因素提高血脂异常预测模型识别效率的研究:一项前瞻性队列研究。
Lipids Health Dis. 2021 Feb 12;20(1):11. doi: 10.1186/s12944-021-01439-3.
3
Identifying the predictive effectiveness of a genetic risk score for incident hypertension using machine learning methods among populations in rural China.利用机器学习方法在中国农村人群中识别预测高血压事件的遗传风险评分的预测有效性。
Hypertens Res. 2021 Nov;44(11):1483-1491. doi: 10.1038/s41440-021-00738-7. Epub 2021 Sep 3.
4
Nonlaboratory-based risk assessment model for coronary heart disease screening: Model development and validation.用于冠心病筛查的非基于实验室的风险评估模型:模型开发与验证
Int J Med Inform. 2022 Mar 18;162:104746. doi: 10.1016/j.ijmedinf.2022.104746.
5
Machine learning for characterizing risk of type 2 diabetes mellitus in a rural Chinese population: the Henan Rural Cohort Study.基于中国农村人群的机器学习特征分析 2 型糖尿病风险:河南农村队列研究。
Sci Rep. 2020 Mar 10;10(1):4406. doi: 10.1038/s41598-020-61123-x.
6
Enhanced osteoporotic fracture prediction in postmenopausal women using Bayesian optimization of machine learning models with genetic risk score.使用具有遗传风险评分的机器学习模型的贝叶斯优化增强绝经后女性骨质疏松性骨折预测
J Bone Miner Res. 2024 May 2;39(4):462-472. doi: 10.1093/jbmr/zjae025.
7
Predicting the risk of emergency admission with machine learning: Development and validation using linked electronic health records.使用机器学习预测急诊入院风险:基于电子健康记录的开发和验证。
PLoS Med. 2018 Nov 20;15(11):e1002695. doi: 10.1371/journal.pmed.1002695. eCollection 2018 Nov.
8
Prevalence and characteristics of alcohol consumption and risk of type 2 diabetes mellitus in rural China.中国农村地区饮酒流行情况及其与 2 型糖尿病发病风险的关系。
BMC Public Health. 2021 Sep 9;21(1):1644. doi: 10.1186/s12889-021-11681-0.
9
Lifestyle Score and Genetic Factors With Hypertension and Blood Pressure Among Adults in Rural China.中国农村成年人的生活方式评分与高血压及血压的遗传因素。
Front Public Health. 2021 Aug 17;9:687174. doi: 10.3389/fpubh.2021.687174. eCollection 2021.
10
An early prediction model for type 2 diabetes mellitus based on genetic variants and nongenetic risk factors in a Han Chinese cohort.基于汉族队列中遗传变异和非遗传风险因素的 2 型糖尿病早期预测模型。
Front Endocrinol (Lausanne). 2023 Oct 25;14:1279450. doi: 10.3389/fendo.2023.1279450. eCollection 2023.

引用本文的文献

1
Evaluation of Machine Learning and Traditional Statistical Models to Assess the Value of Stroke Genetic Liability for Prediction of Risk of Stroke Within the UK Biobank.评估机器学习和传统统计模型以评估中风遗传易感性在英国生物银行中对中风风险预测的价值。
Healthcare (Basel). 2025 Apr 26;13(9):1003. doi: 10.3390/healthcare13091003.
2
Incorporating polygenic risk into the Leicester Risk Assessment score for 10-year risk prediction of type 2 diabetes.将多基因风险纳入莱斯特风险评估评分,以预测 10 年内 2 型糖尿病的发病风险。
Diabetes Metab Syndr. 2024 Apr;18(4):102996. doi: 10.1016/j.dsx.2024.102996. Epub 2024 Mar 29.
3

本文引用的文献

1
The Association of Body Fat Percentage With Hypertension in a Chinese Rural Population: The Henan Rural Cohort Study.中国农村人群体脂百分比与高血压的关联:河南农村队列研究
Front Public Health. 2020 Mar 20;8:70. doi: 10.3389/fpubh.2020.00070. eCollection 2020.
2
Machine learning for characterizing risk of type 2 diabetes mellitus in a rural Chinese population: the Henan Rural Cohort Study.基于中国农村人群的机器学习特征分析 2 型糖尿病风险:河南农村队列研究。
Sci Rep. 2020 Mar 10;10(1):4406. doi: 10.1038/s41598-020-61123-x.
3
Global and regional estimates and projections of diabetes-related health expenditure: Results from the International Diabetes Federation Diabetes Atlas, 9th edition.
Genetic risk scores: are they important for diabetes management? results from multiple cross-sectional studies.
遗传风险评分:它们对糖尿病管理重要吗?多项横断面研究的结果
Diabetol Metab Syndr. 2023 Nov 10;15(1):227. doi: 10.1186/s13098-023-01204-9.
4
A scoping review of artificial intelligence-based methods for diabetes risk prediction.基于人工智能的糖尿病风险预测方法的范围综述。
NPJ Digit Med. 2023 Oct 25;6(1):197. doi: 10.1038/s41746-023-00933-5.
5
Machine Learning in Nutrition Research.机器学习在营养研究中的应用。
Adv Nutr. 2022 Dec 22;13(6):2573-2589. doi: 10.1093/advances/nmac103.
6
Machine learning for diabetes clinical decision support: a review.用于糖尿病临床决策支持的机器学习:综述
Adv Comput Intell. 2022;2(2):22. doi: 10.1007/s43674-022-00034-y. Epub 2022 Apr 13.
7
HNF1A Mutations and Beta Cell Dysfunction in Diabetes.HNF1A 突变与糖尿病中的β细胞功能障碍。
Int J Mol Sci. 2022 Mar 16;23(6):3222. doi: 10.3390/ijms23063222.
全球及各区域糖尿病相关卫生支出估计和预测:国际糖尿病联盟糖尿病地图集第 9 版结果。
Diabetes Res Clin Pract. 2020 Apr;162:108072. doi: 10.1016/j.diabres.2020.108072. Epub 2020 Feb 13.
4
Prevalence and Influencing Factors of Coronary Heart Disease and Stroke in Chinese Rural Adults: The Henan Rural Cohort Study.中国农村成年人冠心病和脑卒中的患病率及影响因素:河南农村队列研究
Front Public Health. 2020 Jan 21;7:411. doi: 10.3389/fpubh.2019.00411. eCollection 2019.
5
The Play of Genes and Non-genetic Factors on Type 2 Diabetes.基因与非遗传因素在2型糖尿病中的作用
Front Public Health. 2019 Nov 19;7:349. doi: 10.3389/fpubh.2019.00349. eCollection 2019.
6
Machine Learning Prediction of Mortality and Hospitalization in Heart Failure With Preserved Ejection Fraction.机器学习预测射血分数保留的心力衰竭患者的死亡率和住院率。
JACC Heart Fail. 2020 Jan;8(1):12-21. doi: 10.1016/j.jchf.2019.06.013. Epub 2019 Oct 9.
7
Building Risk Prediction Models for Type 2 Diabetes Using Machine Learning Techniques.利用机器学习技术构建 2 型糖尿病风险预测模型。
Prev Chronic Dis. 2019 Sep 19;16:E130. doi: 10.5888/pcd16.190109.
8
Machine Learning for Health Services Researchers.机器学习在卫生服务研究中的应用。
Value Health. 2019 Jul;22(7):808-815. doi: 10.1016/j.jval.2019.02.012.
9
Cohort Profile: The Henan Rural Cohort: a prospective study of chronic non-communicable diseases.队列简介:河南农村队列:一项慢性非传染性疾病的前瞻性研究。
Int J Epidemiol. 2019 Dec 1;48(6):1756-1756j. doi: 10.1093/ije/dyz039.
10
Prediction of antibiotic resistance in Escherichia coli from large-scale pan-genome data.从大规模泛基因组数据预测大肠杆菌的抗生素耐药性。
PLoS Comput Biol. 2018 Dec 14;14(12):e1006258. doi: 10.1371/journal.pcbi.1006258. eCollection 2018 Dec.