• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于识别阿扎尔队列人群中糖尿病发病相关危险因素的人工智能生存模型。

Artificial intelligence survival models for identifying relevant risk factors for incident diabetes in Azar cohort population.

作者信息

Gilani Neda, Somi Mohammadhossein, Hamidi Farzaneh, Santaguida Pasqualina, Faramarzi Elnaz, Arabi Belaghi Reza

机构信息

Department of Statistics and Epidemiology, Faculty of Health, Tabriz University of Medical Sciences, Tabriz, Iran.

Liver and Gastrointestinal Diseases Research Center, Tabriz University of Medical Sciences, Tabriz, Iran.

出版信息

Health Promot Perspect. 2025 May 6;15(1):82-92. doi: 10.34172/hpp.025.43105. eCollection 2025 May.

DOI:10.34172/hpp.025.43105
PMID:40453689
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12125507/
Abstract

BACKGROUND

This study aimed to identify some risk factors associated with time to diabetes type II events using artificial intelligence (AI) survival models (SM) in a population cohort from East Azerbaijan, Iran.

METHODS

Data from Azar-Cohort spanning from 2014 to 2020 was analyzed using the random forest (RF) variable selection method along with Cox regression to identify the most relevant risk factors associated with diabetes. We then developed prediction models using RF survival analysis. Lasso-variable selection and RF variable selection were used to select the most important variables. The concordance index (C-index) was used to evaluate the concordance of the prediction models.

RESULTS

Our LASSO-Cox regression identified six factors to be significantly associated with diabetes: age, mean corpuscular hemoglobin concentration (MCHC), waist circumference (WC), body mass index (BMI), use of sleep medication, and hypertension stage 1 and stage 2. The model included all variables with a C-index of 76.3%. In contrast, the RF analysis identified 21 important variables predicting a higher probability of having diabetes. Of those, WC, MCHC, triglyceride, and age were the most important predictors of diabetes. The RF model converged after 500 trees with an out-of-bag (OOB) of 0.28 and a C-index of 79.5%.

CONCLUSION

RF machine learning algorithms and LASSO-Cox regression analyses consistently identified WC, hypertension, and MCHC as the main risk factors for developing diabetes. The RF approach demonstrated slightly better accuracy in predicting the likelihood of diabetes at different time points.

摘要

背景

本研究旨在利用人工智能(AI)生存模型(SM),在伊朗东阿塞拜疆省的人群队列中,确定与II型糖尿病发病时间相关的一些风险因素。

方法

使用随机森林(RF)变量选择方法和Cox回归分析2014年至2020年阿扎尔队列的数据,以确定与糖尿病最相关的风险因素。然后,我们使用RF生存分析开发预测模型。采用套索变量选择和RF变量选择来选择最重要的变量。一致性指数(C指数)用于评估预测模型的一致性。

结果

我们的套索Cox回归确定了六个与糖尿病显著相关的因素:年龄、平均红细胞血红蛋白浓度(MCHC)、腰围(WC)、体重指数(BMI)、睡眠药物使用情况以及1期和2期高血压。该模型纳入了所有变量,C指数为76.3%。相比之下,RF分析确定了21个预测患糖尿病概率较高的重要变量。其中,WC、MCHC、甘油三酯和年龄是糖尿病最重要的预测因素。RF模型在500棵树后收敛,袋外估计值(OOB)为0.28,C指数为79.5%。

结论

RF机器学习算法和套索Cox回归分析一致确定WC、高血压和MCHC是患糖尿病的主要风险因素。RF方法在预测不同时间点患糖尿病的可能性方面表现出略高的准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a09f/12125507/c95f7a32c471/hpp-15-82-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a09f/12125507/ff27488b50b8/hpp-15-82-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a09f/12125507/5d2970aca3a7/hpp-15-82-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a09f/12125507/c18ec0654588/hpp-15-82-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a09f/12125507/d6c80951071a/hpp-15-82-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a09f/12125507/c95f7a32c471/hpp-15-82-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a09f/12125507/ff27488b50b8/hpp-15-82-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a09f/12125507/5d2970aca3a7/hpp-15-82-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a09f/12125507/c18ec0654588/hpp-15-82-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a09f/12125507/d6c80951071a/hpp-15-82-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a09f/12125507/c95f7a32c471/hpp-15-82-g005.jpg

相似文献

1
Artificial intelligence survival models for identifying relevant risk factors for incident diabetes in Azar cohort population.用于识别阿扎尔队列人群中糖尿病发病相关危险因素的人工智能生存模型。
Health Promot Perspect. 2025 May 6;15(1):82-92. doi: 10.34172/hpp.025.43105. eCollection 2025 May.
2
Predicting recurrent gestational diabetes mellitus using artificial intelligence models: a retrospective cohort study.利用人工智能模型预测复发性妊娠期糖尿病:一项回顾性队列研究。
Arch Gynecol Obstet. 2024 Sep;310(3):1621-1630. doi: 10.1007/s00404-024-07551-w. Epub 2024 Jul 30.
3
Artificial intelligence in clinical care amidst COVID-19 pandemic: A systematic review.COVID-19大流行期间临床护理中的人工智能:一项系统综述。
Comput Struct Biotechnol J. 2021;19:2833-2850. doi: 10.1016/j.csbj.2021.05.010. Epub 2021 May 7.
4
Artificial intelligence for predicting survival following deceased donor liver transplantation: Retrospective multi-center study.人工智能预测脑死亡供肝移植术后患者的生存情况:回顾性多中心研究。
Int J Surg. 2022 Sep;105:106838. doi: 10.1016/j.ijsu.2022.106838. Epub 2022 Aug 24.
5
Interpretable machine learning method to predict the risk of pre-diabetes using a national-wide cross-sectional data: evidence from CHNS.利用全国性横断面数据预测糖尿病前期风险的可解释机器学习方法:来自中国健康与营养调查的证据
BMC Public Health. 2025 Mar 26;25(1):1145. doi: 10.1186/s12889-025-22419-7.
6
A comparative study of forest methods for time-to-event data: variable selection and predictive performance.森林方法在生存时间数据中的比较研究:变量选择和预测性能。
BMC Med Res Methodol. 2021 Sep 25;21(1):193. doi: 10.1186/s12874-021-01386-8.
7
Development and validation of a new diagnostic prediction model for NAFLD based on machine learning algorithms in NHANES 2017-2020.3.基于2017 - 2020年美国国家健康与营养检查调查(NHANES)中机器学习算法的非酒精性脂肪性肝病(NAFLD)新诊断预测模型的开发与验证。
Hormones (Athens). 2025 Feb 13. doi: 10.1007/s42000-025-00634-6.
8
Prediction of Incident Diabetes in the Jackson Heart Study Using High-Dimensional Machine Learning.利用高维机器学习预测杰克逊心脏研究中的新发糖尿病
PLoS One. 2016 Oct 11;11(10):e0163942. doi: 10.1371/journal.pone.0163942. eCollection 2016.
9
[Body mass index, waist circumference and waist-to-height ratio associated with the incidence of type 2 diabetes mellitus: a cohort study].[体重指数、腰围及腰高比与2型糖尿病发病率的相关性:一项队列研究]
Zhonghua Yu Fang Yi Xue Za Zhi. 2016 Apr;50(4):328-33. doi: 10.3760/cma.j.issn.0253-9624.2016.04.009.
10
TRM: a powerful two-stage machine learning approach for identifying SNP-SNP interactions.TRM:一种用于识别单核苷酸多态性(SNP)-SNP相互作用的强大的两阶段机器学习方法。
Ann Hum Genet. 2012 Jan;76(1):53-62. doi: 10.1111/j.1469-1809.2011.00692.x. Epub 2011 Dec 11.

本文引用的文献

1
Hypertriglyceridemic Waist Phenotype and Lipid Accumulation Product: Two Comprehensive Obese Indicators of Waist Circumference and Triglyceride to Predict Type 2 Diabetes Mellitus in Chinese Population.高甘油三酯血症性腰围表型和脂质蓄积产物:腰围和甘油三酯的两个综合肥胖指标,用于预测中国人群 2 型糖尿病。
J Diabetes Res. 2020 Dec 2;2020:9157430. doi: 10.1155/2020/9157430. eCollection 2020.
2
Prevalence of elevated liver enzymes and its association with type 2 diabetes: A cross-sectional study in Bangladeshi adults.肝酶升高的患病率及其与2型糖尿病的关联:一项针对孟加拉国成年人的横断面研究。
Endocrinol Diabetes Metab. 2020 Feb 12;3(2):e00116. doi: 10.1002/edm2.116. eCollection 2020 Apr.
3
The Dose-Response Relationship between Gamma-Glutamyl Transferase and Risk of Diabetes Mellitus Using Publicly Available Data: A Longitudinal Study in Japan.
利用公开数据研究γ-谷氨酰转移酶与糖尿病风险之间的剂量反应关系:日本的一项纵向研究
Int J Endocrinol. 2020 Feb 21;2020:5356498. doi: 10.1155/2020/5356498. eCollection 2020.
4
Epidemiology of diabetes mellitus, pre-diabetes, undiagnosed and uncontrolled diabetes in Central Iran: results from Yazd health study.伊朗中部地区糖尿病、糖尿病前期、未诊断和未控制糖尿病的流行病学:亚兹德健康研究结果。
BMC Public Health. 2020 Feb 3;20(1):166. doi: 10.1186/s12889-020-8267-y.
5
Classification and prediction of diabetes disease using machine learning paradigm.使用机器学习范式对糖尿病疾病进行分类和预测。
Health Inf Sci Syst. 2020 Jan 3;8(1):7. doi: 10.1007/s13755-019-0095-z. eCollection 2020 Dec.
6
Waist circumference trajectories and risk of type 2 diabetes mellitus in Korean population: the Korean genome and epidemiology study (KoGES).腰围变化轨迹与韩国人群 2 型糖尿病发病风险:韩国基因组与流行病学研究(KoGES)
BMC Public Health. 2019 Jun 13;19(1):741. doi: 10.1186/s12889-019-7077-6.
7
The Role of Inflammation in Diabetes: Current Concepts and Future Perspectives.炎症在糖尿病中的作用:当前概念与未来展望
Eur Cardiol. 2019 Apr;14(1):50-59. doi: 10.15420/ecr.2018.33.1.
8
Predicting Diabetes Mellitus With Machine Learning Techniques.运用机器学习技术预测糖尿病
Front Genet. 2018 Nov 6;9:515. doi: 10.3389/fgene.2018.00515. eCollection 2018.
9
Cohort Profile: The AZAR cohort, a health-oriented research model in areas of major environmental change in Central Asia.队列简介:阿扎尔队列,中亚主要环境变化地区的一种以健康为导向的研究模式。
Int J Epidemiol. 2019 Apr 1;48(2):382-382h. doi: 10.1093/ije/dyy215.
10
The PERSIAN Cohort: Providing the Evidence Needed for Healthcare Reform.波斯队列研究:为医疗改革提供所需证据。
Arch Iran Med. 2017 Nov 1;20(11):691-695.