• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种使用定制的异构特征子集进行2型糖尿病诊断和预后的机器学习方法。

A machine learning approach for type 2 diabetes diagnosis and prognosis using tailored heterogeneous feature subsets.

作者信息

Navarro-Cerdán J Ramón, Pons-Suñer Pedro, Arnal Laura, Arlandis Joaquim, Llobet Rafael, Perez-Cortes Juan-Carlos, Lara-Hernández Francisco, Moya-Valera Celeste, Quiroz-Rodriguez Maria Elena, Rojo-Martinez Gemma, Valdés Sergio, Montanya Eduard, Calle-Pascual Alfonso L, Franch-Nadal Josep, Delgado Elias, Castaño Luis, García-García Ana-Bárbara, Chaves Felipe Javier

机构信息

Universitat Politècnica de València, Camí de Vera, s/n, 46022, València, Spain.

ITI, Universitat Politècnica de València, Camino de Vera s/n, 46022, València, Spain.

出版信息

Med Biol Eng Comput. 2025 Apr 8. doi: 10.1007/s11517-025-03355-5.

DOI:10.1007/s11517-025-03355-5
PMID:40198441
Abstract

Type 2 diabetes (T2D) is becoming one of the leading health problems in Western societies, diminishing quality of life and consuming a significant share of healthcare resources. This study presents machine learning models for T2D diagnosis and prognosis, developed using heterogeneous data from a Spanish population dataset (Di@bet.es study). The models were trained exclusively on individuals classified as controls and undiagnosed diabetics, ensuring that the results are not influenced by treatment effects or behavioral changes due to disease awareness. Two data domains are considered: environmental (patient lifestyle questionnaires and measurements) and clinical (biochemical and anthropometric measurements). The preprocessing pipeline consists of four key steps: geospatial data extraction, feature engineering, missing data imputation, and quasi-constancy filtering. Two working scenarios (Environmental and Healthcare) are defined based on the features used, and applied to two targets (diagnosis and prognosis), resulting in four distinct models. The feature subsets that best predict the target have been identified based on permutation importance and sequential backward selection, reducing the number of features and, consequently, the cost of predictions. In the Environmental scenario, models achieved an AUROC of 0.86 for diagnosis and 0.82 for prognosis. The Healthcare scenario performed better, with an AUROC of 0.96 for diagnosis and 0.88 for prognosis. A partial dependence analysis of the most relevant features is also presented. An online demo page showcasing the Environmental and Healthcare T2D prognosis models is available upon request.

摘要

2型糖尿病(T2D)正成为西方社会主要的健康问题之一,它降低了生活质量,并消耗了大量医疗资源。本研究提出了用于T2D诊断和预后的机器学习模型,这些模型是使用来自西班牙人群数据集(Di@bet.es研究)的异构数据开发的。这些模型仅在被归类为对照和未确诊糖尿病患者的个体上进行训练,以确保结果不受治疗效果或疾病认知导致的行为变化的影响。考虑了两个数据领域:环境数据(患者生活方式问卷和测量数据)和临床数据(生化和人体测量数据)。预处理管道包括四个关键步骤:地理空间数据提取、特征工程、缺失数据插补和准恒定性过滤。根据所使用的特征定义了两种工作场景(环境场景和医疗场景),并将其应用于两个目标(诊断和预后),从而产生了四个不同的模型。基于排列重要性和顺序向后选择,确定了最能预测目标的特征子集,减少了特征数量,从而降低了预测成本。在环境场景中,模型诊断的曲线下面积(AUROC)为0.86,预后的AUROC为0.82。医疗场景表现更好,诊断的AUROC为0.96,预后的AUROC为0.88。还对最相关特征进行了部分依赖分析。如有需要,可提供展示环境和医疗T2D预后模型的在线演示页面。

相似文献

1
A machine learning approach for type 2 diabetes diagnosis and prognosis using tailored heterogeneous feature subsets.一种使用定制的异构特征子集进行2型糖尿病诊断和预后的机器学习方法。
Med Biol Eng Comput. 2025 Apr 8. doi: 10.1007/s11517-025-03355-5.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
4
Does the Presence of Missing Data Affect the Performance of the SORG Machine-learning Algorithm for Patients With Spinal Metastasis? Development of an Internet Application Algorithm.缺失数据的存在是否会影响 SORG 机器学习算法在脊柱转移瘤患者中的性能?开发一种互联网应用算法。
Clin Orthop Relat Res. 2024 Jan 1;482(1):143-157. doi: 10.1097/CORR.0000000000002706. Epub 2023 Jun 12.
5
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
6
Automated feature learning and survival prognostication in grade 4 glioma using supervised machine learning models.使用监督式机器学习模型对四级胶质瘤进行自动特征学习和生存预后分析。
J Neurooncol. 2025 Jun 16. doi: 10.1007/s11060-025-05099-6.
7
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of topotecan for ovarian cancer.拓扑替康治疗卵巢癌的临床有效性和成本效益的快速系统评价。
Health Technol Assess. 2001;5(28):1-110. doi: 10.3310/hta5280.
8
Genetic determinants of testicular sperm extraction outcomes: insights from a large multicentre study of men with non-obstructive azoospermia.睾丸精子提取结果的遗传决定因素:来自一项针对非梗阻性无精子症男性的大型多中心研究的见解
Hum Reprod Open. 2025 Aug 29;2025(3):hoaf049. doi: 10.1093/hropen/hoaf049. eCollection 2025.
9
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.对紫杉醇、多西他赛、吉西他滨和长春瑞滨在非小细胞肺癌中的临床疗效和成本效益进行的快速系统评价。
Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.
10
Development of Machine Learning-based Algorithms to Predict the 2- and 5-year Risk of TKA After Tibial Plateau Fracture Treatment.基于机器学习的算法用于预测胫骨平台骨折治疗后2年和5年全膝关节置换风险的研究进展
Clin Orthop Relat Res. 2025 Mar 12. doi: 10.1097/CORR.0000000000003442.

本文引用的文献

1
Combination of Multiple Low-Risk Lifestyle Behaviors and Incident Type 2 Diabetes: A Systematic Review and Dose-Response Meta-analysis of Prospective Cohort Studies.多种低风险生活方式行为与2型糖尿病发病:前瞻性队列研究的系统评价和剂量反应荟萃分析
Diabetes Care. 2023 Mar 1;46(3):643-656. doi: 10.2337/dc22-1024.
2
Gamma-glutamyl transferase to high-density lipoprotein cholesterol ratio: A valuable predictor of type 2 diabetes mellitus incidence.谷氨酰转移酶与高密度脂蛋白胆固醇比值:预测 2 型糖尿病发病的有价值指标。
Front Endocrinol (Lausanne). 2022 Sep 29;13:1026791. doi: 10.3389/fendo.2022.1026791. eCollection 2022.
3
To explore association between gamma-glutamyl transferase and type 2 diabetes using a real-world study and mendelian randomization analysis.
利用真实世界研究和孟德尔随机化分析探讨γ-谷氨酰转移酶与 2 型糖尿病的相关性。
Front Endocrinol (Lausanne). 2022 Jul 25;13:899008. doi: 10.3389/fendo.2022.899008. eCollection 2022.
4
Developing a simple and practical decision model to predict the risk of incident type 2 diabetes among the general population: The Di@bet.es Study.开发一个简单实用的决策模型,以预测普通人群中 2 型糖尿病事件风险:Di@bet.es 研究。
Eur J Intern Med. 2022 Aug;102:80-87. doi: 10.1016/j.ejim.2022.05.005. Epub 2022 May 13.
5
Anthropometric and adiposity indicators and risk of type 2 diabetes: systematic review and dose-response meta-analysis of cohort studies.人体测量学和肥胖指标与 2 型糖尿病风险:队列研究的系统评价和剂量反应荟萃分析。
BMJ. 2022 Jan 18;376:e067516. doi: 10.1136/bmj-2021-067516.
6
Machine learning and deep learning predictive models for type 2 diabetes: a systematic review.用于2型糖尿病的机器学习和深度学习预测模型:一项系统综述
Diabetol Metab Syndr. 2021 Dec 20;13(1):148. doi: 10.1186/s13098-021-00767-9.
7
The links between sleep duration, obesity and type 2 diabetes mellitus.睡眠时间、肥胖与 2 型糖尿病之间的联系。
J Endocrinol. 2021 Dec 13;252(2):125-141. doi: 10.1530/JOE-21-0155.
8
Confounding by Socioeconomic Status in Epidemiological Studies of Air Pollution and Health: Challenges and Opportunities.《空气污染与健康的流行病学研究中的社会经济地位混杂:挑战与机遇》
Environ Health Perspect. 2021 Jun;129(6):65001. doi: 10.1289/EHP7980. Epub 2021 Jun 14.
9
HDL-C and Cardiovascular Risk: You Don't Need to Worry about Extremely High HDL-C Levels.高密度脂蛋白胆固醇与心血管风险:无需担忧高密度脂蛋白胆固醇水平极高的情况。
J Lipid Atheroscler. 2021 Jan;10(1):57-61. doi: 10.12997/jla.2021.10.1.57. Epub 2021 Jan 18.
10
A multi-class classification model for supporting the diagnosis of type II diabetes mellitus.一种支持II型糖尿病诊断的多分类模型。
PeerJ. 2020 Sep 10;8:e9920. doi: 10.7717/peerj.9920. eCollection 2020.