• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用非配对临床和基因数据对2型糖尿病进行迁移学习预测。

Transfer learning prediction of type 2 diabetes with unpaired clinical and genetic data.

作者信息

Jung YounSung, Han SeanKyo, Kang EunHee, Park SoYoung, Kim MinHee, Kim NanHee, Ahn TaeJin

机构信息

Department of Life Science, Handong Global University, Pohang, Republic of Korea.

Division of Endocrinology and Metabolism, Department of Internal Medicine, Korea University Ansan Hospital, Ansan, Republic of Korea.

出版信息

Sci Rep. 2025 Jul 29;15(1):27695. doi: 10.1038/s41598-025-05532-w.

DOI:10.1038/s41598-025-05532-w
PMID:40730802
Abstract

The prevalence of type 2 diabetes mellitus (T2DM) in Korea has risen in recent years, yet many cases remain undiagnosed. Advanced artificial intelligence models using multi-modal data have shown promise in disease prediction, but two major challenges persist: the scarcity of samples containing all desired data modalities and class imbalance in T2DM datasets. We propose a novel transfer learning framework to predict T2DM onset within five years, using two Korean cohorts (KoGES and SNUH). To utilize unpaired multi-modal data, our approach transfers knowledge between clinical and genetic domains, leveraging unpaired clinical data alongside paired data. We also address class imbalance by applying a positively weighted binary cross-entropy (BCE) loss and a weighted random sampler (WRS). The transfer learning framework improved T2DM prediction performance. Using WRS and weighted BCE loss increased the model's balanced accuracy and AUC (achieving test AUC 0.8441). Furthermore, combining transfer learning with intermediate data fusion yielded even higher performance (test AUC 0.8715). These enhancements were achieved despite limited paired multi-modal samples. Our framework effectively handles scarce paired data and class imbalance, leading to improved T2DM risk prediction. This approach can be adapted to other medical prediction tasks and integrated with additional data modalities, potentially aiding earlier diagnosis and better disease management in clinical settings.

摘要

近年来,韩国2型糖尿病(T2DM)的患病率有所上升,但仍有许多病例未被诊断出来。使用多模态数据的先进人工智能模型在疾病预测方面显示出了前景,但仍存在两个主要挑战:包含所有所需数据模态的样本稀缺以及T2DM数据集中的类别不平衡。我们提出了一种新颖的迁移学习框架,利用两个韩国队列(KoGES和SNUH)来预测五年内T2DM的发病情况。为了利用未配对的多模态数据,我们的方法在临床和基因领域之间转移知识,利用未配对的临床数据以及配对数据。我们还通过应用正加权二元交叉熵(BCE)损失和加权随机采样器(WRS)来解决类别不平衡问题。迁移学习框架提高了T2DM的预测性能。使用WRS和加权BCE损失提高了模型的平衡准确率和AUC(测试AUC达到0.8441)。此外,将迁移学习与中间数据融合相结合产生了更高的性能(测试AUC为0.8715)。尽管配对的多模态样本有限,但仍实现了这些改进。我们的框架有效地处理了稀缺的配对数据和类别不平衡问题,从而改善了T2DM风险预测。这种方法可以适用于其他医学预测任务,并与其他数据模态集成,有可能在临床环境中帮助早期诊断和更好地管理疾病。

相似文献

1
Transfer learning prediction of type 2 diabetes with unpaired clinical and genetic data.利用非配对临床和基因数据对2型糖尿病进行迁移学习预测。
Sci Rep. 2025 Jul 29;15(1):27695. doi: 10.1038/s41598-025-05532-w.
2
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
3
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
4
A Responsible Framework for Assessing, Selecting, and Explaining Machine Learning Models in Cardiovascular Disease Outcomes Among People With Type 2 Diabetes: Methodology and Validation Study.用于评估、选择和解释2型糖尿病患者心血管疾病结局机器学习模型的责任框架:方法与验证研究
JMIR Med Inform. 2025 Jun 27;13:e66200. doi: 10.2196/66200.
5
Predictive modeling of complications arising from early-onset preeclampsia in pregnant women.早发型子痫前期孕妇并发症的预测模型
Womens Health (Lond). 2025 Jan-Dec;21:17455057251348978. doi: 10.1177/17455057251348978. Epub 2025 Jul 21.
6
AI-based Hepatic Steatosis Detection and Integrated Hepatic Assessment from Cardiac CT Attenuation Scans Enhances All-cause Mortality Risk Stratification: A Multi-center Study.基于人工智能的心脏CT衰减扫描检测肝脂肪变性及综合肝脏评估可增强全因死亡风险分层:一项多中心研究
medRxiv. 2025 Jun 11:2025.06.09.25329157. doi: 10.1101/2025.06.09.25329157.
7
Artificial intelligence for diagnosing exudative age-related macular degeneration.人工智能在渗出性年龄相关性黄斑变性诊断中的应用。
Cochrane Database Syst Rev. 2024 Oct 17;10(10):CD015522. doi: 10.1002/14651858.CD015522.pub2.
8
Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.稳定机器学习以获得可重复和可解释的结果:一种针对特定个体见解的新型验证方法。
Comput Methods Programs Biomed. 2025 Jun 21;269:108899. doi: 10.1016/j.cmpb.2025.108899.
9
Can a Liquid Biopsy Detect Circulating Tumor DNA With Low-passage Whole-genome Sequencing in Patients With a Sarcoma? A Pilot Evaluation.液体活检能否通过低深度全基因组测序检测肉瘤患者的循环肿瘤DNA?一项初步评估。
Clin Orthop Relat Res. 2025 Jan 1;483(1):39-48. doi: 10.1097/CORR.0000000000003161. Epub 2024 Jun 21.
10
A deep learning approach to direct immunofluorescence pattern recognition in autoimmune bullous diseases.深度学习方法在自身免疫性大疱性疾病中的直接免疫荧光模式识别。
Br J Dermatol. 2024 Jul 16;191(2):261-266. doi: 10.1093/bjd/ljae142.

本文引用的文献

1
Monitoring individualized glucose levels predicts risk for bradycardia in type 2 diabetes patients with chronic kidney disease: a pilot study.监测个体化血糖水平可预测2型糖尿病合并慢性肾脏病患者发生心动过缓的风险:一项初步研究。
Sci Rep. 2024 Dec 5;14(1):30290. doi: 10.1038/s41598-024-81983-x.
2
Large language multimodal models for new-onset type 2 diabetes prediction using five-year cohort electronic health records.利用五年队列电子健康记录预测 2 型糖尿病新发病例的大型语言多模态模型。
Sci Rep. 2024 Sep 6;14(1):20774. doi: 10.1038/s41598-024-71020-2.
3
A comprehensive multi-task deep learning approach for predicting metabolic syndrome with genetic, nutritional, and clinical data.
一种用于利用遗传、营养和临床数据预测代谢综合征的综合多任务深度学习方法。
Sci Rep. 2024 Aug 1;14(1):17851. doi: 10.1038/s41598-024-68541-1.
4
Predicting type 2 diabetes via machine learning integration of multiple omics from human pancreatic islets.通过整合人类胰岛的多种组学的机器学习预测 2 型糖尿病。
Sci Rep. 2024 Jun 25;14(1):14637. doi: 10.1038/s41598-024-64846-3.
5
Advancing diabetes prediction with a progressive self-transfer learning framework for discrete time series data.利用渐进式自迁移学习框架提升糖尿病预测能力:离散时间序列数据分析。
Sci Rep. 2023 Nov 29;13(1):21044. doi: 10.1038/s41598-023-48463-0.
6
Prediction of type 2 diabetes using genome-wide polygenic risk score and metabolic profiles: A machine learning analysis of population-based 10-year prospective cohort study.基于全基因组多基因风险评分和代谢谱预测 2 型糖尿病:基于人群的 10 年前瞻性队列研究的机器学习分析。
EBioMedicine. 2022 Dec;86:104383. doi: 10.1016/j.ebiom.2022.104383. Epub 2022 Nov 30.
7
Early Prediction for Prediabetes and Type 2 Diabetes Using the Genetic Risk Score and Oxidative Stress Score.使用遗传风险评分和氧化应激评分对糖尿病前期和2型糖尿病进行早期预测。
Antioxidants (Basel). 2022 Jun 17;11(6):1196. doi: 10.3390/antiox11061196.
8
Diabetes Fact Sheet in Korea 2021.2021 年韩国糖尿病概况。
Diabetes Metab J. 2022 May;46(3):417-426. doi: 10.4093/dmj.2022.0106. Epub 2022 May 25.
9
Predicting Type 2 Diabetes Using Logistic Regression and Machine Learning Approaches.使用逻辑回归和机器学习方法预测 2 型糖尿病。
Int J Environ Res Public Health. 2021 Jul 9;18(14):7346. doi: 10.3390/ijerph18147346.
10
Machine learning for characterizing risk of type 2 diabetes mellitus in a rural Chinese population: the Henan Rural Cohort Study.基于中国农村人群的机器学习特征分析 2 型糖尿病风险:河南农村队列研究。
Sci Rep. 2020 Mar 10;10(1):4406. doi: 10.1038/s41598-020-61123-x.