• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习的 2 型糖尿病亚型可重现预测。

Machine learning-based reproducible prediction of type 2 diabetes subtypes.

机构信息

Department of Diabetes, Endocrinology, and Metabolism, Fukushima Medical University School of Medicine, Fukushima, Japan.

Department of Diabetes, Metabolism and Endocrinology, Tohoku University Graduate School of Medicine, Miyagi, Japan.

出版信息

Diabetologia. 2024 Nov;67(11):2446-2458. doi: 10.1007/s00125-024-06248-8. Epub 2024 Aug 21.

DOI:10.1007/s00125-024-06248-8
PMID:39168869
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11519166/
Abstract

AIMS/HYPOTHESIS: Clustering-based subclassification of type 2 diabetes, which reflects pathophysiology and genetic predisposition, is a promising approach for providing personalised and effective therapeutic strategies. Ahlqvist's classification is currently the most vigorously validated method because of its superior ability to predict diabetes complications but it does not have strong consistency over time and requires HOMA2 indices, which are not routinely available in clinical practice and standard cohort studies. We developed a machine learning (ML) model to classify individuals with type 2 diabetes into Ahlqvist's subtypes consistently over time.

METHODS

Cohort 1 dataset comprised 619 Japanese individuals with type 2 diabetes who were divided into training and test sets for ML models in a 7:3 ratio. Cohort 2 dataset, comprising 597 individuals with type 2 diabetes, was used for external validation. Participants were pre-labelled (T2D) by unsupervised k-means clustering based on Ahlqvist's variables (age at diagnosis, BMI, HbA, HOMA2-B and HOMA2-IR) to four subtypes: severe insulin-deficient diabetes (SIDD), severe insulin-resistant diabetes (SIRD), mild obesity-related diabetes (MOD) and mild age-related diabetes (MARD). We adopted 15 variables for a multiclass classification random forest (RF) algorithm to predict type 2 diabetes subtypes (T2D). The proximity matrix computed by RF was visualised using a uniform manifold approximation and projection. Finally, we used a putative subset with missing insulin-related variables to test the predictive performance of the validation cohort, consistency of subtypes over time and prediction ability of diabetes complications.

RESULTS

T2D demonstrated a 94% accuracy for predicting T2D type 2 diabetes subtypes (AUCs ≥0.99 and F1 score [an indicator calculated by harmonic mean from precision and recall] ≥0.9) and retained the predictive performance in the external validation cohort (86.3%). T2D showed an accuracy of 82.9% for detecting T2D, also in a putative subset with missing insulin-related variables, when used with an imputation algorithm. In Kaplan-Meier analysis, the diabetes clusters of T2D demonstrated distinct accumulation risks of diabetic retinopathy in SIDD and that of chronic kidney disease in SIRD during a median observation period of 11.6 (4.5-18.3) years, similarly to the subtypes using T2D. The predictive accuracy was improved after excluding individuals with low predictive probability, who were categorised as an 'undecidable' cluster. T2D, after excluding undecidable individuals, showed higher consistency (100% for SIDD, 68.6% for SIRD, 94.4% for MOD and 97.9% for MARD) than T2D.

CONCLUSIONS/INTERPRETATION: The new ML model for predicting Ahlqvist's subtypes of type 2 diabetes has great potential for application in clinical practice and cohort studies because it can classify individuals with missing HOMA2 indices and predict glycaemic control, diabetic complications and treatment outcomes with long-term consistency by using readily available variables. Future studies are needed to assess whether our approach is applicable to research and/or clinical practice in multiethnic populations.

摘要

目的/假设:基于聚类的 2 型糖尿病亚分类,反映了病理生理学和遗传易感性,是提供个性化和有效的治疗策略的有前途的方法。由于其预测糖尿病并发症的能力较强,Ahlqvist 分类目前是最有力的验证方法,但它在时间上一致性不强,且需要 HOMA2 指数,而这些在临床实践和标准队列研究中并不常用。我们开发了一种机器学习 (ML) 模型,可以随着时间的推移始终如一地将 2 型糖尿病患者分为 Ahlqvist 亚型。

方法

队列 1 数据集包含 619 名日本 2 型糖尿病患者,他们被分为训练集和测试集,用于 ML 模型的比例为 7:3。队列 2 数据集包含 597 名 2 型糖尿病患者,用于外部验证。参与者根据 Ahlqvist 的变量(诊断时的年龄、BMI、HbA、HOMA2-B 和 HOMA2-IR)通过无监督的 k-均值聚类预先标记为(T2D),分为 4 种亚型:严重胰岛素缺乏型糖尿病(SIDD)、严重胰岛素抵抗型糖尿病(SIRD)、轻度肥胖相关型糖尿病(MOD)和轻度年龄相关型糖尿病(MARD)。我们采用了 15 个变量进行多类分类随机森林 (RF) 算法,以预测 2 型糖尿病亚型(T2D)。由 RF 计算的接近度矩阵使用一致流形逼近和投影进行可视化。最后,我们使用一个具有缺失胰岛素相关变量的假定子集来测试验证队列的预测性能、亚型随时间的一致性以及糖尿病并发症的预测能力。

结果

T2D 对预测 2 型糖尿病亚型(AUC≥0.99,F1 得分[由精度和召回率的调和平均值计算得出]≥0.9)的准确率为 94%,并在外部验证队列中保留了预测性能(86.3%)。T2D 在使用缺失胰岛素相关变量的假定子集中的准确率为 82.9%,用于检测 T2D,当与一个插补算法一起使用时。在 Kaplan-Meier 分析中,T2D 的糖尿病聚类在 SIDD 中显示出糖尿病视网膜病变的明显累积风险,在 SIRD 中显示出慢性肾脏病的风险,在中位数为 11.6(4.5-18.3)年的观察期间,与使用 T2D 的亚型相似。在排除低预测概率的个体(归类为“不可判定”聚类)后,预测准确性得到提高。在排除不可判定个体后,T2D 的一致性更高(SIDD 为 100%,SIRD 为 68.6%,MOD 为 94.4%,MARD 为 97.9%)。

结论/解释:用于预测 2 型糖尿病 Ahlqvist 亚型的新 ML 模型在临床实践和队列研究中具有很大的应用潜力,因为它可以对缺失 HOMA2 指数的个体进行分类,并使用现成的变量长期保持一致性地预测血糖控制、糖尿病并发症和治疗结果。需要进一步研究以评估我们的方法是否适用于多民族人群的研究和/或临床实践。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be56/11519166/baa532856137/125_2024_6248_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be56/11519166/a3c81a8053bb/125_2024_6248_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be56/11519166/a176c9c29377/125_2024_6248_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be56/11519166/0e0f58ece910/125_2024_6248_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be56/11519166/e387637412de/125_2024_6248_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be56/11519166/baa532856137/125_2024_6248_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be56/11519166/a3c81a8053bb/125_2024_6248_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be56/11519166/a176c9c29377/125_2024_6248_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be56/11519166/0e0f58ece910/125_2024_6248_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be56/11519166/e387637412de/125_2024_6248_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be56/11519166/baa532856137/125_2024_6248_Fig5_HTML.jpg

相似文献

1
Machine learning-based reproducible prediction of type 2 diabetes subtypes.基于机器学习的 2 型糖尿病亚型可重现预测。
Diabetologia. 2024 Nov;67(11):2446-2458. doi: 10.1007/s00125-024-06248-8. Epub 2024 Aug 21.
2
Replication and cross-validation of type 2 diabetes subtypes based on clinical variables: an IMI-RHAPSODY study.基于临床变量的 2 型糖尿病亚型的复制和验证:一项 IMI-RHAPSODY 研究。
Diabetologia. 2021 Sep;64(9):1982-1989. doi: 10.1007/s00125-021-05490-8. Epub 2021 Jun 10.
3
Assessing reproducibility and utility of clustering of patients with type 2 diabetes and established CV disease (SAVOR -TIMI 53 trial).评估 2 型糖尿病合并已确诊心血管疾病患者的聚类的重现性和实用性(SAVOR-TIMI 53 试验)。
PLoS One. 2021 Nov 19;16(11):e0259372. doi: 10.1371/journal.pone.0259372. eCollection 2021.
4
Clinical application of cluster analysis in patients with newly diagnosed type 2 diabetes.聚类分析在新诊断2型糖尿病患者中的临床应用
Hormones (Athens). 2025 Mar;24(1):109-122. doi: 10.1007/s42000-024-00593-4. Epub 2024 Sep 4.
5
Type 2 diabetes classification: a data-driven cluster study of the Danish Centre for Strategic Research in Type 2 Diabetes (DD2) cohort.2型糖尿病分类:基于丹麦2型糖尿病战略研究中心(DD2)队列的数据驱动聚类研究
BMJ Open Diabetes Res Care. 2022 Apr;10(2). doi: 10.1136/bmjdrc-2021-002731.
6
Risk of diabetes-associated diseases in subgroups of patients with recent-onset diabetes: a 5-year follow-up study.近期诊断糖尿病患者亚组中糖尿病相关疾病的风险:一项为期 5 年的随访研究。
Lancet Diabetes Endocrinol. 2019 Sep;7(9):684-694. doi: 10.1016/S2213-8587(19)30187-1. Epub 2019 Jul 22.
7
Rates of diabetic retinopathy among cluster analysis-identified type 2 diabetic mellitus subgroups.聚类分析确定的2型糖尿病亚组中的糖尿病视网膜病变发生率。
Graefes Arch Clin Exp Ophthalmol. 2024 Feb;262(2):411-419. doi: 10.1007/s00417-023-06260-5. Epub 2023 Oct 16.
8
Validation of the Swedish Diabetes Re-Grouping Scheme in Adult-Onset Diabetes in China.瑞典糖尿病重新分组方案在中国成人发病型糖尿病中的验证
J Clin Endocrinol Metab. 2020 Oct 1;105(10). doi: 10.1210/clinem/dgaa524.
9
The efficacy of canagliflozin in diabetes subgroups stratified by data-driven clustering or a supervised machine learning method: a post hoc analysis of canagliflozin clinical trial data.基于数据驱动聚类或有监督机器学习方法的糖尿病亚组中卡格列净的疗效:卡格列净临床试验数据的事后分析。
Diabetologia. 2022 Sep;65(9):1424-1435. doi: 10.1007/s00125-022-05748-9. Epub 2022 Jul 8.
10
Subtypes of type 2 diabetes and their association with outcomes in Korean adults - A cluster analysis of community-based prospective cohort.2 型糖尿病亚型及其与韩国成年人结局的关系——基于社区的前瞻性队列聚类分析。
Metabolism. 2023 Apr;141:155514. doi: 10.1016/j.metabol.2023.155514. Epub 2023 Feb 4.

引用本文的文献

1
Development and Validation of DIANA (Diabetes Novel Subgroup Assessment tool): A web-based precision medicine tool to determine type 2 diabetes endotype membership and predict individuals at risk of microvascular disease.DIANA(糖尿病新型亚组评估工具)的开发与验证:一种基于网络的精准医学工具,用于确定2型糖尿病的内型归属并预测微血管疾病风险个体。
PLOS Digit Health. 2025 Aug 5;4(8):e0000702. doi: 10.1371/journal.pdig.0000702. eCollection 2025 Aug.
2
Recognising, quantifying and accounting for classification uncertainty in type 2 diabetes subtypes.识别、量化并说明2型糖尿病亚型中的分类不确定性。
Diabetologia. 2025 Jul 25. doi: 10.1007/s00125-025-06486-4.
3

本文引用的文献

1
Trajectories of clinical characteristics, complications and treatment choices in data-driven subgroups of type 2 diabetes.基于数据驱动的 2 型糖尿病亚组的临床特征、并发症和治疗选择的轨迹。
Diabetologia. 2024 Jul;67(7):1343-1355. doi: 10.1007/s00125-024-06147-y. Epub 2024 Apr 16.
2
Exploring the genetic diversity of the Japanese population: Insights from a large-scale whole genome sequencing analysis.探索日本人群的遗传多样性:大规模全基因组测序分析的启示。
PLoS Genet. 2023 Dec 7;19(12):e1010625. doi: 10.1371/journal.pgen.1010625. eCollection 2023 Dec.
3
Artificial intelligence in endocrinology: a comprehensive review.
Energy landscape analysis of health checkup data clarified multiple pathways to diabetes development in obese and non-obese subjects.
健康检查数据的能量景观分析阐明了肥胖和非肥胖受试者患糖尿病的多种途径。
Front Endocrinol (Lausanne). 2025 May 6;16:1576431. doi: 10.3389/fendo.2025.1576431. eCollection 2025.
4
The research progress and future directions in the pathophysiological mechanisms of type 2 diabetes mellitus from the perspective of precision medicine.从精准医学角度看2型糖尿病病理生理机制的研究进展与未来方向
Front Med (Lausanne). 2025 Mar 5;12:1555077. doi: 10.3389/fmed.2025.1555077. eCollection 2025.
内分泌学中的人工智能:全面综述。
J Endocrinol Invest. 2024 May;47(5):1067-1082. doi: 10.1007/s40618-023-02235-9. Epub 2023 Nov 16.
4
Precision subclassification of type 2 diabetes: a systematic review.2型糖尿病的精准亚分类:一项系统综述
Commun Med (Lond). 2023 Oct 5;3(1):138. doi: 10.1038/s43856-023-00360-3.
5
Machine learning approach to stratify complex heterogeneity of chronic heart failure: A report from the CHART-2 study.机器学习方法分析慢性心力衰竭的复杂异质性:来自 CHART-2 研究的报告。
ESC Heart Fail. 2023 Jun;10(3):1597-1604. doi: 10.1002/ehf2.14288. Epub 2023 Feb 14.
6
2. Classification and Diagnosis of Diabetes: Standards of Care in Diabetes-2023.2. 糖尿病的分类和诊断:2023 年糖尿病护理标准。
Diabetes Care. 2023 Jan 1;46(Suppl 1):S19-S40. doi: 10.2337/dc23-S002.
7
Management of Hyperglycemia in Type 2 Diabetes, 2022. A Consensus Report by the American Diabetes Association (ADA) and the European Association for the Study of Diabetes (EASD).2022 年美国糖尿病协会 (ADA) 和欧洲糖尿病研究协会 (EASD) 共识报告:2 型糖尿病患者高血糖管理。
Diabetes Care. 2022 Nov 1;45(11):2753-2786. doi: 10.2337/dci22-0034.
8
Phenotypic and genetic classification of diabetes.糖尿病的表型和基因型分类。
Diabetologia. 2022 Nov;65(11):1758-1769. doi: 10.1007/s00125-022-05769-4. Epub 2022 Aug 12.
9
A roadmap to achieve pharmacological precision medicine in diabetes.实现糖尿病药理学精准医学的路线图。
Diabetologia. 2022 Nov;65(11):1830-1838. doi: 10.1007/s00125-022-05732-3. Epub 2022 Jun 24.
10
Genome-wide association analyses highlight etiological differences underlying newly defined subtypes of diabetes.全基因组关联分析突出了新定义的糖尿病亚型背后的病因差异。
Nat Genet. 2021 Nov;53(11):1534-1542. doi: 10.1038/s41588-021-00948-2. Epub 2021 Nov 4.