• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习的2型糖尿病亚型分类研究。

Machine learning based study for the classification of Type 2 diabetes mellitus subtypes.

作者信息

Ordoñez-Guillen Nelson E, Gonzalez-Compean Jose Luis, Lopez-Arevalo Ivan, Contreras-Murillo Miguel, Aldana-Bobadilla Edwin

机构信息

Cinvestav Tamaulipas, Carretera Victoria-Soto la Marina km 5.5, Victoria, 87130, Tamaulipas, Mexico.

CONAHCYT-Centro de Investigación y de Estudios Avanzados del IPN, Unidad Tamaulipas, Carretera Victoria-Soto la Marina km 5.5, Victoria, Tamaulipas, 87130, Mexico.

出版信息

BioData Min. 2023 Aug 22;16(1):24. doi: 10.1186/s13040-023-00340-2.

DOI:10.1186/s13040-023-00340-2
PMID:37608329
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10463725/
Abstract

PURPOSE

Data-driven diabetes research has increased its interest in exploring the heterogeneity of the disease, aiming to support in the development of more specific prognoses and treatments within the so-called precision medicine. Recently, one of these studies found five diabetes subgroups with varying risks of complications and treatment responses. Here, we tackle the development and assessment of different models for classifying Type 2 Diabetes (T2DM) subtypes through machine learning approaches, with the aim of providing a performance comparison and new insights on the matter.

METHODS

We developed a three-stage methodology starting with the preprocessing of public databases NHANES (USA) and ENSANUT (Mexico) to construct a dataset with N = 10,077 adult diabetes patient records. We used N = 2,768 records for training/validation of models and left the remaining (N = 7,309) for testing. In the second stage, groups of observations -each one representing a T2DM subtype- were identified. We tested different clustering techniques and strategies and validated them by using internal and external clustering indices; obtaining two annotated datasets Dset A and Dset B. In the third stage, we developed different classification models assaying four algorithms, seven input-data schemes, and two validation settings on each annotated dataset. We also tested the obtained models using a majority-vote approach for classifying unseen patient records in the hold-out dataset.

RESULTS

From the independently obtained bootstrap validation for Dset A and Dset B, mean accuracies across all seven data schemes were [Formula: see text] ([Formula: see text]) and [Formula: see text] ([Formula: see text]), respectively. Best accuracies were [Formula: see text] and [Formula: see text]. Both validation setting results were consistent. For the hold-out dataset, results were consonant with most of those obtained in the literature in terms of class proportions.

CONCLUSION

The development of machine learning systems for the classification of diabetes subtypes constitutes an important task to support physicians for fast and timely decision-making. We expect to deploy this methodology in a data analysis platform to conduct studies for identifying T2DM subtypes in patient records from hospitals.

摘要

目的

数据驱动的糖尿病研究对探索该疾病的异质性兴趣日增,旨在为所谓的精准医学中更具针对性的预后和治疗方法的开发提供支持。最近,其中一项研究发现了五个糖尿病亚组,其并发症风险和治疗反应各不相同。在此,我们通过机器学习方法来处理用于对2型糖尿病(T2DM)亚型进行分类的不同模型的开发和评估,目的是提供性能比较并就此问题给出新的见解。

方法

我们开发了一种三阶段方法,首先对美国国家健康与营养检查调查(NHANES)和墨西哥全国健康与营养状况调查(ENSANUT)等公共数据库进行预处理,以构建一个包含N = 10,077条成年糖尿病患者记录的数据集。我们使用N = 2,768条记录进行模型的训练/验证,其余(N = 7,309条)用于测试。在第二阶段,确定了观察组,每个观察组代表一种T2DM亚型。我们测试了不同的聚类技术和策略,并通过使用内部和外部聚类指标对其进行验证;获得了两个带注释的数据集Dset A和Dset B。在第三阶段,我们开发了不同的分类模型,在每个带注释的数据集上测试四种算法、七种输入数据方案和两种验证设置。我们还使用多数投票方法对保留数据集中未见过的患者记录进行分类,以此测试所获得的模型。

结果

从对Dset A和Dset B独立获得的自助法验证结果来看,所有七种数据方案的平均准确率分别为[公式:见原文]([公式:见原文])和[公式:见原文]([公式:见原文])。最佳准确率分别为[公式:见原文]和[公式:见原文]。两种验证设置的结果一致。对于保留数据集,就类别比例而言,结果与文献中获得的大多数结果一致。

结论

开发用于糖尿病亚型分类的机器学习系统是一项重要任务,可为医生提供快速及时的决策支持。我们期望在数据分析平台中部署此方法,以开展研究来识别医院患者记录中的T2DM亚型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c977/10463725/cb0ac442b88e/13040_2023_340_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c977/10463725/b091220041ab/13040_2023_340_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c977/10463725/5a3da7c2c7fc/13040_2023_340_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c977/10463725/996e3b33ab2e/13040_2023_340_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c977/10463725/f4065db4055d/13040_2023_340_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c977/10463725/881fa6d51d3a/13040_2023_340_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c977/10463725/cb0ac442b88e/13040_2023_340_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c977/10463725/b091220041ab/13040_2023_340_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c977/10463725/5a3da7c2c7fc/13040_2023_340_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c977/10463725/996e3b33ab2e/13040_2023_340_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c977/10463725/f4065db4055d/13040_2023_340_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c977/10463725/881fa6d51d3a/13040_2023_340_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c977/10463725/cb0ac442b88e/13040_2023_340_Fig6_HTML.jpg

相似文献

1
Machine learning based study for the classification of Type 2 diabetes mellitus subtypes.基于机器学习的2型糖尿病亚型分类研究。
BioData Min. 2023 Aug 22;16(1):24. doi: 10.1186/s13040-023-00340-2.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
A convolutional neural network with self-attention for fully automated metabolic tumor volume delineation of head and neck cancer in [Formula: see text]F]FDG PET/CT.一种具有自注意力机制的卷积神经网络,用于在 [Formula: see text]F]FDG PET/CT 中全自动勾画头颈部癌症的代谢肿瘤体积。
Eur J Nucl Med Mol Imaging. 2023 Jul;50(9):2751-2766. doi: 10.1007/s00259-023-06197-1. Epub 2023 Apr 20.
4
Identifying prostate cancer and its clinical risk in asymptomatic men using machine learning of high dimensional peripheral blood flow cytometric natural killer cell subset phenotyping data.利用高维外周血细胞流式自然杀伤细胞亚群表型数据的机器学习识别无症状男性中的前列腺癌及其临床风险。
Elife. 2020 Jul 28;9:e50936. doi: 10.7554/eLife.50936.
5
Single-trial extraction of event-related potentials (ERPs) and classification of visual stimuli by ensemble use of discrete wavelet transform with Huffman coding and machine learning techniques.使用离散小波变换结合霍夫曼编码和机器学习技术的集合对事件相关电位(ERPs)进行单试提取和视觉刺激分类。
J Neuroeng Rehabil. 2023 Jun 2;20(1):70. doi: 10.1186/s12984-023-01179-8.
6
Comparison of different feature extraction methods for applicable automated ICD coding.不同特征提取方法在适用的自动化 ICD 编码中的比较。
BMC Med Inform Decis Mak. 2022 Jan 12;22(1):11. doi: 10.1186/s12911-022-01753-5.
7
Enhancing classification accuracy of fNIRS-BCI using features acquired from vector-based phase analysis.利用基于向量的相位分析获取的特征提高 fNIRS-BCI 的分类精度。
J Neural Eng. 2020 Oct 15;17(5):056025. doi: 10.1088/1741-2552/abb417.
8
A filter approach for feature selection in classification: application to automatic atrial fibrillation detection in electrocardiogram recordings.一种用于分类特征选择的滤波器方法:在心电图记录中自动检测心房颤动的应用。
BMC Med Inform Decis Mak. 2021 May 4;21(Suppl 4):130. doi: 10.1186/s12911-021-01427-8.
9
Data-driven modeling and prediction of blood glucose dynamics: Machine learning applications in type 1 diabetes.基于数据驱动的血糖动力学建模与预测:机器学习在 1 型糖尿病中的应用。
Artif Intell Med. 2019 Jul;98:109-134. doi: 10.1016/j.artmed.2019.07.007. Epub 2019 Jul 26.
10
Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?预测模型工具能否识别 ACL 重建术后阿片类药物使用时间延长的高风险患者?
Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.

本文引用的文献

1
Detecting Sarcopenia Risk by Diabetes Clustering: A Japanese Prospective Cohort Study.通过糖尿病聚类检测肌肉减少症风险:一项日本前瞻性队列研究。
J Clin Endocrinol Metab. 2022 Sep 28;107(10):2729-2736. doi: 10.1210/clinem/dgac430.
2
Machine learning algorithm to evaluate risk factors of diabetic foot ulcers and its severity.机器学习算法评估糖尿病足溃疡及其严重程度的危险因素。
Med Biol Eng Comput. 2022 Aug;60(8):2349-2357. doi: 10.1007/s11517-022-02617-w. Epub 2022 Jun 25.
3
Classification of painful or painless diabetic peripheral neuropathy and identification of the most powerful predictors using machine learning models in large cross-sectional cohorts.
利用机器学习模型对大型横断面队列中疼痛性或无痛性糖尿病周围神经病变进行分类,并确定最强预测因子。
BMC Med Inform Decis Mak. 2022 May 29;22(1):144. doi: 10.1186/s12911-022-01890-x.
4
Heterogeneity in phenotype, disease progression and drug response in type 2 diabetes.2 型糖尿病表型、疾病进展和药物反应的异质性。
Nat Med. 2022 May;28(5):982-988. doi: 10.1038/s41591-022-01790-7. Epub 2022 May 9.
5
A Comprehensive Review of Various Diabetic Prediction Models: A Literature Survey.各种糖尿病预测模型的综合综述:文献调查。
J Healthc Eng. 2022 Apr 12;2022:8100697. doi: 10.1155/2022/8100697. eCollection 2022.
6
Differences in the prevalence of erectile dysfunction between novel subgroups of recent-onset diabetes.新近发病糖尿病亚组之间勃起功能障碍的患病率差异。
Diabetologia. 2022 Mar;65(3):552-562. doi: 10.1007/s00125-021-05607-z. Epub 2021 Nov 20.
7
Heterogeneity of Diabetes: β-Cells, Phenotypes, and Precision Medicine: Proceedings of an International Symposium of the Canadian Institutes of Health Research's Institute of Nutrition, Metabolism and Diabetes and the U.S. National Institutes of Health's National Institute of Diabetes and Digestive and Kidney Diseases.糖尿病的异质性:β 细胞、表型和精准医学:加拿大卫生研究院营养、代谢与糖尿病研究所和美国国立卫生研究院国家糖尿病、消化和肾脏疾病研究所的国际研讨会论文集。
Diabetes Care. 2022 Jan 1;45(1):3-22. doi: 10.2337/dci21-0051.
8
Prevalence Trends of Diabetes Subgroups in the United States: A Data-driven Analysis Spanning Three Decades From NHANES (1988-2018).美国糖尿病亚组患病率趋势:NHANES(1988-2018)跨越三十年的基于数据的分析。
J Clin Endocrinol Metab. 2022 Feb 17;107(3):735-742. doi: 10.1210/clinem/dgab762.
9
Validation of the classification for type 2 diabetes into five subgroups: a report from the ORIGIN trial.2型糖尿病分为五个亚组的分类验证:来自ORIGIN试验的报告。
Diabetologia. 2022 Jan;65(1):206-215. doi: 10.1007/s00125-021-05567-4. Epub 2021 Oct 21.
10
Artificial intelligence and diabetes technology: A review.人工智能与糖尿病技术:综述。
Metabolism. 2021 Nov;124:154872. doi: 10.1016/j.metabol.2021.154872. Epub 2021 Sep 1.