• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过整合代谢组学和基于树的提升方法来增强 2 型糖尿病预测。

Enhancing type 2 diabetes mellitus prediction by integrating metabolomics and tree-based boosting approaches.

机构信息

Department of Biostatistics and Medical Informatics, Faculty of Medicine, Inonu University, Malatya, Türkiye.

Central Labs, King Khalid University, Abha, Saudi Arabia.

出版信息

Front Endocrinol (Lausanne). 2024 Nov 11;15:1444282. doi: 10.3389/fendo.2024.1444282. eCollection 2024.

DOI:10.3389/fendo.2024.1444282
PMID:39588339
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11586166/
Abstract

BACKGROUND

Type 2 diabetes mellitus (T2DM) is a global health problem characterized by insulin resistance and hyperglycemia. Early detection and accurate prediction of T2DM is crucial for effective management and prevention. This study explores the integration of machine learning (ML) and explainable artificial intelligence (XAI) approaches based on metabolomics panel data to identify biomarkers and develop predictive models for T2DM.

METHODS

Metabolomics data from T2DM (n = 31) and healthy controls (n = 34) were analyzed for biomarker discovery (mostly amino acids, fatty acids, and purines) and T2DM prediction. Feature selection was performed using the least absolute shrinkage and selection operator (LASSO) regression to enhance the model's accuracy and interpretability. Advanced three tree-based ML algorithms (KTBoost: Kernel-Tree Boosting; XGBoost: eXtreme Gradient Boosting; NGBoost: Natural Gradient Boosting) were employed to predict T2DM using these biomarkers. The SHapley Additive exPlanations (SHAP) method was used to explain the effects of metabolomics biomarkers on the prediction of the model.

RESULTS

The study identified multiple metabolites associated with T2DM, where LASSO feature selection highlighted important biomarkers. KTBoost [Accuracy: 0.938; CI: (0.880-0.997), Sensitivity: 0.971; CI: (0.847-0.999), Area under the Curve (AUC): 0.965; CI: (0.937-0.994)] demonstrated its effectiveness in using complex metabolomics data for T2DM prediction and achieved better performance than other models. According to KTBoost's SHAP, high levels of phenylactate (pla) and taurine metabolites, as well as low concentrations of cysteine, laspartate, and lcysteate, are strongly associated with the presence of T2DM.

CONCLUSION

The integration of metabolomics profiling and XAI offers a promising approach to predicting T2DM. The use of tree-based algorithms, in particular KTBoost, provides a robust framework for analyzing complex datasets and improves the prediction accuracy of T2DM onset. Future research should focus on validating these biomarkers and models in larger, more diverse populations to solidify their clinical utility.

摘要

背景

2 型糖尿病(T2DM)是一种全球性健康问题,其特征为胰岛素抵抗和高血糖。早期发现和准确预测 T2DM 对于有效管理和预防至关重要。本研究探讨了基于代谢组学面板数据的机器学习(ML)和可解释人工智能(XAI)方法的整合,以识别生物标志物并开发 T2DM 预测模型。

方法

对 T2DM(n=31)和健康对照组(n=34)的代谢组学数据进行分析,以发现生物标志物(主要为氨基酸、脂肪酸和嘌呤)并预测 T2DM。使用最小绝对收缩和选择算子(LASSO)回归进行特征选择,以提高模型的准确性和可解释性。使用三种先进的基于树的 ML 算法(KTBoost:核树增强;XGBoost:极端梯度增强;NGBoost:自然梯度增强),使用这些生物标志物预测 T2DM。使用 SHapley Additive exPlanations(SHAP)方法解释代谢组学生物标志物对模型预测的影响。

结果

本研究确定了多个与 T2DM 相关的代谢物,其中 LASSO 特征选择突出了重要的生物标志物。KTBoost[准确性:0.938;置信区间(0.880-0.997);敏感性:0.971;置信区间(0.847-0.999);曲线下面积(AUC):0.965;置信区间(0.937-0.994)]在使用复杂的代谢组学数据预测 T2DM 方面表现出有效性,并且优于其他模型。根据 KTBoost 的 SHAP,高苯丙氨酸(pla)和牛磺酸代谢物水平以及低半胱氨酸、天冬氨酸和 L-半胱氨酸浓度与 T2DM 的存在密切相关。

结论

代谢组学分析和 XAI 的整合为预测 T2DM 提供了一种很有前途的方法。基于树的算法,特别是 KTBoost 的使用,为分析复杂数据集提供了一个强大的框架,并提高了 T2DM 发病预测的准确性。未来的研究应集中在验证这些生物标志物和模型在更大、更多样化的人群中的有效性,以巩固其临床实用性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45e9/11586166/01cead23ab29/fendo-15-1444282-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45e9/11586166/237761841733/fendo-15-1444282-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45e9/11586166/e195a6cecc5a/fendo-15-1444282-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45e9/11586166/766b480df27a/fendo-15-1444282-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45e9/11586166/5db9e3b4994c/fendo-15-1444282-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45e9/11586166/01cead23ab29/fendo-15-1444282-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45e9/11586166/237761841733/fendo-15-1444282-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45e9/11586166/e195a6cecc5a/fendo-15-1444282-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45e9/11586166/766b480df27a/fendo-15-1444282-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45e9/11586166/5db9e3b4994c/fendo-15-1444282-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45e9/11586166/01cead23ab29/fendo-15-1444282-g005.jpg

相似文献

1
Enhancing type 2 diabetes mellitus prediction by integrating metabolomics and tree-based boosting approaches.通过整合代谢组学和基于树的提升方法来增强 2 型糖尿病预测。
Front Endocrinol (Lausanne). 2024 Nov 11;15:1444282. doi: 10.3389/fendo.2024.1444282. eCollection 2024.
2
Proposed Comprehensive Methodology Integrated with Explainable Artificial Intelligence for Prediction of Possible Biomarkers in Metabolomics Panel of Plasma Samples for Breast Cancer Detection.结合可解释人工智能的拟议综合方法,用于预测血浆样本代谢组学面板中乳腺癌检测的潜在生物标志物。
Medicina (Kaunas). 2025 Mar 25;61(4):581. doi: 10.3390/medicina61040581.
3
Platelet Metabolites as Candidate Biomarkers in Sepsis Diagnosis and Management Using the Proposed Explainable Artificial Intelligence Approach.使用所提出的可解释人工智能方法,血小板代谢物作为脓毒症诊断和管理中的候选生物标志物。
J Clin Med. 2024 Aug 23;13(17):5002. doi: 10.3390/jcm13175002.
4
Machine Learning Models Integrating Dietary Indicators Improve the Prediction of Progression from Prediabetes to Type 2 Diabetes Mellitus.整合饮食指标的机器学习模型改善了对糖尿病前期进展为2型糖尿病的预测。
Nutrients. 2025 Mar 8;17(6):947. doi: 10.3390/nu17060947.
5
Machine learning algorithms for diabetic kidney disease risk predictive model of Chinese patients with type 2 diabetes mellitus.用于中国2型糖尿病患者糖尿病肾病风险预测模型的机器学习算法
Ren Fail. 2025 Dec;47(1):2486558. doi: 10.1080/0886022X.2025.2486558. Epub 2025 Apr 7.
6
Untargeted Lipidomic Biomarkers for Liver Cancer Diagnosis: A Tree-Based Machine Learning Model Enhanced by Explainable Artificial Intelligence.用于肝癌诊断的非靶向脂质组学生物标志物:一种由可解释人工智能增强的基于树的机器学习模型。
Medicina (Kaunas). 2025 Feb 26;61(3):405. doi: 10.3390/medicina61030405.
7
Interpretable machine learning method to predict the risk of pre-diabetes using a national-wide cross-sectional data: evidence from CHNS.利用全国性横断面数据预测糖尿病前期风险的可解释机器学习方法:来自中国健康与营养调查的证据
BMC Public Health. 2025 Mar 26;25(1):1145. doi: 10.1186/s12889-025-22419-7.
8
Identification of novel hypertension biomarkers using explainable AI and metabolomics.使用可解释人工智能和代谢组学鉴定新型高血压生物标志物。
Metabolomics. 2024 Nov 3;20(6):124. doi: 10.1007/s11306-024-02182-3.
9
Pilot-Study to Explore Metabolic Signature of Type 2 Diabetes: A Pipeline of Tree-Based Machine Learning and Bioinformatics Techniques for Biomarkers Discovery.探索 2 型糖尿病代谢特征的初步研究:基于树的机器学习和生物信息学技术的生物标志物发现管道。
Nutrients. 2024 May 20;16(10):1537. doi: 10.3390/nu16101537.
10
Predicting dry matter intake in cattle at scale using gradient boosting regression techniques and Gaussian process boosting regression with Shapley additive explanation explainable artificial intelligence, MLflow, and its containerization.使用梯度提升回归技术以及带有夏普利值加法解释的高斯过程提升回归、可解释人工智能、MLflow及其容器化来大规模预测牛的干物质摄入量。
J Anim Sci. 2025 Jan 4;103. doi: 10.1093/jas/skaf041.

引用本文的文献

1
Emerging Insights into the Relationship Between Amino Acid Metabolism and Diabetic Cardiomyopathy.对氨基酸代谢与糖尿病性心肌病之间关系的新见解
Biomolecules. 2025 Jun 22;15(7):916. doi: 10.3390/biom15070916.
2
Illuminating diabetes multi-omics: Unraveling disease mechanisms and advancing personalized therapy.解读糖尿病多组学:揭示疾病机制与推进个性化治疗
World J Diabetes. 2025 Jul 15;16(7):106218. doi: 10.4239/wjd.v16.i7.106218.
3
On Selecting Robust Approaches for Learning Predictive Biomarkers in Metabolomics Data Sets.
关于选择稳健方法以在代谢组学数据集中学习预测性生物标志物
Anal Chem. 2025 Jun 24;97(24):12669-12678. doi: 10.1021/acs.analchem.5c01049. Epub 2025 Jun 12.