Suppr超能文献

利用肠道微生物群谱进行多种疾病预测的多标签分类研究

Toward Multilabel Classification for Multiple Disease Prediction Using Gut Microbiota Profiles.

作者信息

Huang Zhi-An, Hu Pengwei, Hu Lun, You Zhu-Hong, Tan Kay Chen, Huang Yu-An

出版信息

IEEE Trans Neural Netw Learn Syst. 2024 Sep 12;PP. doi: 10.1109/TNNLS.2024.3453967.

Abstract

Advancements in high-throughput technologies have yielded large-scale human gut microbiota profiles, sparking considerable interest in exploring the relationship between the gut microbiome and complex human diseases. Through extracting and integrating knowledge from complex microbiome data, existing machine learning (ML)-based studies have demonstrated their effectiveness in the precise identification of high-risk individuals. However, these approaches struggle to address the heterogeneity and sparsity of microbial features and explore the intrinsic relatedness among human diseases. In this work, we reframe human gut microbiome-based disease detection as a multilabel classification (MLC) problem and integrate a range of innovative techniques within the proposed MLC framework, aptly named GutMLC. Specifically, the entity semantic similarity as priori knowledge is incorporated into multilabel feature selection and loss functions by capturing the shared attributes and inherent associations among diseases and microbes. To tackle the issue of label imbalance, both within and between labels, we adapt the focal loss (FL) function for MLC using debiased inverse weighting. Extensive experiment results consistently demonstrate the competitive performance of GutMLC in comparison with commonly used MLC and single-label classification (SLC) algorithms. This work seeks to unlock the potential of gut microbiota as robust biomarkers for multiple disease prediction.

摘要

高通量技术的进步产生了大规模的人类肠道微生物群图谱,引发了人们对探索肠道微生物组与复杂人类疾病之间关系的浓厚兴趣。通过从复杂的微生物组数据中提取和整合知识,现有的基于机器学习(ML)的研究已经证明了它们在精确识别高危个体方面的有效性。然而,这些方法难以解决微生物特征的异质性和稀疏性问题,也难以探索人类疾病之间的内在关联性。在这项工作中,我们将基于人类肠道微生物组的疾病检测重新定义为一个多标签分类(MLC)问题,并在所提出的MLC框架(恰当地命名为GutMLC)中整合了一系列创新技术。具体而言,通过捕捉疾病和微生物之间的共享属性和内在关联,将实体语义相似性作为先验知识纳入多标签特征选择和损失函数中。为了解决标签内和标签间的不平衡问题,我们使用去偏逆加权对MLC的焦点损失(FL)函数进行了调整。大量实验结果一致表明,与常用的MLC和单标签分类(SLC)算法相比,GutMLC具有竞争性能。这项工作旨在挖掘肠道微生物群作为多种疾病预测可靠生物标志物的潜力。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验