• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

微软变压器:一种用于使用宏基因组数据进行疾病预测的多级融合表格变压器。

MSFT-transformer: a multistage fusion tabular transformer for disease prediction using metagenomic data.

作者信息

Wang Ning, Wu Minghui, Gu Wenchao, Dai Chenglong, Shao Zongru, Subbalakshmi K P

机构信息

School of Artificial Intelligence and Computer Science, Jiangnan University, Wuxi 214121, Jiangsu, China.

Silicon Austria Labs, Linz, Austria.

出版信息

Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf217.

DOI:10.1093/bib/bbaf217
PMID:40370098
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12078939/
Abstract

More and more recent studies highlight the crucial role of the human microbiome in maintaining health, while modern advancements in metagenomic sequencing technologies have been accumulating data that are associated with human diseases. Although metagenomic data offer rich, multifaceted information, including taxonomic and functional abundance profiles, their full potential remains underutilized, as most approaches rely only on one type of information to discover and understand their related correlations with respect to disease occurrences. To address this limitation, we propose a multistage fusion tabular transformer architecture (MSFT-Transformer), aiming to effectively integrate various types of high-dimensional tabular information extracted from metagenomic data. Its multistage fusion strategy consists of three modules: a fusion-aware feature extraction module in the early stage to improve the extracted information from inputs, an alignment-enhanced fusion module in the mid stage to enforce the retainment of desired information in cross-modal learning, and an integrated feature decision layer in the late stage to incorporate desired cross-modal information. We conduct extensive experiments to evaluate the performance of MSFT-Transformer over state-of-the-art models on five standard datasets. Our results indicate that MSFT-Transformer provides stable performance gains with reduced computational costs. An ablation study illustrates the contributions of all three models compared with a reference multistage fusion transformer without these novel strategies. The result analysis implies the significant potential of the proposed model in future disease prediction with metagenomic data.

摘要

越来越多的近期研究强调了人类微生物组在维持健康方面的关键作用,而宏基因组测序技术的现代进展一直在积累与人类疾病相关的数据。尽管宏基因组数据提供了丰富、多方面的信息,包括分类学和功能丰度概况,但它们的全部潜力仍未得到充分利用,因为大多数方法仅依赖于一种类型的信息来发现和理解它们与疾病发生的相关关系。为了解决这一局限性,我们提出了一种多阶段融合表格变换器架构(MSFT-Transformer),旨在有效整合从宏基因组数据中提取的各种类型的高维表格信息。其多阶段融合策略由三个模块组成:早期的融合感知特征提取模块,用于改进从输入中提取的信息;中期的对齐增强融合模块,用于在跨模态学习中强制保留所需信息;后期的集成特征决策层,用于纳入所需的跨模态信息。我们进行了广泛的实验,以评估MSFT-Transformer在五个标准数据集上相对于现有模型的性能。我们的结果表明,MSFT-Transformer在降低计算成本的同时提供了稳定的性能提升。一项消融研究说明了与没有这些新颖策略的参考多阶段融合变换器相比,所有三个模型的贡献。结果分析表明,所提出的模型在未来利用宏基因组数据进行疾病预测方面具有巨大潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b97b/12078939/ccc6538ac202/bbaf217f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b97b/12078939/95be949fb855/bbaf217f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b97b/12078939/82bf8cc2a0f4/bbaf217f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b97b/12078939/5fc595d6b2e0/bbaf217f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b97b/12078939/ccc6538ac202/bbaf217f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b97b/12078939/95be949fb855/bbaf217f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b97b/12078939/82bf8cc2a0f4/bbaf217f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b97b/12078939/5fc595d6b2e0/bbaf217f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b97b/12078939/ccc6538ac202/bbaf217f4.jpg

相似文献

1
MSFT-transformer: a multistage fusion tabular transformer for disease prediction using metagenomic data.微软变压器:一种用于使用宏基因组数据进行疾病预测的多级融合表格变压器。
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf217.
2
Accurate identification of bacteriophages from metagenomic data using Transformer.基于 Transformer 从宏基因组数据中准确识别噬菌体
Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac258.
3
SwinCross: Cross-modal Swin transformer for head-and-neck tumor segmentation in PET/CT images.SwinCross:用于 PET/CT 图像中头颈部肿瘤分割的跨模态 Swin 变换器。
Med Phys. 2024 Mar;51(3):2096-2107. doi: 10.1002/mp.16703. Epub 2023 Sep 30.
4
TLTNet: A novel transscale cascade layered transformer network for enhanced retinal blood vessel segmentation.TLTNet:一种新颖的跨尺度级联分层Transformer 网络,用于增强视网膜血管分割。
Comput Biol Med. 2024 Aug;178:108773. doi: 10.1016/j.compbiomed.2024.108773. Epub 2024 Jun 25.
5
Missing-modality enabled multi-modal fusion architecture for medical data.用于医学数据的缺失模态启用多模态融合架构。
J Biomed Inform. 2025 Apr;164:104796. doi: 10.1016/j.jbi.2025.104796. Epub 2025 Feb 21.
6
MACTFusion: Lightweight Cross Transformer for Adaptive Multimodal Medical Image Fusion.MACTFusion:用于自适应多模态医学图像融合的轻量级交叉变换器
IEEE J Biomed Health Inform. 2025 May;29(5):3317-3328. doi: 10.1109/JBHI.2024.3391620. Epub 2025 May 6.
7
Feature selection with vector-symbolic architectures: a case study on microbial profiles of shotgun metagenomic samples of colorectal cancer.基于向量符号架构的特征选择:以结直肠癌鸟枪法宏基因组样本的微生物图谱为例
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf177.
8
CAIM: coverage-based analysis for identification of microbiome.CAIM:基于覆盖度的微生物组分析方法。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae424.
9
Machine Learning Meta-analysis of Large Metagenomic Datasets: Tools and Biological Insights.大型宏基因组数据集的机器学习荟萃分析:工具与生物学见解
PLoS Comput Biol. 2016 Jul 11;12(7):e1004977. doi: 10.1371/journal.pcbi.1004977. eCollection 2016 Jul.
10
IDMIL: an alignment-free Interpretable Deep Multiple Instance Learning (MIL) for predicting disease from whole-metagenomic data.IDMIL:一种无对齐的可解释深度多重实例学习(MIL)方法,用于从全宏基因组数据预测疾病。
Bioinformatics. 2020 Jul 1;36(Suppl_1):i39-i47. doi: 10.1093/bioinformatics/btaa477.

本文引用的文献

1
Incorporating metabolic activity, taxonomy and community structure to improve microbiome-based predictive models for host phenotype prediction.将代谢活性、分类学和群落结构纳入其中,以改进基于微生物组的预测模型,从而预测宿主表型。
Gut Microbes. 2024 Jan-Dec;16(1):2302076. doi: 10.1080/19490976.2024.2302076. Epub 2024 Jan 12.
2
Shared-Specific Feature Learning With Bottleneck Fusion Transformer for Multi-Modal Whole Slide Image Analysis.基于瓶颈融合 Transformer 的共享特定特征学习在多模态全切片图像分析中的应用。
IEEE Trans Med Imaging. 2023 Nov;42(11):3374-3383. doi: 10.1109/TMI.2023.3287256. Epub 2023 Oct 27.
3
Multimodal Learning With Transformers: A Survey.
基于Transformer的多模态学习:一项综述。
IEEE Trans Pattern Anal Mach Intell. 2023 Oct;45(10):12113-12132. doi: 10.1109/TPAMI.2023.3275156. Epub 2023 Sep 5.
4
Indoleacrylic acid produced by Parabacteroides distasonis alleviates type 2 diabetes via activation of AhR to repair intestinal barrier.拟杆菌属产生的吲哚丙烯酸通过激活 AhR 修复肠道屏障来缓解 2 型糖尿病。
BMC Biol. 2023 Apr 18;21(1):90. doi: 10.1186/s12915-023-01578-2.
5
Multimodal biomedical AI.多模态生物医学人工智能。
Nat Med. 2022 Sep;28(9):1773-1784. doi: 10.1038/s41591-022-01981-2. Epub 2022 Sep 15.
6
EnsDeepDP: An Ensemble Deep Learning Approach for Disease Prediction Through Metagenomics.EnsDeepDP:一种通过宏基因组学进行疾病预测的集成深度学习方法。
IEEE/ACM Trans Comput Biol Bioinform. 2023 Mar-Apr;20(2):986-998. doi: 10.1109/TCBB.2022.3201295. Epub 2023 Apr 3.
7
Plasma and urinary extracellular vesicle microRNAs and their related pathways in diabetic kidney disease.血浆和尿液细胞外囊泡 microRNAs 及其在糖尿病肾病中的相关通路。
Genomics. 2022 Jul;114(4):110407. doi: 10.1016/j.ygeno.2022.110407. Epub 2022 Jun 15.
8
MegaD: Deep Learning for Rapid and Accurate Disease Status Prediction of Metagenomic Samples.MegaD:用于宏基因组样本疾病状态快速准确预测的深度学习
Life (Basel). 2022 Apr 30;12(5):669. doi: 10.3390/life12050669.
9
Microbiome-based disease prediction with multimodal variational information bottlenecks.基于微生物组的多模态变分信息瓶颈疾病预测。
PLoS Comput Biol. 2022 Apr 11;18(4):e1010050. doi: 10.1371/journal.pcbi.1010050. eCollection 2022 Apr.
10
Human disease prediction from microbiome data by multiple feature fusion and deep learning.通过多特征融合和深度学习从微生物组数据预测人类疾病
iScience. 2022 Mar 16;25(4):104081. doi: 10.1016/j.isci.2022.104081. eCollection 2022 Apr 15.