• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

VTrans:一种基于变分自编码器的预训练Transformer方法用于微生物组数据分析。

VTrans: A VAE-Based Pre-Trained Transformer Method for Microbiome Data Analysis.

作者信息

Shi Xinyuan, Zhu Fangfang, Min Wenwen

机构信息

School of Information Science and Engineering, Yunnan University, Kunming, China.

School of Health and Nursing, Yunnan Open University, Kunming, China.

出版信息

J Comput Biol. 2025 Sep;32(9):850-864. doi: 10.1089/cmb.2024.0884. Epub 2025 Apr 28.

DOI:10.1089/cmb.2024.0884
PMID:40295093
Abstract

Predicting the survival outcomes and assessing the risk of patients play a pivotal role in comprehending the microbial composition across various stages of cancer. With the ongoing advancements in deep learning, it has been substantiated that deep learning holds the potential to analyze patient survival risks based on microbial data. However, confronting a common challenge in individual cancer datasets involves the limited sample size and the high dimensionality of the feature space. This predicament often leads to overfitting issues in deep learning models, hindering their ability to effectively extract profound data representations and resulting in suboptimal model performance. To overcome these challenges, we advocate the utilization of pretraining and fine-tuning strategies, which have proven effective in addressing the constraint of having a smaller sample size in individual cancer datasets. In this study, we propose a deep learning model that amalgamates Transformer encoder and variational autoencoder (VAE), VTrans, employing both pre-training and fine-tuning strategies to predict the survival risk of cancer patients using microbial data. Furthermore, we highlight the potential of extending VTrans to integrate microbial multi-omics data. Our method is assessed on three distinct cancer datasets from The Cancer Genome Atlas Program, and the research findings demonstrated that (1) VTrans excels in terms of performance compared to conventional machine learning and other deep learning models. (2) The utilization of pretraning significantly enhances its performance. (3) In contrast to positional encoding, employing VAE encoding proves to be more effective in enriching data representation. (4) Using the idea of saliency map, it is possible to observe which microbes have a high contribution to the classification results. These results demonstrate the effectiveness of VTrans in prediting patient survival risk. Source code and all datasets used in this paper are available at https://github.com/wenwenmin/VTrans and https://doi.org/10.5281/zenodo.14166580.

摘要

预测患者的生存结果并评估其风险在理解癌症各个阶段的微生物组成方面起着关键作用。随着深度学习的不断发展,已经证实深度学习有潜力基于微生物数据来分析患者的生存风险。然而,在单个癌症数据集中面临的一个常见挑战是样本量有限以及特征空间的高维度。这种困境常常导致深度学习模型出现过拟合问题,阻碍它们有效提取深度数据表示的能力,从而导致模型性能欠佳。为了克服这些挑战,我们提倡使用预训练和微调策略,这些策略已被证明在解决单个癌症数据集中样本量较小的限制方面是有效的。在本研究中,我们提出了一种深度学习模型,该模型融合了Transformer编码器和变分自编码器(VAE),即VTrans,采用预训练和微调策略,利用微生物数据来预测癌症患者的生存风险。此外,我们强调了扩展VTrans以整合微生物多组学数据的潜力。我们的方法在来自癌症基因组图谱计划的三个不同癌症数据集上进行了评估,研究结果表明:(1)与传统机器学习和其他深度学习模型相比,VTrans在性能方面表现出色。(2)预训练的使用显著提高了其性能。(3)与位置编码相比,采用VAE编码在丰富数据表示方面更有效。(4)利用显著性图的概念,可以观察到哪些微生物对分类结果有高贡献。这些结果证明了VTrans在预测患者生存风险方面的有效性。本文使用的源代码和所有数据集可在https://github.com/wenwenmin/VTrans和https://doi.org/10.5281/zenodo.14166580获取。

相似文献

1
VTrans: A VAE-Based Pre-Trained Transformer Method for Microbiome Data Analysis.VTrans:一种基于变分自编码器的预训练Transformer方法用于微生物组数据分析。
J Comput Biol. 2025 Sep;32(9):850-864. doi: 10.1089/cmb.2024.0884. Epub 2025 Apr 28.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Cognitive decline assessment using semantic linguistic content and transformer deep learning architecture.使用语义语言内容和变压器深度学习架构评估认知能力下降。
Int J Lang Commun Disord. 2024 May-Jun;59(3):1110-1127. doi: 10.1111/1460-6984.12973. Epub 2023 Nov 16.
4
Trajectory-Ordered Objectives for Self-Supervised Representation Learning of Temporal Healthcare Data Using Transformers: Model Development and Evaluation Study.使用Transformer进行时间序列医疗数据自监督表示学习的轨迹有序目标:模型开发与评估研究
JMIR Med Inform. 2025 Jun 4;13:e68138. doi: 10.2196/68138.
5
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
6
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
7
Short-Term Memory Impairment短期记忆障碍
8
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状Meta分析。
Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.
9
Actor critic with experience replay-based automatic treatment planning for prostate cancer intensity modulated radiotherapy.基于经验回放的演员-评论家算法用于前列腺癌调强放射治疗的自动治疗计划
Med Phys. 2025 Jul;52(7):e17915. doi: 10.1002/mp.17915. Epub 2025 May 31.
10
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状荟萃分析。
Cochrane Database Syst Rev. 2017 Dec 22;12(12):CD011535. doi: 10.1002/14651858.CD011535.pub2.