• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于泛癌临床结局终点预测的多组学监督自动编码器。

A multi-omics supervised autoencoder for pan-cancer clinical outcome endpoints prediction.

机构信息

Communication & Computer Network Lab of Guangdong, School of Computer Science & Engineering, South China University of Technology, Wushan Road, Guangzhou, 381, China.

出版信息

BMC Med Inform Decis Mak. 2020 Jul 9;20(Suppl 3):129. doi: 10.1186/s12911-020-1114-3.

DOI:10.1186/s12911-020-1114-3
PMID:32646413
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7477832/
Abstract

BACKGROUND

With the rapid development of sequencing technologies, collecting diverse types of cancer omics data become more cost-effective. Many computational methods attempted to represent and fuse multiple omics into a comprehensive view of cancer. However, different types of omics are related and heterogeneous. Most of the existing methods do not consider the difference between omics, so the biological knowledge of individual omics may not be fully excavated. And for a given task (e.g. predicting overall survival), these methods prefer to use sample similarity or domain knowledge to learn a more reasonable representation of omics, but it's not enough.

METHODS

For the purpose of learning more useful representation for individual omics and fusing them to improve the prediction ability, we proposed an autoencoder-based method named MOSAE (Multi-omics Supervised Autoencoder). In our method, a specific autoencoder were designed for each omics according to their size of dimension to generate omics-specific representations. Then, a supervised autoencoder was constructed based on specific autoencoder by using labels to enforce each specific autoencoder to learn both omics-specific and task-specific representations. Finally, representations of different omics that generate from supervised autoencoders were fused in a traditional but powerful way, and the fused representation was used for subsequent predictive tasks.

RESULTS

We applied our method over TCGA Pan-Cancer dataset to predict four different clinical outcome endpoints (OS, PFI, DFI, and DSS). Compared with traditional and state-of-the-art methods, MOSAE achieved better predictive performance. We also tested the effects of each improvement, which all have a positive effect on predictive performance.

CONCLUSIONS

Predicting clinical outcome endpoints are very important for precision medicine and personalized medicine. And multi-omics fusion is an effective way to solve this problem. MOSAE is a powerful multi-omics fusion method, which can generate both omics-specific and task-specific representation for given endpoint predictive tasks and improve the predictive performance.

摘要

背景

随着测序技术的快速发展,收集各种类型的癌症组学数据变得更加经济实惠。许多计算方法试图将多种组学数据表示并融合为癌症的综合视图。然而,不同类型的组学数据之间存在相关性和异质性。大多数现有的方法都没有考虑组学数据之间的差异,因此个体组学的生物学知识可能没有被充分挖掘。对于给定的任务(例如预测总生存期),这些方法更倾向于使用样本相似性或领域知识来学习更合理的组学表示,但这还不够。

方法

为了学习更有用的个体组学表示并融合它们以提高预测能力,我们提出了一种基于自动编码器的方法,称为 MOSAE(多组学监督自动编码器)。在我们的方法中,根据每个组学的维度大小为其设计特定的自动编码器,以生成组学特异性表示。然后,基于特定的自动编码器构建监督自动编码器,使用标签强制每个特定的自动编码器学习组学特异性和任务特异性表示。最后,将来自监督自动编码器的不同组学的表示以传统但强大的方式融合,并将融合的表示用于后续的预测任务。

结果

我们在 TCGA 泛癌数据集上应用我们的方法来预测四个不同的临床结局终点(OS、PFI、DFI 和 DSS)。与传统方法和最先进的方法相比,MOSAE 实现了更好的预测性能。我们还测试了每个改进的效果,它们都对预测性能有积极的影响。

结论

预测临床结局终点对于精准医学和个性化医学非常重要。多组学融合是解决这个问题的有效方法。MOSAE 是一种强大的多组学融合方法,它可以为给定的终点预测任务生成组学特异性和任务特异性表示,并提高预测性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4daa/7646190/f169cb2a520d/12911_2020_1114_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4daa/7646190/40c08c482292/12911_2020_1114_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4daa/7646190/8abf63d98b23/12911_2020_1114_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4daa/7646190/f169cb2a520d/12911_2020_1114_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4daa/7646190/40c08c482292/12911_2020_1114_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4daa/7646190/8abf63d98b23/12911_2020_1114_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4daa/7646190/f169cb2a520d/12911_2020_1114_Fig3_HTML.jpg

相似文献

1
A multi-omics supervised autoencoder for pan-cancer clinical outcome endpoints prediction.一种用于泛癌临床结局终点预测的多组学监督自动编码器。
BMC Med Inform Decis Mak. 2020 Jul 9;20(Suppl 3):129. doi: 10.1186/s12911-020-1114-3.
2
A hierarchical integration deep flexible neural forest framework for cancer subtype classification by integrating multi-omics data.一种通过整合多组学数据进行癌症亚型分类的层次化集成深度灵活神经森林框架。
BMC Bioinformatics. 2019 Oct 28;20(1):527. doi: 10.1186/s12859-019-3116-7.
3
The prediction of drug sensitivity by multi-omics fusion reveals the heterogeneity of drug response in pan-cancer.多组学融合预测药物敏感性揭示了泛癌中药物反应的异质性。
Comput Biol Med. 2023 Sep;163:107220. doi: 10.1016/j.compbiomed.2023.107220. Epub 2023 Jul 1.
4
Autoencoder-assisted latent representation learning for survival prediction and multi-view clustering on multi-omics cancer subtyping.基于自动编码器辅助的生存预测潜在表示学习和多组学生物标志物癌症亚型的多视图聚类。
Math Biosci Eng. 2023 Nov 27;20(12):21098-21119. doi: 10.3934/mbe.2023933.
5
AVBAE-MODFR: A novel deep learning framework of embedding and feature selection on multi-omics data for pan-cancer classification.AVBAE-MODFR:一种基于多组学数据的嵌入和特征选择的深度学习框架,用于泛癌分类。
Comput Biol Med. 2024 Jul;177:108614. doi: 10.1016/j.compbiomed.2024.108614. Epub 2024 May 14.
6
Integrate multi-omics data with biological interaction networks using Multi-view Factorization AutoEncoder (MAE).使用多视图因子分解自动编码器(MAE)将多组学数据与生物相互作用网络集成。
BMC Genomics. 2019 Dec 20;20(Suppl 11):944. doi: 10.1186/s12864-019-6285-x.
7
Integrating multi-omics data through deep learning for accurate cancer prognosis prediction.通过深度学习整合多组学数据,实现癌症预后的精准预测。
Comput Biol Med. 2021 Jul;134:104481. doi: 10.1016/j.compbiomed.2021.104481. Epub 2021 May 9.
8
Local augmented graph neural network for multi-omics cancer prognosis prediction and analysis.用于多组学癌症预后预测与分析的局部增强图神经网络
Methods. 2023 May;213:1-9. doi: 10.1016/j.ymeth.2023.02.011. Epub 2023 Mar 16.
9
Multi-View Spectral Clustering Based on Multi-Smooth Representation Fusion for Cancer Subtype Prediction.基于多平滑表示融合的多视图谱聚类用于癌症亚型预测
Front Genet. 2021 Sep 6;12:718915. doi: 10.3389/fgene.2021.718915. eCollection 2021.
10
Super.FELT: supervised feature extraction learning using triplet loss for drug response prediction with multi-omics data.Super.FELT:基于三重损失的监督特征提取学习在多组学数据药物反应预测中的应用。
BMC Bioinformatics. 2021 May 25;22(1):269. doi: 10.1186/s12859-021-04146-z.

引用本文的文献

1
Machine learning based screening of biomarkers associated with cell death and immunosuppression of multiple life stages sepsis populations.基于机器学习对与多生命阶段脓毒症人群细胞死亡和免疫抑制相关生物标志物的筛选。
Sci Rep. 2025 Aug 19;15(1):30302. doi: 10.1038/s41598-025-14600-0.
2
MO-GCAN: multi-omics integration based on graph convolutional and attention networks.MO-GCAN:基于图卷积和注意力网络的多组学整合
Bioinformatics. 2025 Aug 2;41(8). doi: 10.1093/bioinformatics/btaf405.
3
Pan-cancer and multiomics: advanced strategies for diagnosis, prognosis, and therapy in the complex genetic and molecular universe of cancer.

本文引用的文献

1
An Integrated TCGA Pan-Cancer Clinical Data Resource to Drive High-Quality Survival Outcome Analytics.TCGA 泛癌临床数据资源整合,推动高质量生存预后分析。
Cell. 2018 Apr 5;173(2):400-416.e11. doi: 10.1016/j.cell.2018.02.052.
2
Deep Learning-Based Multi-Omics Integration Robustly Predicts Survival in Liver Cancer.基于深度学习的多组学整合可稳健预测肝癌患者的生存情况。
Clin Cancer Res. 2018 Mar 15;24(6):1248-1259. doi: 10.1158/1078-0432.CCR-17-0853. Epub 2017 Oct 5.
3
A review on machine learning principles for multi-view biological data integration.
泛癌与多组学:癌症复杂遗传和分子领域中的诊断、预后及治疗的先进策略
Clin Transl Oncol. 2024 Dec 26. doi: 10.1007/s12094-024-03819-4.
4
Artificial Intelligence in Metabolomics: A Current Review.代谢组学中的人工智能:当前综述
Trends Analyt Chem. 2024 Sep;178. doi: 10.1016/j.trac.2024.117852. Epub 2024 Jul 3.
5
Survival prediction landscape: an in-depth systematic literature review on activities, methods, tools, diseases, and databases.生存预测全景:关于活动、方法、工具、疾病和数据库的深入系统文献综述
Front Artif Intell. 2024 Jul 3;7:1428501. doi: 10.3389/frai.2024.1428501. eCollection 2024.
6
A practical introduction to holo-omics.全息组学实用入门
Cell Rep Methods. 2024 Jul 15;4(7):100820. doi: 10.1016/j.crmeth.2024.100820. Epub 2024 Jul 9.
7
Exploring machine learning strategies for predicting cardiovascular disease risk factors from multi-omic data.探索用于从多组学数据预测心血管疾病风险因素的机器学习策略。
BMC Med Inform Decis Mak. 2024 May 2;24(1):116. doi: 10.1186/s12911-024-02521-3.
8
Multimodal analysis methods in predictive biomedicine.预测性生物医学中的多模态分析方法。
Comput Struct Biotechnol J. 2023 Nov 20;21:5829-5838. doi: 10.1016/j.csbj.2023.11.011. eCollection 2023.
9
DeepAutoGlioma: a deep learning autoencoder-based multi-omics data integration and classification tools for glioma subtyping.深度自动胶质瘤分类器:一种基于深度学习自动编码器的多组学数据集成与胶质瘤亚型分类工具。
BioData Min. 2023 Nov 15;16(1):32. doi: 10.1186/s13040-023-00349-7.
10
A Bayesian method to infer copy number clones from single-cell RNA and ATAC sequencing.一种从单细胞 RNA 和 ATAC 测序推断拷贝数克隆的贝叶斯方法。
PLoS Comput Biol. 2023 Nov 2;19(11):e1011557. doi: 10.1371/journal.pcbi.1011557. eCollection 2023 Nov.
机器学习原理在多视图生物数据集成中的研究综述。
Brief Bioinform. 2018 Mar 1;19(2):325-340. doi: 10.1093/bib/bbw113.
4
Deep Patient: An Unsupervised Representation to Predict the Future of Patients from the Electronic Health Records.深度患者:一种从电子健康记录中预测患者未来的无监督表示。
Sci Rep. 2016 May 17;6:26094. doi: 10.1038/srep26094.
5
Dimension reduction techniques for the integrative analysis of multi-omics data.用于多组学数据综合分析的降维技术
Brief Bioinform. 2016 Jul;17(4):628-41. doi: 10.1093/bib/bbv108. Epub 2016 Mar 11.
6
Methods for the integration of multi-omics data: mathematical aspects.多组学数据整合方法:数学方面
BMC Bioinformatics. 2016 Jan 20;17 Suppl 2(Suppl 2):15. doi: 10.1186/s12859-015-0857-9.
7
Learning a hierarchical representation of the yeast transcriptomic machinery using an autoencoder model.使用自动编码器模型学习酵母转录组机制的层次表示。
BMC Bioinformatics. 2016 Jan 11;17 Suppl 1(Suppl 1):9. doi: 10.1186/s12859-015-0852-1.
8
Similarity network fusion for aggregating data types on a genomic scale.基于基因组尺度聚合数据类型的相似网络融合。
Nat Methods. 2014 Mar;11(3):333-7. doi: 10.1038/nmeth.2810. Epub 2014 Jan 26.
9
netClass: an R-package for network based, integrative biomarker signature discovery.netClass:一个基于网络的综合生物标志物特征发现的 R 包。
Bioinformatics. 2014 May 1;30(9):1325-6. doi: 10.1093/bioinformatics/btu025. Epub 2014 Jan 17.
10
Identifying multi-layer gene regulatory modules from multi-dimensional genomic data.从多维基因组数据中识别多层基因调控模块。
Bioinformatics. 2012 Oct 1;28(19):2458-66. doi: 10.1093/bioinformatics/bts476. Epub 2012 Aug 3.