• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用多组学数据的深度学习自动编码器在癌症亚型检测中的性能比较

Performance Comparison of Deep Learning Autoencoders for Cancer Subtype Detection Using Multi-Omics Data.

作者信息

Franco Edian F, Rana Pratip, Cruz Aline, Calderón Víctor V, Azevedo Vasco, Ramos Rommel T J, Ghosh Preetam

机构信息

Institute of Biological Sciences, Federal University of Para, Belem, PA 66075-110, Brazil.

Laboratory of Virology and Environmental Genomics, Instituto de Innovacion en Biotecnologia e Industria (IIBI), Santo Domingo 10104, Dominican Republic.

出版信息

Cancers (Basel). 2021 Apr 22;13(9):2013. doi: 10.3390/cancers13092013.

DOI:10.3390/cancers13092013
PMID:33921978
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8122584/
Abstract

A heterogeneous disease such as cancer is activated through multiple pathways and different perturbations. Depending upon the activated pathway(s), the survival of the patients varies significantly and shows different efficacy to various drugs. Therefore, cancer subtype detection using genomics level data is a significant research problem. Subtype detection is often a complex problem, and in most cases, needs multi-omics data fusion to achieve accurate subtyping. Different data fusion and subtyping approaches have been proposed over the years, such as kernel-based fusion, matrix factorization, and deep learning autoencoders. In this paper, we compared the performance of different deep learning autoencoders for cancer subtype detection. We performed cancer subtype detection on four different cancer types from The Cancer Genome Atlas (TCGA) datasets using four autoencoder implementations. We also predicted the optimal number of subtypes in a cancer type using the silhouette score and found that the detected subtypes exhibit significant differences in survival profiles. Furthermore, we compared the effect of feature selection and similarity measures for subtype detection. For further evaluation, we used the Glioblastoma multiforme (GBM) dataset and identified the differentially expressed genes in each of the subtypes. The results obtained are consistent with other genomic studies and can be corroborated with the involved pathways and biological functions. Thus, it shows that the results from the autoencoders, obtained through the interaction of different datatypes of cancer, can be used for the prediction and characterization of patient subgroups and survival profiles.

摘要

像癌症这样的异质性疾病是通过多种途径和不同扰动激活的。根据激活的途径不同,患者的生存率差异显著,并且对各种药物表现出不同的疗效。因此,利用基因组学水平的数据进行癌症亚型检测是一个重要的研究问题。亚型检测通常是一个复杂的问题,在大多数情况下,需要多组学数据融合来实现准确的亚型分类。多年来已经提出了不同的数据融合和亚型分类方法,如基于核的融合、矩阵分解和深度学习自动编码器。在本文中,我们比较了不同深度学习自动编码器在癌症亚型检测中的性能。我们使用四种自动编码器实现方法,对来自癌症基因组图谱(TCGA)数据集的四种不同癌症类型进行了癌症亚型检测。我们还使用轮廓系数预测了一种癌症类型中的最佳亚型数量,发现检测到的亚型在生存概况上表现出显著差异。此外,我们比较了特征选择和相似性度量对亚型检测的影响。为了进一步评估,我们使用了多形性胶质母细胞瘤(GBM)数据集,并确定了每个亚型中的差异表达基因。获得的结果与其他基因组研究一致,并且可以通过所涉及的途径和生物学功能得到证实。因此,这表明通过癌症不同数据类型的相互作用从自动编码器获得的结果可用于预测和表征患者亚组及生存概况。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9966/8122584/8aa2dc85bc5c/cancers-13-02013-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9966/8122584/4f90fcac1d3b/cancers-13-02013-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9966/8122584/f29e417bf5ce/cancers-13-02013-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9966/8122584/3f761d12f460/cancers-13-02013-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9966/8122584/8aa2dc85bc5c/cancers-13-02013-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9966/8122584/4f90fcac1d3b/cancers-13-02013-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9966/8122584/f29e417bf5ce/cancers-13-02013-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9966/8122584/3f761d12f460/cancers-13-02013-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9966/8122584/8aa2dc85bc5c/cancers-13-02013-g007.jpg

相似文献

1
Performance Comparison of Deep Learning Autoencoders for Cancer Subtype Detection Using Multi-Omics Data.使用多组学数据的深度学习自动编码器在癌症亚型检测中的性能比较
Cancers (Basel). 2021 Apr 22;13(9):2013. doi: 10.3390/cancers13092013.
2
Capturing the latent space of an Autoencoder for multi-omics integration and cancer subtyping.捕获自动编码器的潜在空间,用于多组学整合和癌症亚型分类。
Comput Biol Med. 2022 Sep;148:105832. doi: 10.1016/j.compbiomed.2022.105832. Epub 2022 Jul 5.
3
A deep learning approach based on multi-omics data integration to construct a risk stratification prediction model for skin cutaneous melanoma.基于多组学数据整合的深度学习方法构建皮肤黑色素瘤风险分层预测模型。
J Cancer Res Clin Oncol. 2023 Nov;149(17):15923-15938. doi: 10.1007/s00432-023-05358-x. Epub 2023 Sep 7.
4
A hierarchical integration deep flexible neural forest framework for cancer subtype classification by integrating multi-omics data.一种通过整合多组学数据进行癌症亚型分类的层次化集成深度灵活神经森林框架。
BMC Bioinformatics. 2019 Oct 28;20(1):527. doi: 10.1186/s12859-019-3116-7.
5
DeepAutoGlioma: a deep learning autoencoder-based multi-omics data integration and classification tools for glioma subtyping.深度自动胶质瘤分类器:一种基于深度学习自动编码器的多组学数据集成与胶质瘤亚型分类工具。
BioData Min. 2023 Nov 15;16(1):32. doi: 10.1186/s13040-023-00349-7.
6
Cancer subtype identification by consensus guided graph autoencoders.基于共识引导图自编码器的癌症亚型识别。
Bioinformatics. 2021 Dec 11;37(24):4779-4786. doi: 10.1093/bioinformatics/btab535.
7
PathME: pathway based multi-modal sparse autoencoders for clustering of patient-level multi-omics data.PathME:基于通路的多模态稀疏自动编码器,用于对患者层面多组学数据进行聚类。
BMC Bioinformatics. 2020 Apr 16;21(1):146. doi: 10.1186/s12859-020-3465-2.
8
Autoencoder-assisted latent representation learning for survival prediction and multi-view clustering on multi-omics cancer subtyping.基于自动编码器辅助的生存预测潜在表示学习和多组学生物标志物癌症亚型的多视图聚类。
Math Biosci Eng. 2023 Nov 27;20(12):21098-21119. doi: 10.3934/mbe.2023933.
9
A multi-omics supervised autoencoder for pan-cancer clinical outcome endpoints prediction.一种用于泛癌临床结局终点预测的多组学监督自动编码器。
BMC Med Inform Decis Mak. 2020 Jul 9;20(Suppl 3):129. doi: 10.1186/s12911-020-1114-3.
10
Deep learning assisted multi-omics integration for survival and drug-response prediction in breast cancer.深度学习辅助的多组学整合用于乳腺癌生存和药物反应预测
BMC Genomics. 2021 Mar 24;22(1):214. doi: 10.1186/s12864-021-07524-2.

引用本文的文献

1
Cancer genomics and bioinformatics in Latin American countries: applications, challenges, and perspectives.拉丁美洲国家的癌症基因组学与生物信息学:应用、挑战与前景
Front Oncol. 2025 Jul 9;15:1584178. doi: 10.3389/fonc.2025.1584178. eCollection 2025.
2
Immune-related genes can accurately predict survival in bladder cancer: a retrospective study via two independent immunotherapy cohorts.免疫相关基因可准确预测膀胱癌的生存率:一项通过两个独立免疫治疗队列进行的回顾性研究。
Transl Androl Urol. 2025 Jun 30;14(6):1661-1678. doi: 10.21037/tau-2025-28. Epub 2025 Jun 26.
3
A densely connected framework for cancer subtype classification.

本文引用的文献

1
Breaking through the glioblastoma micro-environment via extracellular vesicles.通过细胞外囊泡突破脑胶质瘤微环境。
Oncogene. 2020 Jun;39(23):4477-4490. doi: 10.1038/s41388-020-1308-2. Epub 2020 May 4.
2
Integrated Analysis to Evaluate the Prognostic Value of Signature mRNAs in Glioblastoma Multiforme.综合分析评估多形性胶质母细胞瘤中特征性mRNA的预后价值
Front Genet. 2020 Mar 31;11:253. doi: 10.3389/fgene.2020.00253. eCollection 2020.
3
Molecular Targets for Combined Therapeutic Strategies to Limit Glioblastoma Cell Migration and Invasion.
一种用于癌症亚型分类的密集连接框架。
BMC Bioinformatics. 2025 Jul 18;26(1):183. doi: 10.1186/s12859-025-06230-0.
4
Comparative analysis of statistical and deep learning-based multi-omics integration for breast cancer subtype classification.基于统计和深度学习的多组学整合用于乳腺癌亚型分类的比较分析
J Transl Med. 2025 Jul 1;23(1):709. doi: 10.1186/s12967-025-06662-5.
5
Multimodal deep learning for predicting neoadjuvant treatment outcomes in breast cancer: a systematic review.用于预测乳腺癌新辅助治疗结果的多模态深度学习:一项系统综述
Biol Direct. 2025 Jun 23;20(1):72. doi: 10.1186/s13062-025-00661-8.
6
Navigating the Multiverse: a Hitchhiker's guide to selecting harmonization methods for multimodal biomedical data.探索多元宇宙:多模态生物医学数据协调方法选择指南
Biol Methods Protoc. 2025 Apr 17;10(1):bpaf028. doi: 10.1093/biomethods/bpaf028. eCollection 2025.
7
GramSeq-DTA: A Grammar-Based Drug-Target Affinity Prediction Approach Fusing Gene Expression Information.GramSeq-DTA:一种融合基因表达信息的基于语法的药物-靶点亲和力预测方法。
Biomolecules. 2025 Mar 12;15(3):405. doi: 10.3390/biom15030405.
8
Role of AI in empowering and redefining the oncology care landscape: perspective from a developing nation.人工智能在赋能和重新定义肿瘤护理格局中的作用:来自一个发展中国家的视角。
Front Digit Health. 2025 Mar 4;7:1550407. doi: 10.3389/fdgth.2025.1550407. eCollection 2025.
9
Heterogeneous Clustering of Multiomics Data for Breast Cancer Subgroup Classification and Detection.用于乳腺癌亚组分类和检测的多组学数据的异质性聚类
Int J Mol Sci. 2025 Feb 17;26(4):1707. doi: 10.3390/ijms26041707.
10
Using multiomic integration to improve blood biomarkers of major depressive disorder: a case-control study.利用多组学整合改善重度抑郁症的血液生物标志物:一项病例对照研究。
EBioMedicine. 2025 Mar;113:105569. doi: 10.1016/j.ebiom.2025.105569. Epub 2025 Feb 5.
限制胶质母细胞瘤细胞迁移和侵袭的联合治疗策略的分子靶点
Front Pharmacol. 2020 Mar 27;11:358. doi: 10.3389/fphar.2020.00358. eCollection 2020.
4
Variational Autoencoders for Cancer Data Integration: Design Principles and Computational Practice.用于癌症数据整合的变分自编码器:设计原理与计算实践
Front Genet. 2019 Dec 11;10:1205. doi: 10.3389/fgene.2019.01205. eCollection 2019.
5
Recent advances on constraint-based models by integrating machine learning.基于机器学习的约束模型的最新进展。
Curr Opin Biotechnol. 2020 Aug;64:85-91. doi: 10.1016/j.copbio.2019.11.007. Epub 2019 Dec 5.
6
Evaluation of colorectal cancer subtypes and cell lines using deep learning.基于深度学习的结直肠癌亚型和细胞系评估。
Life Sci Alliance. 2019 Dec 2;2(6). doi: 10.26508/lsa.201900517. Print 2019 Dec.
7
Identification of aberrantly methylated differentially expressed genes in glioblastoma multiforme and their association with patient survival.多形性胶质母细胞瘤中异常甲基化的差异表达基因的鉴定及其与患者生存的关联。
Exp Ther Med. 2019 Sep;18(3):2140-2152. doi: 10.3892/etm.2019.7807. Epub 2019 Jul 24.
8
MOSClip: multi-omic and survival pathway analysis for the identification of survival associated gene and modules.MOSClip:用于识别生存相关基因和模块的多组学和生存途径分析。
Nucleic Acids Res. 2019 Aug 22;47(14):e80. doi: 10.1093/nar/gkz324.
9
A multi-omics data simulator for complex disease studies and its application to evaluate multi-omics data analysis methods for disease classification.用于复杂疾病研究的多组学数据模拟器及其在评估疾病分类的多组学数据分析方法中的应用。
Gigascience. 2019 May 1;8(5). doi: 10.1093/gigascience/giz045.
10
Glutamate Receptors and Glioblastoma Multiforme: An Old "Route" for New Perspectives.谷氨酸受体与多形性胶质母细胞瘤:旧“途径”新视角。
Int J Mol Sci. 2019 Apr 11;20(7):1796. doi: 10.3390/ijms20071796.