Deep learning-based cancer survival prognosis from RNA-seq data: approaches and evaluations.

Affiliations

School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN, 47907, USA.

Department of Medicine, Indiana University School of Medicine, Indianapolis, IN, 46202, USA.

Publication information

BMC Med Genomics. 2020 Apr 3;13(Suppl 5):41. doi: 10.1186/s12920-020-0686-1.

DOI:10.1186/s12920-020-0686-1

PMID:32241264

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7118823/

Abstract

BACKGROUND

Recent advances in kernel-based Deep Learning models have introduced a new era in medical research. Originally designed for pattern recognition and image processing, Deep Learning models are now applied to survival prognosis of cancer patients. Specifically, Deep Learning versions of the Cox proportional hazards models are trained with transcriptomic data to predict survival outcomes in cancer patients.
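As a rough illustration of what a "Deep Learning version" of the Cox model optimizes, the sketch below computes the negative Cox partial log-likelihood from a network's predicted log-hazard scores. It is a minimal pure-Python sketch, not the implementation used by Cox-nnet, DeepSurv, or AECOX; the function name `cox_neg_partial_log_likelihood` and its arguments are hypothetical, and tied event times are handled naively (no Efron correction).

```python
import math

def cox_neg_partial_log_likelihood(risk_scores, times, events):
    """Average negative Cox partial log-likelihood (no tie correction).

    risk_scores: predicted log-hazard per patient (higher = riskier)
    times:       observed follow-up times
    events:      1 if death was observed, 0 if the patient was censored
    """
    n_events = sum(events)
    if n_events == 0:
        raise ValueError("at least one observed event is required")
    total = 0.0
    for i, (t_i, e_i) in enumerate(zip(times, events)):
        if not e_i:
            continue  # censored patients contribute only through risk sets
        # Risk set: every patient still under observation at time t_i.
        at_risk = [risk_scores[j] for j, t_j in enumerate(times) if t_j >= t_i]
        # Log-sum-exp over the risk set, shifted by the max for stability.
        m = max(at_risk)
        log_denominator = m + math.log(sum(math.exp(s - m) for s in at_risk))
        total += risk_scores[i] - log_denominator
    return -total / n_events

# Scores that rank the earliest death as riskiest give a lower loss.
good = cox_neg_partial_log_likelihood([2.0, 1.0, 0.0], [1, 2, 3], [1, 1, 1])
bad = cox_neg_partial_log_likelihood([0.0, 1.0, 2.0], [1, 2, 3], [1, 1, 1])
print(good < bad)
```

In a neural network setting, `risk_scores` would be the output of the final layer, and this quantity would be minimized by gradient descent in an automatic differentiation framework.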

METHODS

In this study, a broad analysis was performed on TCGA cancers using a variety of Deep Learning-based models, including Cox-nnet, DeepSurv, and a method proposed by our group named AECOX (AutoEncoder with Cox regression network). The concordance index and the p-value of the log-rank test were used to evaluate model performance.
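The two evaluation metrics named above can be sketched in a few lines. The following is an illustrative pure-Python version of Harrell's concordance index and a two-group log-rank test, assuming right-censored data and ignoring pairs with tied times in the concordance index; the function names are hypothetical, and this is not the evaluation code used in the study. How patients are split into two groups for the log-rank test (for example, at the median predicted risk) is an assumption left to the caller.

```python
import math

def concordance_index(risk_scores, times, events):
    """Harrell's C-index: share of comparable pairs ranked correctly."""
    concordant, comparable = 0.0, 0
    n = len(times)
    for i in range(n):
        if not events[i]:
            continue  # the earlier time in a pair must be an observed event
        for j in range(n):
            if times[i] < times[j]:
                comparable += 1
                if risk_scores[i] > risk_scores[j]:
                    concordant += 1.0
                elif risk_scores[i] == risk_scores[j]:
                    concordant += 0.5  # tied predictions count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable

def logrank_p_value(times_a, events_a, times_b, events_b):
    """Two-group log-rank test; returns the chi-square (1 d.o.f.) p-value."""
    data = [(t, e, 0) for t, e in zip(times_a, events_a)]
    data += [(t, e, 1) for t, e in zip(times_b, events_b)]
    observed_a = expected_a = variance = 0.0
    for t in sorted({t for t, e, _ in data if e}):
        n_a = sum(1 for u, _, g in data if u >= t and g == 0)  # at risk in A
        n_b = sum(1 for u, _, g in data if u >= t and g == 1)  # at risk in B
        d_a = sum(1 for u, e, g in data if u == t and e and g == 0)
        d_b = sum(1 for u, e, g in data if u == t and e and g == 1)
        n, d = n_a + n_b, d_a + d_b
        observed_a += d_a
        expected_a += d * n_a / n
        if n > 1:  # hypergeometric variance of the event count in group A
            variance += d * (n_a / n) * (n_b / n) * (n - d) / (n - 1)
    if variance == 0:
        return 1.0
    chi2 = (observed_a - expected_a) ** 2 / variance
    # Survival function of a chi-square variable with 1 degree of freedom.
    return math.erfc(math.sqrt(chi2 / 2.0))

c_perfect = concordance_index([3.0, 2.0, 1.0], [1, 2, 3], [1, 1, 1])
p_separated = logrank_p_value([1, 2, 3, 4, 5], [1] * 5,
                              [10, 11, 12, 13, 14], [1] * 5)
```

A concordance index of 0.5 corresponds to random ranking and 1.0 to perfect ranking, while a small log-rank p-value indicates that the two risk groups have distinguishable survival curves.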

RESULTS

All models show competitive results across 12 cancer types. The last hidden layers of the Deep Learning approaches are lower-dimensional representations of the input data that can be used for feature reduction and visualization. Furthermore, prognosis performance shows a negative correlation between model accuracy, overall survival time statistics, and tumor mutation burden (TMB), suggesting an association among overall survival time, TMB, and prognosis prediction accuracy.

CONCLUSIONS

Deep Learning-based algorithms demonstrate superior performance compared with traditional machine learning-based models. The cancer prognosis results, measured by concordance index, are indistinguishable across models but highly variable across cancers. These findings shed some light on the relationships between patient characteristics and survival learnability at a pan-cancer level.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7037/7118823/540fb1c678de/12920_2020_686_Fig1_HTML.jpg
