用于癌症预后的可解释多模态深度学习中新兴预训练策略的评估。

Assessment of emerging pretraining strategies in interpretable multimodal deep learning for cancer prognostication.

作者信息

Azher Zarif L, Suvarna Anish, Chen Ji-Qing, Zhang Ze, Christensen Brock C, Salas Lucas A, Vaickus Louis J, Levy Joshua J

机构信息

Thomas Jefferson High School for Science and Technology, Alexandria, VA, USA.

Cancer Biology Graduate Program, Dartmouth College Geisel School of Medicine, Hanover, NH, USA.

出版信息

BioData Min. 2023 Jul 22;16(1):23. doi: 10.1186/s13040-023-00338-w.

DOI:10.1186/s13040-023-00338-w

PMID:37481666

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10363299/

Abstract

BACKGROUND

Deep learning models can infer cancer patient prognosis from molecular and anatomic pathology information. Recent studies that leveraged information from complementary multimodal data improved prognostication, further illustrating the potential utility of such methods. However, current approaches: 1) do not comprehensively leverage biological and histomorphological relationships and 2) make use of emerging strategies to "pretrain" models (i.e., train models on a slightly orthogonal dataset/modeling objective) which may aid prognostication by reducing the amount of information required for achieving optimal performance. In addition, model interpretation is crucial for facilitating the clinical adoption of deep learning methods by fostering practitioner understanding and trust in the technology.

METHODS

Here, we develop an interpretable multimodal modeling framework that combines DNA methylation, gene expression, and histopathology (i.e., tissue slides) data, and we compare performance of crossmodal pretraining, contrastive learning, and transfer learning versus the standard procedure.

RESULTS

Our models outperform the existing state-of-the-art method (average 11.54% C-index increase), and baseline clinically driven models (average 11.7% C-index increase). Model interpretations elucidate consideration of biologically meaningful factors in making prognosis predictions.

DISCUSSION

Our results demonstrate that the selection of pretraining strategies is crucial for obtaining highly accurate prognostication models, even more so than devising an innovative model architecture, and further emphasize the all-important role of the tumor microenvironment on disease progression.

摘要

背景

深度学习模型可根据分子和解剖病理学信息推断癌症患者的预后。最近利用互补多模态数据信息的研究改善了预后预测，进一步说明了此类方法的潜在效用。然而，当前方法：1）未全面利用生物学和组织形态学关系；2）采用新兴策略对模型进行“预训练”（即在稍有正交性的数据集/建模目标上训练模型），这可能通过减少实现最佳性能所需的信息量来辅助预后预测。此外，模型解释对于促进深度学习方法在临床中的应用至关重要，因为它能增进从业者对该技术的理解和信任。

方法

在此，我们开发了一个可解释的多模态建模框架，该框架结合了DNA甲基化、基因表达和组织病理学（即组织切片）数据，并将跨模态预训练、对比学习和迁移学习的性能与标准程序进行了比较。

结果

我们的模型优于现有的最先进方法（C指数平均提高11.54%）和基线临床驱动模型（C指数平均提高11.7%）。模型解释阐明了在进行预后预测时对生物学上有意义的因素的考量。

讨论

我们的结果表明，预训练策略的选择对于获得高度准确的预后预测模型至关重要，甚至比设计创新的模型架构更为关键，并且进一步强调了肿瘤微环境在疾病进展中的至关重要的作用。

相似文献

Assessment of emerging pretraining strategies in interpretable multimodal deep learning for cancer prognostication.

BioData Min. 2023 Jul 22;16(1):23. doi: 10.1186/s13040-023-00338-w.

Spatial Omics Driven Crossmodal Pretraining Applied to Graph-based Deep Learning for Cancer Pathology Analysis.

bioRxiv. 2023 Jul 31:2023.07.30.551187. doi: 10.1101/2023.07.30.551187.

Spatial Omics Driven Crossmodal Pretraining Applied to Graph-based Deep Learning for Cancer Pathology Analysis.

Pac Symp Biocomput. 2024;29:464-476.

XMR: an explainable multimodal neural network for drug response prediction.

Front Bioinform. 2023 Aug 2;3:1164482. doi: 10.3389/fbinf.2023.1164482. eCollection 2023.

Pretraining Strategies for Structure Agnostic Material Property Prediction.

J Chem Inf Model. 2024 Feb 12;64(3):627-637. doi: 10.1021/acs.jcim.3c00919. Epub 2024 Feb 1.

Interpretable deep learning model to predict the molecular classification of endometrial cancer from haematoxylin and eosin-stained whole-slide images: a combined analysis of the PORTEC randomised trials and clinical cohorts.

Lancet Digit Health. 2023 Feb;5(2):e71-e82. doi: 10.1016/S2589-7500(22)00210-2. Epub 2022 Dec 7.

Deep multimodal graph-based network for survival prediction from highly multiplexed images and patient variables.

Comput Biol Med. 2023 Mar;154:106576. doi: 10.1016/j.compbiomed.2023.106576. Epub 2023 Feb 1.

RATING: Medical knowledge-guided rheumatoid arthritis assessment from multimodal ultrasound images via deep learning.

Patterns (N Y). 2022 Sep 29;3(10):100592. doi: 10.1016/j.patter.2022.100592. eCollection 2022 Oct 14.

The Overlooked Role of Specimen Preparation in Bolstering Deep Learning-Enhanced Spatial Transcriptomics Workflows.

medRxiv. 2023 Oct 9:2023.10.09.23296700. doi: 10.1101/2023.10.09.23296700.

NaroNet: Discovery of tumor microenvironment elements from highly multiplexed images.

Med Image Anal. 2022 May;78:102384. doi: 10.1016/j.media.2022.102384. Epub 2022 Feb 14.

引用本文的文献

Autonomous learning of pathologists' cancer grading rules.

bioRxiv. 2025 Apr 7:2025.03.18.643999. doi: 10.1101/2025.03.18.643999.

Insights to aging prediction with AI based epigenetic clocks.

Epigenomics. 2025 Jan;17(1):49-57. doi: 10.1080/17501911.2024.2432854. Epub 2024 Nov 25.

Progress and opportunities of foundation models in bioinformatics.

Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae548.

Building RadiologyNET: an unsupervised approach to annotating a large-scale multimodal medical database.

BioData Min. 2024 Jul 12;17(1):22. doi: 10.1186/s13040-024-00373-1.

Spatial Omics Driven Crossmodal Pretraining Applied to Graph-based Deep Learning for Cancer Pathology Analysis.

Pac Symp Biocomput. 2024;29:464-476.

Graph Neural Networks in Cancer and Oncology Research: Emerging and Future Trends.

Cancers (Basel). 2023 Dec 15;15(24):5858. doi: 10.3390/cancers15245858.

本文引用的文献

Cytokines secreted by inflamed oral mucosa: implications for oral cancer progression.

Oncogene. 2023 Apr;42(15):1159-1165. doi: 10.1038/s41388-023-02649-y. Epub 2023 Mar 6.

HiTIMED: hierarchical tumor immune microenvironment epigenetic deconvolution for accurate cell type resolution in the tumor microenvironment using tumor-type-specific DNA methylation data.

J Transl Med. 2022 Nov 8;20(1):516. doi: 10.1186/s12967-022-03736-6.

Artificial intelligence for multimodal data integration in oncology.

Cancer Cell. 2022 Oct 10;40(10):1095-1110. doi: 10.1016/j.ccell.2022.09.012.

Pan-cancer integrative histology-genomic analysis via multimodal deep learning.

Cancer Cell. 2022 Aug 8;40(8):865-878.e6. doi: 10.1016/j.ccell.2022.07.004.

A Novel Deep Learning Method to Predict Lung Cancer Long-Term Survival With Biological Knowledge Incorporated Gene Expression Images and Clinical Data.

Front Genet. 2022 Mar 14;13:800853. doi: 10.3389/fgene.2022.800853. eCollection 2022.

Identification and Validation of DEPDC1B as an Independent Early Diagnostic and Prognostic Biomarker in Liver Hepatocellular Carcinoma.

Front Genet. 2022 Jan 13;12:681809. doi: 10.3389/fgene.2021.681809. eCollection 2021.

Cancer statistics, 2022.

CA Cancer J Clin. 2022 Jan;72(1):7-33. doi: 10.3322/caac.21708. Epub 2022 Jan 12.

Unbox the black-box for the medical explainable AI via multi-modal and multi-centre data fusion: A mini-review, two showcases and beyond.

Inf Fusion. 2022 Jan;77:29-52. doi: 10.1016/j.inffus.2021.07.016.

Regulated lytic cell death in breast cancer.

Cell Biol Int. 2022 Jan;46(1):12-33. doi: 10.1002/cbin.11705. Epub 2021 Oct 4.

MethylSPWNet and MethylCapsNet: Biologically Motivated Organization of DNAm Neural Networks, Inspired by Capsule Networks.

NPJ Syst Biol Appl. 2021 Aug 20;7(1):33. doi: 10.1038/s41540-021-00193-7.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于癌症预后的可解释多模态深度学习中新兴预训练策略的评估。

Assessment of emerging pretraining strategies in interpretable multimodal deep learning for cancer prognostication.

作者信息

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

DISCUSSION

背景

方法

结果

讨论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献