Yang Hailong, Wang Jia, Wang Wenyan, Shi Shufang, Liu Lijing, Yao Yuhua, Tian Geng, Wang Peizhen, Yang Jialiang
School of Electrical and Information Engineering, Anhui University of Technology, No. 1530 Maxiang Road, Huashan District, Ma'anshan, Anhui 243032, China.
Department of Sciences, Geneis Beijing Co., Ltd., No. 31 Xinbei Road, Laiguangying, Chaoyang District, Beijing 100102, China.
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf209.
Accurate prediction of patient survival in cancer treatment is essential for effective therapeutic planning. Unfortunately, current models often underutilize the extensive multimodal data available, which undermines confidence in their predictions. This study presents MMSurv, an interpretable multimodal deep learning model for predicting survival across different cancer types. MMSurv integrates clinical information, sequencing data, and hematoxylin and eosin-stained whole-slide images (WSIs) to forecast patient survival. Specifically, we segment tumor regions from WSIs into image tiles and employ neural networks to encode each tile into a one-dimensional feature vector. We then encode clinical features with word-embedding techniques inspired by natural language processing. To better exploit the complementarity of multimodal data, this study proposes a novel multimodal fusion method that integrates compact bilinear pooling with a Transformer architecture. The fused features are then processed through a dual-layer multi-instance learning model to remove prognosis-irrelevant image tiles and predict each patient's survival risk. Furthermore, we employ cell segmentation to investigate the cellular composition of the tiles that receive high attention from the model, thereby enhancing its interpretability. We evaluate our approach on six cancer types from The Cancer Genome Atlas. The results demonstrate that utilizing multimodal data yields higher predictive accuracy than single-modal image data, with the average C-index increasing from 0.6750 to 0.7283. Additionally, we compare our proposed model with state-of-the-art methods using the C-index under five-fold cross-validation, observing a significant average improvement of nearly 10% in performance.
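The abstract does not provide implementation details, but a minimal sketch of the general fusion technique it names, compact bilinear pooling followed by a Transformer encoder layer, might look like the following in PyTorch. The `CompactBilinearPooling` module, all dimensions, and the usage at the end are illustrative assumptions rather than the authors' released code.

```python
import torch
import torch.nn as nn

class CompactBilinearPooling(nn.Module):
    """Count-sketch approximation of the bilinear (outer-product) interaction
    between two modality feature vectors (after Gao et al., 2016)."""
    def __init__(self, dim1, dim2, out_dim):
        super().__init__()
        self.out_dim = out_dim
        # Fixed random hash indices and signs for each input modality.
        self.register_buffer("h1", torch.randint(out_dim, (dim1,)))
        self.register_buffer("s1", torch.randint(0, 2, (dim1,)).float() * 2 - 1)
        self.register_buffer("h2", torch.randint(out_dim, (dim2,)))
        self.register_buffer("s2", torch.randint(0, 2, (dim2,)).float() * 2 - 1)

    def _sketch(self, x, h, s):
        # Scatter signed features into the sketch dimension (count sketch).
        sketch = x.new_zeros(*x.shape[:-1], self.out_dim)
        return sketch.index_add(x.dim() - 1, h, x * s)

    def forward(self, x1, x2):
        # Element-wise product in the frequency domain is a circular
        # convolution of the two sketches, approximating the outer product.
        f1 = torch.fft.rfft(self._sketch(x1, self.h1, self.s1), dim=-1)
        f2 = torch.fft.rfft(self._sketch(x2, self.h2, self.s2), dim=-1)
        return torch.fft.irfft(f1 * f2, n=self.out_dim, dim=-1)

# Hypothetical usage: fuse tile-level image features with a patient-level
# clinical/omics embedding, then contextualize the fused tile tokens.
cbp = CompactBilinearPooling(dim1=1024, dim2=256, out_dim=512)
img_tiles = torch.randn(200, 1024)            # 200 tile feature vectors
clin = torch.randn(256).expand(200, 256)      # broadcast clinical embedding
fused = cbp(img_tiles, clin)                  # (200, 512) fused tile tokens
encoder = nn.TransformerEncoderLayer(d_model=512, nhead=8, batch_first=True)
tokens = encoder(fused.unsqueeze(0))          # (1, 200, 512)
```

In this sketch the clinical embedding is simply broadcast to every tile before fusion; how MMSurv actually aligns patient-level and tile-level features is not specified in the abstract.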
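The evaluation protocol named above (C-index under five-fold cross-validation) can be illustrated with the rough sketch below, using scikit-learn's KFold and lifelines' concordance_index. The `fit_predict_risk` callable, feature matrix `X`, survival times `t`, and event indicators `e` are placeholders, not the paper's actual pipeline.

```python
import numpy as np
from sklearn.model_selection import KFold
from lifelines.utils import concordance_index

def cross_validated_cindex(fit_predict_risk, X, t, e, n_splits=5, seed=0):
    """Average concordance index (C-index) over k cross-validation folds.

    fit_predict_risk(X_train, t_train, e_train, X_test) -> risk scores,
    where higher risk should correspond to shorter survival. The callable
    stands in for any survival model (e.g. MMSurv or a baseline).
    """
    scores = []
    kf = KFold(n_splits=n_splits, shuffle=True, random_state=seed)
    for train_idx, test_idx in kf.split(X):
        risk = fit_predict_risk(X[train_idx], t[train_idx], e[train_idx],
                                X[test_idx])
        # lifelines expects a score that is higher for longer survival,
        # so negate the predicted risk before computing the C-index.
        scores.append(concordance_index(t[test_idx], -risk, e[test_idx]))
    return float(np.mean(scores))
```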