利用组织病理学和基因组数据对缺失数据进行多模态学习以用于癌症诊断

Multi-modal Learning with Missing Data for Cancer Diagnosis Using Histopathological and Genomic Data.

作者信息

Cui Can, Asad Zuhayr, Dean William F, Smith Isabelle T, Madden Christopher, Bao Shunxing, Landman Bennett A, Roland Joseph T, Coburn Lori A, Wilson Keith T, Zwerner Jeffrey P, Zhao Shilin, Wheless Lee E, Huo Yuankai

机构信息

Department of Computer Science, Vanderbilt University, Nashville, TN 37235, USA.

College of Arts and Science, Vanderbilt University, Nashville, TN 37235, USA.

出版信息

Proc SPIE Int Soc Opt Eng. 2022 Feb-Mar;12033. doi: 10.1117/12.2612318. Epub 2022 Apr 4.

DOI:10.1117/12.2612318

PMID:36304178

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9605813/

Abstract

Multi-modal learning (e.g., integrating pathological images with genomic features) tends to improve the accuracy of cancer diagnosis and prognosis as compared to learning with a single modality. However, missing data is a common problem in clinical practice, i.e., not every patient has all modalities available. Most of the previous works directly discarded samples with missing modalities, which might lose information in these data and increase the likelihood of overfitting. In this work, we generalize the multi-modal learning in cancer diagnosis with the capacity of dealing with missing data using histological images and genomic data. Our integrated model can utilize all available data from patients with both complete and partial modalities. The experiments on the public TCGA-GBM and TCGA-LGG datasets show that the data with missing modalities can contribute to multi-modal learning, which improves the model performance in grade classification of glioma cancer.

摘要

与单模态学习相比，多模态学习（例如将病理图像与基因组特征相结合）往往能提高癌症诊断和预后的准确性。然而，数据缺失是临床实践中的常见问题，即并非每个患者都具备所有可用模态的数据。以前的大多数工作直接丢弃具有缺失模态的样本，这可能会丢失这些数据中的信息，并增加过拟合的可能性。在这项工作中，我们通过使用组织学图像和基因组数据来处理数据缺失的能力，推广了癌症诊断中的多模态学习。我们的集成模型可以利用来自具有完整和部分模态数据患者的所有可用数据。在公共TCGA-GBM和TCGA-LGG数据集上的实验表明，具有缺失模态的数据有助于多模态学习，从而提高了神经胶质瘤癌症分级分类中的模型性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3e4/9605813/9dcf293eb64a/nihms-1844002-f0001.jpg

相似文献

Multi-modal Learning with Missing Data for Cancer Diagnosis Using Histopathological and Genomic Data.利用组织病理学和基因组数据对缺失数据进行多模态学习以用于癌症诊断

Proc SPIE Int Soc Opt Eng. 2022 Feb-Mar;12033. doi: 10.1117/12.2612318. Epub 2022 Apr 4.

Comprehensive learning and adaptive teaching: Distilling multi-modal knowledge for pathological glioma grading.综合学习与适应性教学：提炼用于脑胶质瘤病理分级的多模态知识

Med Image Anal. 2024 Jan;91:102990. doi: 10.1016/j.media.2023.102990. Epub 2023 Oct 9.

A multi-modal fusion framework based on multi-task correlation learning for cancer prognosis prediction.一种基于多任务关联学习的多模态融合框架用于癌症预后预测。

Artif Intell Med. 2022 Apr;126:102260. doi: 10.1016/j.artmed.2022.102260. Epub 2022 Feb 24.

SG-Fusion: A swin-transformer and graph convolution-based multi-modal deep neural network for glioma prognosis.SG-Fusion：一种基于 Swin-Transformer 和图卷积的多模态深度神经网络，用于脑胶质瘤预后。

Artif Intell Med. 2024 Nov;157:102972. doi: 10.1016/j.artmed.2024.102972. Epub 2024 Aug 31.

Predicting rectal cancer prognosis from histopathological images and clinical information using multi-modal deep learning.利用多模态深度学习从组织病理学图像和临床信息预测直肠癌预后。

Front Oncol. 2024 Apr 15;14:1353446. doi: 10.3389/fonc.2024.1353446. eCollection 2024.

Deep Multi-path Network Integrating Incomplete Biomarker and Chest CT Data for Evaluating Lung Cancer Risk.深度多路径网络集成不完整生物标志物和胸部CT数据以评估肺癌风险

Proc SPIE Int Soc Opt Eng. 2021 Feb;11596. doi: 10.1117/12.2580730. Epub 2021 Feb 15.

Relation-Aware Shared Representation Learning for Cancer Prognosis Analysis With Auxiliary Clinical Variables and Incomplete Multi-Modality Data.基于辅助临床变量和不完整多模态数据的癌症预后分析的关系感知共享表示学习。

IEEE Trans Med Imaging. 2022 Jan;41(1):186-198. doi: 10.1109/TMI.2021.3108802. Epub 2021 Dec 30.

Gradient modulated contrastive distillation of low-rank multi-modal knowledge for disease diagnosis.梯度调制的低秩多模态知识对比蒸馏用于疾病诊断。

Med Image Anal. 2023 Aug;88:102874. doi: 10.1016/j.media.2023.102874. Epub 2023 Jun 21.

Multi-Constraint Latent Representation Learning for Prognosis Analysis Using Multi-Modal Data.基于多模态数据的多约束潜在表征学习用于预后分析

IEEE Trans Neural Netw Learn Syst. 2023 Jul;34(7):3737-3750. doi: 10.1109/TNNLS.2021.3112194. Epub 2023 Jul 6.

Multi-modal fusion network with intra- and inter-modality attention for prognosis prediction in breast cancer.多模态融合网络，具有内在和外在模态注意力，用于乳腺癌预后预测。

Comput Biol Med. 2024 Jan;168:107796. doi: 10.1016/j.compbiomed.2023.107796. Epub 2023 Dec 3.

引用本文的文献

Multimodal integration strategies for clinical application in oncology.肿瘤学临床应用中的多模态整合策略

Front Pharmacol. 2025 Aug 20;16:1609079. doi: 10.3389/fphar.2025.1609079. eCollection 2025.

本文引用的文献

GPDBN: deep bilinear network integrating both genomic data and pathological images for breast cancer prognosis prediction.GPDBN：用于乳腺癌预后预测的整合基因组数据和病理图像的深度双线性网络。

Bioinformatics. 2021 Sep 29;37(18):2963-2970. doi: 10.1093/bioinformatics/btab185.

Joint analysis of expression levels and histological images identifies genes associated with tissue morphology.联合表达水平分析和组织学图像分析鉴定与组织形态相关的基因。

Nat Commun. 2021 Mar 11;12(1):1609. doi: 10.1038/s41467-021-21727-x.

A Novel Pathological Images and Genomic Data Fusion Framework for Breast Cancer Survival Prediction.

Annu Int Conf IEEE Eng Med Biol Soc. 2020 Jul;2020:1384-1387. doi: 10.1109/EMBC44109.2020.9176360.

Pathomic Fusion: An Integrated Framework for Fusing Histopathology and Genomic Features for Cancer Diagnosis and Prognosis.病理融合：融合组织病理学和基因组特征用于癌症诊断和预后的综合框架。

IEEE Trans Med Imaging. 2022 Apr;41(4):757-770. doi: 10.1109/TMI.2020.3021387. Epub 2022 Apr 1.

Multi-task multi-modal learning for joint diagnosis and prognosis of human cancers.多任务多模态学习在人类癌症的联合诊断和预后中的应用。

Med Image Anal. 2020 Oct;65:101795. doi: 10.1016/j.media.2020.101795. Epub 2020 Jul 23.

Deep learning with multimodal representation for pancancer prognosis prediction.基于多模态表示的深度学习在泛癌预后预测中的应用。

Bioinformatics. 2019 Jul 15;35(14):i446-i454. doi: 10.1093/bioinformatics/btz342.

Predicting cancer outcomes from histology and genomics using convolutional networks.使用卷积网络从组织学和基因组学预测癌症结局。

Proc Natl Acad Sci U S A. 2018 Mar 27;115(13):E2970-E2979. doi: 10.1073/pnas.1717139115. Epub 2018 Mar 12.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验