Multi-modal Learning with Missing Data for Cancer Diagnosis Using Histopathological and Genomic Data.

Authors

Cui Can, Asad Zuhayr, Dean William F, Smith Isabelle T, Madden Christopher, Bao Shunxing, Landman Bennett A, Roland Joseph T, Coburn Lori A, Wilson Keith T, Zwerner Jeffrey P, Zhao Shilin, Wheless Lee E, Huo Yuankai

Affiliations

Department of Computer Science, Vanderbilt University, Nashville, TN 37235, USA.

College of Arts and Science, Vanderbilt University, Nashville, TN 37235, USA.

Publication information

Proc SPIE Int Soc Opt Eng. 2022 Feb-Mar;12033. doi: 10.1117/12.2612318. Epub 2022 Apr 4.

Abstract

Multi-modal learning (e.g., integrating pathological images with genomic features) tends to improve the accuracy of cancer diagnosis and prognosis compared with learning from a single modality. However, missing data is a common problem in clinical practice: not every patient has all modalities available. Most previous work simply discards samples with missing modalities, which may lose the information in those samples and increase the likelihood of overfitting. In this work, we generalize multi-modal learning for cancer diagnosis to handle missing data, using histological images and genomic data. Our integrated model can use all available data from patients with either complete or partial modalities. Experiments on the public TCGA-GBM and TCGA-LGG datasets show that data with missing modalities can contribute to multi-modal learning and improve model performance in glioma grade classification.
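The abstract does not say how the integrated model combines modalities, so the following is only a minimal sketch of one common way to keep patients with partial data in a batch: average the embeddings of whichever modalities are present, using availability masks. The function name `fuse_with_missing`, the masked-mean fusion rule, and the toy embeddings are all assumptions made for illustration, not the authors' architecture.

```python
import numpy as np

def fuse_with_missing(image_emb, genomic_emb, image_mask, genomic_mask):
    """Average the embeddings of the modalities each patient actually has.

    Hypothetical helper (not from the paper).
    image_emb, genomic_emb: (n_patients, dim) float arrays; rows belonging to
        a missing modality may hold any placeholder, including NaN.
    image_mask, genomic_mask: (n_patients,) boolean arrays, True = available.
    """
    embs = np.stack([image_emb, genomic_emb], axis=0)        # (2, n, dim)
    masks = np.stack([image_mask, genomic_mask], axis=0)     # (2, n)
    masks = masks.astype(float)[..., None]                   # (2, n, 1)

    # Replace placeholder rows with zeros so they do not affect the sum.
    embs = np.where(masks > 0, embs, 0.0)

    counts = masks.sum(axis=0)                               # (n, 1)
    if np.any(counts == 0):
        raise ValueError("every patient needs at least one modality")

    # Masked mean over the modality axis.
    return embs.sum(axis=0) / counts


# Toy data: patient 0 has both modalities, patient 1 only the image,
# patient 2 only the genomic profile.
image_emb = np.array([[1.0, 3.0], [2.0, 2.0], [np.nan, np.nan]])
genomic_emb = np.array([[3.0, 1.0], [np.nan, np.nan], [4.0, 0.0]])
image_mask = np.array([True, True, False])
genomic_mask = np.array([True, False, True])

fused = fuse_with_missing(image_emb, genomic_emb, image_mask, genomic_mask)
```

With this rule, a patient with a single modality passes that modality's embedding through unchanged, so no sample has to be discarded before training a downstream classifier.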

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d3e4/9605813/9dcf293eb64a/nihms-1844002-f0001.jpg

Similar articles

Multi-Constraint Latent Representation Learning for Prognosis Analysis Using Multi-Modal Data.
IEEE Trans Neural Netw Learn Syst. 2023 Jul;34(7):3737-3750. doi: 10.1109/TNNLS.2021.3112194. Epub 2023 Jul 6.

Cited by

Multimodal integration strategies for clinical application in oncology.
Front Pharmacol. 2025 Aug 20;16:1609079. doi: 10.3389/fphar.2025.1609079. eCollection 2025.

