Deep multimodal fusion of image and non-image data in disease diagnosis and prognosis: a review.

Authors

Cui Can, Yang Haichun, Wang Yaohong, Zhao Shilin, Asad Zuhayr, Coburn Lori A, Wilson Keith T, Landman Bennett A, Huo Yuankai

Affiliations

Department of Computer Science, Vanderbilt University, Nashville, TN 37235, United States of America.

Department of Pathology, Microbiology and Immunology, Vanderbilt University Medical Center, Nashville, TN 37215, United States of America.

Publication information

Prog Biomed Eng (Bristol). 2023 Apr 11;5(2). doi: 10.1088/2516-1091/acc2fe.

DOI: 10.1088/2516-1091/acc2fe
PMID: 37360402
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10288577/
Abstract

The rapid development of diagnostic technologies in healthcare is leading to higher requirements for physicians to handle and integrate the heterogeneous, yet complementary data that are produced during routine practice. For instance, the personalized diagnosis and treatment planning for a single cancer patient relies on various images (e.g. radiology, pathology and camera images) and non-image data (e.g. clinical data and genomic data). However, such decision-making procedures can be subjective, qualitative, and have large inter-subject variabilities. With the recent advances in multimodal deep learning technologies, an increasingly large number of efforts have been devoted to a key question: how do we extract and aggregate multimodal information to ultimately provide more objective, quantitative computer-aided clinical decision making? This paper reviews the recent studies on dealing with such a question. Briefly, this review will include the (a) overview of current multimodal learning workflows, (b) summarization of multimodal fusion methods, (c) discussion of the performance, (d) applications in disease diagnosis and prognosis, and (e) challenges and future directions.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/604b/10288577/29e98931a095/nihms-1903587-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/604b/10288577/44f2e1efb4d4/nihms-1903587-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/604b/10288577/6d2a70e420ff/nihms-1903587-f0002.jpg
