A scoping review on multimodal deep learning in biomedical images and texts.

Authors

Sun Zhaoyi, Lin Mingquan, Zhu Qingqing, Xie Qianqian, Wang Fei, Lu Zhiyong, Peng Yifan

Publication information

ArXiv. 2023 Oct 18:arXiv:2307.07362v3.

PMID:37576120
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10418520/

Abstract

Computer-assisted diagnostic and prognostic systems of the future should be capable of simultaneously processing multimodal data. Multimodal deep learning (MDL), which involves the integration of multiple sources of data, such as images and text, has the potential to revolutionize the analysis and interpretation of biomedical data. However, it only caught researchers' attention recently. To this end, there is a critical need to conduct a systematic review on this topic, identify the limitations of current work, and explore future directions. In this scoping review, we aim to provide a comprehensive overview of the current state of the field and identify key concepts, types of studies, and research gaps with a focus on biomedical images and texts joint learning, mainly because these two were the most commonly available data types in MDL research. This study reviewed the current uses of multimodal deep learning on five tasks: (1) Report generation, (2) Visual question answering, (3) Cross-modal retrieval, (4) Computer-aided diagnosis, and (5) Semantic segmentation. Our results highlight the diverse applications and potential of MDL and suggest directions for future research in the field. We hope our review will facilitate the collaboration of natural language processing (NLP) and medical imaging communities and support the next generation of decision-making and computer-assisted diagnostic system development.

Similar articles

1. A scoping review on multimodal deep learning in biomedical images and texts.
   ArXiv. 2023 Oct 18:arXiv:2307.07362v3.
2. A scoping review on multimodal deep learning in biomedical images and texts.
   J Biomed Inform. 2023 Oct;146:104482. doi: 10.1016/j.jbi.2023.104482. Epub 2023 Aug 29.
3. Multimodal representations of biomedical knowledge from limited training whole slide images and reports using deep learning.
   Med Image Anal. 2024 Oct;97:103303. doi: 10.1016/j.media.2024.103303. Epub 2024 Aug 14.
4. MMAgentRec, a personalized multi-modal recommendation agent with large language model.
   Sci Rep. 2025 Apr 8;15(1):12062. doi: 10.1038/s41598-025-96458-w.
5. The future of Cochrane Neonatal.
   Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.
6. Brain tumor segmentation and detection in MRI using convolutional neural networks and VGG16.
   Cancer Biomark. 2025 Mar;42(3):18758592241311184. doi: 10.1177/18758592241311184. Epub 2025 Apr 4.
7. Histopathology in focus: a review on explainable multi-modal approaches for breast cancer diagnosis.
   Front Med (Lausanne). 2024 Sep 30;11:1450103. doi: 10.3389/fmed.2024.1450103. eCollection 2024.
8. Developing ChatGPT for biology and medicine: a complete review of biomedical question answering.
   Biophys Rep. 2024 Jun 30;10(3):152-171. doi: 10.52601/bpr.2024.240004.
9. Ethics of Procuring and Using Organs or Tissue from Infants and Newborns for Transplantation, Research, or Commercial Purposes: Protocol for a Bioethics Scoping Review.
   Wellcome Open Res. 2024 Dec 5;9:717. doi: 10.12688/wellcomeopenres.23235.1. eCollection 2024.
10. Applications of Natural Language Processing for the Management of Stroke Disorders: Scoping Review.
    JMIR Med Inform. 2023 Sep 6;11:e48693. doi: 10.2196/48693.