• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Dual-modality visual feature flow for medical report generation.

作者信息

Tang Quan, Xu Liming, Wang Yongheng, Zheng Bochuan, Lv Jiancheng, Zeng Xianhua, Li Weisheng

机构信息

School of Computer Science, China West Normal University, Nanchong, 637009, Sichuan, China.

School of Computer Science, China West Normal University, Nanchong, 637009, Sichuan, China; College of Computer Science, Sichuan University, Chengdu, 610041, Sichuan, China.

出版信息

Med Image Anal. 2025 Apr;101:103413. doi: 10.1016/j.media.2024.103413. Epub 2024 Dec 1.

DOI:10.1016/j.media.2024.103413
PMID:39693718
Abstract

Medical report generation, a cross-modal task of generating medical text information, aiming to provide professional descriptions of medical images in clinical language. Despite some methods have made progress, there are still some limitations, including insufficient focus on lesion areas, omission of internal edge features, and difficulty in aligning cross-modal data. To address these issues, we propose Dual-Modality Visual Feature Flow (DMVF) for medical report generation. Firstly, we introduce region-level features based on grid-level features to enhance the method's ability to identify lesions and key areas. Then, we enhance two types of feature flows based on their attributes to prevent the loss of key information, respectively. Finally, we align visual mappings from different visual feature with report textual embeddings through a feature fusion module to perform cross-modal learning. Extensive experiments conducted on four benchmark datasets demonstrate that our approach outperforms the state-of-the-art methods in both natural language generation and clinical efficacy metrics.

摘要

相似文献

1
Dual-modality visual feature flow for medical report generation.
Med Image Anal. 2025 Apr;101:103413. doi: 10.1016/j.media.2024.103413. Epub 2024 Dec 1.
2
CGFTrans: Cross-Modal Global Feature Fusion Transformer for Medical Report Generation.CGFTrans:用于医疗报告生成的跨模态全局特征融合转换器。
IEEE J Biomed Health Inform. 2024 Sep;28(9):5600-5612. doi: 10.1109/JBHI.2024.3414413. Epub 2024 Sep 5.
3
Interactive dual-stream contrastive learning for radiology report generation.交互式双流对比学习在放射科报告生成中的应用。
J Biomed Inform. 2024 Sep;157:104718. doi: 10.1016/j.jbi.2024.104718. Epub 2024 Aug 28.
4
Pathology report generation from whole slide images with knowledge retrieval and multi-level regional feature selection.基于知识检索和多级区域特征选择从全切片图像生成病理报告
Comput Methods Programs Biomed. 2025 May;263:108677. doi: 10.1016/j.cmpb.2025.108677. Epub 2025 Feb 27.
5
A 3D hierarchical cross-modality interaction network using transformers and convolutions for brain glioma segmentation in MR images.一种使用变换和卷积的 3D 层次跨模态交互网络,用于磁共振图像中的脑胶质瘤分割。
Med Phys. 2024 Nov;51(11):8371-8389. doi: 10.1002/mp.17354. Epub 2024 Aug 13.
6
Multi-Grained Radiology Report Generation With Sentence-Level Image-Language Contrastive Learning.基于句子级图像-语言对比学习的多粒度放射学报告生成
IEEE Trans Med Imaging. 2024 Jul;43(7):2657-2669. doi: 10.1109/TMI.2024.3372638. Epub 2024 Jul 1.
7
Multifocal region-assisted cross-modality learning for chest X-ray report generation.多焦点区域辅助跨模态学习在胸部 X 射线报告生成中的应用。
Comput Biol Med. 2024 Dec;183:109187. doi: 10.1016/j.compbiomed.2024.109187. Epub 2024 Oct 21.
8
Visual prior-based cross-modal alignment network for radiology report generation.基于视觉先验的跨模态对齐网络在放射科报告生成中的应用。
Comput Biol Med. 2023 Nov;166:107522. doi: 10.1016/j.compbiomed.2023.107522. Epub 2023 Sep 22.
9
Cross-Modal self-supervised vision language pre-training with multiple objectives for medical visual question answering.用于医学视觉问答的多目标跨模态自监督视觉语言预训练
J Biomed Inform. 2024 Dec;160:104748. doi: 10.1016/j.jbi.2024.104748. Epub 2024 Nov 12.
10
Text-guided Image Restoration and Semantic Enhancement for Text-to-Image Person Retrieval.用于文本到图像人物检索的文本引导图像恢复与语义增强
Neural Netw. 2025 Apr;184:107028. doi: 10.1016/j.neunet.2024.107028. Epub 2024 Dec 16.