
Simulating doctors' thinking logic for chest X-ray report generation via Transformer-based Semantic Query learning.

Authors

Gao Danyang, Kong Ming, Zhao Yongrui, Huang Jing, Huang Zhengxing, Kuang Kun, Wu Fei, Zhu Qiang

Affiliations

Computer School, Beijing Information Science and Technology University, Beijing 100005, China.

College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China.

Publication information

Med Image Anal. 2024 Jan;91:102982. doi: 10.1016/j.media.2023.102982. Epub 2023 Sep 29.

DOI: 10.1016/j.media.2023.102982
PMID: 37837692

Abstract

Medical report generation can be treated as a process of doctors' observing, understanding, and describing images from different perspectives. Following this process, this paper innovatively proposes a Transformer-based Semantic Query learning paradigm (TranSQ). Briefly, this paradigm is to learn an intention embedding set and make a semantic query to the visual features, generate intent-compliant sentence candidates, and form a coherent report. We apply a bipartite matching mechanism during training to realize the dynamic correspondence between the intention embeddings and the sentences to induct medical concepts into the observation intentions. Experimental results on two major radiology reporting datasets (i.e., IU X-ray and MIMIC-CXR) demonstrate that our model outperforms state-of-the-art models regarding generation effectiveness and clinical efficacy. In addition, comprehensive ablation experiments fully validate the TranSQ model's innovation and interpretation. The code is available at https://github.com/zjukongming/TranSQ.
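
The bipartite matching step mentioned in the abstract can be illustrated with a minimal sketch. The code below is not taken from the TranSQ repository: the function names, the cosine-based cost, the toy 2-D embeddings, and the idea of leaving surplus queries unmatched are all assumptions made for illustration. It pairs each reference sentence with a distinct query output so that the total matching cost is minimal, using exhaustive search that is only practical at toy sizes.

```python
from itertools import permutations
from math import sqrt


def cosine_cost(u, v):
    """Return 1 - cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v))
    return 1.0 - dot / norm


def match_queries_to_sentences(query_outputs, sentence_targets):
    """Brute-force minimum-cost one-to-one matching (hypothetical helper).

    Returns a dict {sentence_index: query_index}. Assumes there are at
    least as many queries as target sentences.
    """
    n_queries, n_targets = len(query_outputs), len(sentence_targets)
    if n_targets > n_queries:
        raise ValueError("need at least as many queries as sentences")
    # cost[q][t]: how poorly query output q represents target sentence t.
    cost = [[cosine_cost(q, s) for s in sentence_targets] for q in query_outputs]
    best_total, best_assignment = float("inf"), None
    # Try every way of giving each target sentence its own query.
    for chosen in permutations(range(n_queries), n_targets):
        total = sum(cost[q][t] for t, q in enumerate(chosen))
        if total < best_total:
            best_total, best_assignment = total, chosen
    return {t: q for t, q in enumerate(best_assignment)}


# Toy data (assumed): three query outputs, two reference sentence embeddings.
queries = [[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]]
targets = [[0.1, 0.9], [0.9, 0.1]]
matching = match_queries_to_sentences(queries, targets)
unmatched = sorted(set(range(len(queries))) - set(matching.values()))
print(matching, unmatched)  # {0: 1, 1: 0} [2]
```

In this toy run the first reference sentence is assigned to the second query and the second reference sentence to the first query, while the third query stays unmatched. For realistic sizes, the same assignment problem is usually solved in polynomial time with the Hungarian algorithm, for example via `scipy.optimize.linear_sum_assignment`.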

Similar articles

1. Simulating doctors' thinking logic for chest X-ray report generation via Transformer-based Semantic Query learning.
   Med Image Anal. 2024 Jan;91:102982. doi: 10.1016/j.media.2023.102982. Epub 2023 Sep 29.
2. A label information fused medical image report generation framework.
   Artif Intell Med. 2024 Apr;150:102823. doi: 10.1016/j.artmed.2024.102823. Epub 2024 Feb 22.
3. Translating medical image to radiological report: Adaptive multilevel multi-attention approach.
   Comput Methods Programs Biomed. 2022 Jun;221:106853. doi: 10.1016/j.cmpb.2022.106853. Epub 2022 May 4.
4. Radiology report generation with a learned knowledge base and multi-modal alignment.
   Med Image Anal. 2023 May;86:102798. doi: 10.1016/j.media.2023.102798. Epub 2023 Mar 23.
5. ITransformer: Intra- and Inter-Relation Embedding Transformer for TV Show Captioning.
   IEEE Trans Image Process. 2022;31:3565-3577. doi: 10.1109/TIP.2022.3159472. Epub 2022 May 26.
6. Memory Guided Transformer With Spatio-Semantic Visual Extractor for Medical Report Generation.
   IEEE J Biomed Health Inform. 2024 May;28(5):3079-3089. doi: 10.1109/JBHI.2024.3371894. Epub 2024 May 6.
7. Label correlation transformer for automated chest X-ray diagnosis with reliable interpretability.
   Radiol Med. 2023 Jun;128(6):726-733. doi: 10.1007/s11547-023-01647-0. Epub 2023 May 26.
8. Improving chest X-ray report generation by leveraging warm starting.
   Artif Intell Med. 2023 Oct;144:102633. doi: 10.1016/j.artmed.2023.102633. Epub 2023 Aug 19.
9. Semantic-Powered Explainable Model-Free Few-Shot Learning Scheme of Diagnosing COVID-19 on Chest X-Ray.
   IEEE J Biomed Health Inform. 2022 Dec;26(12):5870-5882. doi: 10.1109/JBHI.2022.3205167. Epub 2022 Dec 7.
10. A disentangled generative model for disease decomposition in chest X-rays via normal image synthesis.
    Med Image Anal. 2021 Jan;67:101839. doi: 10.1016/j.media.2020.101839. Epub 2020 Oct 7.

Cited by

1. Advancements in Radiology Report Generation: A Comprehensive Analysis.
   Bioengineering (Basel). 2025 Jun 25;12(7):693. doi: 10.3390/bioengineering12070693.