用于自动生成放射学报告的标签知识引导变压器

Label knowledge guided transformer for automatic radiology report generation.

作者信息

Wang Rui, Liang Jianguo

机构信息

College of Computer, Qufu Normal University, Rizhao, 276800, Shandong, China.

出版信息

Comput Methods Programs Biomed. 2025 Sep;269:108877. doi: 10.1016/j.cmpb.2025.108877. Epub 2025 May 23.

DOI:10.1016/j.cmpb.2025.108877

PMID:40449180

Abstract

BACKGROUND AND OBJECTIVE

The task of automatically generating radiology reports is a key research area at the intersection of computer science and medicine, aiming to enable computers to generate corresponding reports on the basis of radiology images. This field currently faces a significant data bias issue, which causes words describing diseases to be overshadowed by words describing normal regions in the reports.

METHODS

To address this, we propose the label knowledge guided transformer model for generating radiology reports. Specifically, our model incorporates a Multi Feature Extraction module and a Dual-branch Collaborative Attention module. The Multi Feature Extraction module leverages medical knowledge graphs and feature clustering algorithms to optimize the label feature extraction process from both the prediction and encoding of label information, making it the first module specifically designed to reduce redundant label features. The Dual-branch Collaborative Attention module uses two parallel attention mechanisms to simultaneously compute visual features and label features, and prevents the direct integration of label features into visual features, thereby effectively balancing the model's attention between label features and visual features.

RESULTS

We conduct experimental tests using the IU X-Ray and MIMIC-CXR datasets under six natural language generation evaluation metrics and analyze the results. Experimental results demonstrate that our model achieves state-of-the-art (SOTA) performance. Compared with the baseline models, the label knowledge guided transformer achieves an average improvement of 23.3% on the IU X-Ray dataset and 20.7% on the MIMIC-CXR dataset.

CONCLUSION

Our model has strong capabilities in capturing abnormal features, effectively mitigating the adverse effects caused by data bias, and demonstrates significant potential to enhance the quality and accuracy of automatically generated radiology reports.

摘要

背景与目的

自动生成放射学报告的任务是计算机科学与医学交叉领域的一个关键研究方向，旨在使计算机能够基于放射学图像生成相应报告。该领域目前面临严重的数据偏差问题，这导致报告中描述疾病的词汇被描述正常区域的词汇所掩盖。

方法

为解决此问题，我们提出了用于生成放射学报告的标签知识引导变压器模型。具体而言，我们的模型包含一个多特征提取模块和一个双分支协同注意力模块。多特征提取模块利用医学知识图谱和特征聚类算法，从标签信息的预测和编码两方面优化标签特征提取过程，使其成为首个专门设计用于减少冗余标签特征的模块。双分支协同注意力模块使用两个并行的注意力机制同时计算视觉特征和标签特征，并防止将标签特征直接整合到视觉特征中，从而有效平衡模型在标签特征和视觉特征之间的注意力。

结果

我们使用IU X-Ray和MIMIC-CXR数据集在六种自然语言生成评估指标下进行了实验测试并分析结果。实验结果表明，我们的模型达到了当前最优（SOTA）性能。与基线模型相比，标签知识引导变压器在IU X-Ray数据集上平均提高了23.3%，在MIMIC-CXR数据集上提高了20.7%。

结论

我们的模型在捕捉异常特征方面具有强大能力，有效减轻了数据偏差带来的不利影响，并展现出显著潜力来提高自动生成放射学报告的质量和准确性。

相似文献

Label knowledge guided transformer for automatic radiology report generation.用于自动生成放射学报告的标签知识引导变压器

Comput Methods Programs Biomed. 2025 Sep;269:108877. doi: 10.1016/j.cmpb.2025.108877. Epub 2025 May 23.

Radiology report generation using automatic keyword adaptation, frequency-based multi-label classification and text-to-text large language models.使用自动关键词适配、基于频率的多标签分类和文本到文本的大语言模型生成放射学报告。

Comput Biol Med. 2025 Jul 3;196(Pt A):110625. doi: 10.1016/j.compbiomed.2025.110625.

[CRAKUT:integrating contrastive regional attention and clinical prior knowledge in U-transformer for radiology report generation].[CRAKUT：在用于放射学报告生成的U型变压器中整合对比区域注意力和临床先验知识]

Nan Fang Yi Ke Da Xue Xue Bao. 2025 Jun 20;45(6):1343-1352. doi: 10.12122/j.issn.1673-4254.2025.06.24.

Short-Term Memory Impairment短期记忆障碍

A deep learning approach to direct immunofluorescence pattern recognition in autoimmune bullous diseases.深度学习方法在自身免疫性大疱性疾病中的直接免疫荧光模式识别。

Br J Dermatol. 2024 Jul 16;191(2):261-266. doi: 10.1093/bjd/ljae142.

TLTNet: A novel transscale cascade layered transformer network for enhanced retinal blood vessel segmentation.TLTNet：一种新颖的跨尺度级联分层Transformer 网络，用于增强视网膜血管分割。

Comput Biol Med. 2024 Aug;178:108773. doi: 10.1016/j.compbiomed.2024.108773. Epub 2024 Jun 25.

DASNet a dual branch multi level attention sheep counting network.DASNet是一种双分支多级注意力羊只计数网络。

Sci Rep. 2025 Jul 2;15(1):23228. doi: 10.1038/s41598-025-97929-w.

Are Artificial Intelligence Models Listening Like Cardiologists? Bridging the Gap Between Artificial Intelligence and Clinical Reasoning in Heart-Sound Classification Using Explainable Artificial Intelligence.人工智能模型能像心脏病专家一样“聆听”吗？利用可解释人工智能弥合人工智能与心音分类临床推理之间的差距。

Bioengineering (Basel). 2025 May 22;12(6):558. doi: 10.3390/bioengineering12060558.

Knowledge Graph-Based Few-Shot Learning for Label of Medical Imaging Reports.基于知识图谱的医学影像报告标签少样本学习

Acad Radiol. 2025 Jul;32(7):4206-4220. doi: 10.1016/j.acra.2025.02.045. Epub 2025 Mar 25.

Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

用于自动生成放射学报告的标签知识引导变压器

Label knowledge guided transformer for automatic radiology report generation.

作者信息

机构信息

出版信息

BACKGROUND AND OBJECTIVE

METHODS

RESULTS

CONCLUSION

背景与目的

方法

结果

结论

相似文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献