• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

医学-VLBERT:用于通过交替学习生成COVID-19 CT报告的医学视觉语言BERT

Medical-VLBERT: Medical Visual Language BERT for COVID-19 CT Report Generation With Alternate Learning.

作者信息

Liu Guangyi, Liao Yinghong, Wang Fuyu, Zhang Bin, Zhang Lu, Liang Xiaodan, Wan Xiang, Li Shaolin, Li Zhen, Zhang Shuixing, Cui Shuguang

出版信息

IEEE Trans Neural Netw Learn Syst. 2021 Sep;32(9):3786-3797. doi: 10.1109/TNNLS.2021.3099165. Epub 2021 Aug 31.

DOI:10.1109/TNNLS.2021.3099165
PMID:34370672
Abstract

Medical imaging technologies, including computed tomography (CT) or chest X-Ray (CXR), are largely employed to facilitate the diagnosis of the COVID-19. Since manual report writing is usually too time-consuming, a more intelligent auxiliary medical system that could generate medical reports automatically and immediately is urgently needed. In this article, we propose to use the medical visual language BERT (Medical-VLBERT) model to identify the abnormality on the COVID-19 scans and generate the medical report automatically based on the detected lesion regions. To produce more accurate medical reports and minimize the visual-and-linguistic differences, this model adopts an alternate learning strategy with two procedures that are knowledge pretraining and transferring. To be more precise, the knowledge pretraining procedure is to memorize the knowledge from medical texts, while the transferring procedure is to utilize the acquired knowledge for professional medical sentences generations through observations of medical images. In practice, for automatic medical report generation on the COVID-19 cases, we constructed a dataset of 368 medical findings in Chinese and 1104 chest CT scans from The First Affiliated Hospital of Jinan University, Guangzhou, China, and The Fifth Affiliated Hospital of Sun Yat-sen University, Zhuhai, China. Besides, to alleviate the insufficiency of the COVID-19 training samples, our model was first trained on the large-scale Chinese CX-CHR dataset and then transferred to the COVID-19 CT dataset for further fine-tuning. The experimental results showed that Medical-VLBERT achieved state-of-the-art performances on terminology prediction and report generation with the Chinese COVID-19 CT dataset and the CX-CHR dataset. The Chinese COVID-19 CT dataset is available at https://covid19ct.github.io/.

摘要

包括计算机断层扫描(CT)或胸部X光(CXR)在内的医学成像技术在很大程度上被用于辅助诊断新型冠状病毒肺炎(COVID-19)。由于人工撰写报告通常耗时过长,因此迫切需要一个更智能的辅助医疗系统,能够自动且即时地生成医学报告。在本文中,我们建议使用医学视觉语言BERT(Medical-VLBERT)模型来识别COVID-19扫描图像上的异常情况,并根据检测到的病变区域自动生成医学报告。为了生成更准确的医学报告并尽量减少视觉与语言之间的差异,该模型采用了一种交替学习策略,包括知识预训练和迁移两个过程。更确切地说,知识预训练过程是从医学文本中记忆知识,而迁移过程则是通过观察医学图像,利用所获取的知识生成专业医学语句。在实践中,为了针对COVID-19病例自动生成医学报告,我们构建了一个数据集,其中包含来自中国广州暨南大学附属第一医院和中国珠海中山大学附属第五医院的368份中文医学检查结果和1104份胸部CT扫描。此外,为了缓解COVID-19训练样本不足的问题,我们的模型首先在大规模中文CX-CHR数据集上进行训练,然后转移到COVID-19 CT数据集上进行进一步微调。实验结果表明,Medical-VLBERT在中国COVID-19 CT数据集和CX-CHR数据集上的术语预测和报告生成方面达到了先进水平。中国COVID-19 CT数据集可在https://covid19ct.github.io/获取。

相似文献

1
Medical-VLBERT: Medical Visual Language BERT for COVID-19 CT Report Generation With Alternate Learning.医学-VLBERT:用于通过交替学习生成COVID-19 CT报告的医学视觉语言BERT
IEEE Trans Neural Netw Learn Syst. 2021 Sep;32(9):3786-3797. doi: 10.1109/TNNLS.2021.3099165. Epub 2021 Aug 31.
2
Deep learning-based artificial intelligence model to assist thyroid nodule diagnosis and management: a multicentre diagnostic study.基于深度学习的人工智能模型辅助甲状腺结节诊断和管理:一项多中心诊断研究。
Lancet Digit Health. 2021 Apr;3(4):e250-e259. doi: 10.1016/S2589-7500(21)00041-8.
3
Deep learning-based triage and analysis of lesion burden for COVID-19: a retrospective study with external validation.基于深度学习的 COVID-19 病变负担分类和分析:一项具有外部验证的回顾性研究。
Lancet Digit Health. 2020 Oct;2(10):e506-e515. doi: 10.1016/S2589-7500(20)30199-0. Epub 2020 Sep 22.
4
Accurate diagnosis of COVID-19 from lung CT images using transfer learning.基于迁移学习的肺部 CT 图像对 COVID-19 的准确诊断。
Eur Rev Med Pharmacol Sci. 2024 Feb;28(3):1213-1226. doi: 10.26355/eurrev_202402_35360.
5
Federated semi-supervised learning for COVID region segmentation in chest CT using multi-national data from China, Italy, Japan.使用来自中国、意大利和日本的多国数据,通过联邦半监督学习方法对胸部 CT 中的 COVID 区域进行分割。
Med Image Anal. 2021 May;70:101992. doi: 10.1016/j.media.2021.101992. Epub 2021 Feb 6.
6
Boosting automatic COVID-19 detection performance with self-supervised learning and batch knowledge ensembling.基于自监督学习和批量知识集成提升 COVID-19 自动检测性能
Comput Biol Med. 2023 May;158:106877. doi: 10.1016/j.compbiomed.2023.106877. Epub 2023 Mar 31.
7
JCS: An Explainable COVID-19 Diagnosis System by Joint Classification and Segmentation.JCS:基于联合分类与分割的 COVID-19 可解释诊断系统。
IEEE Trans Image Process. 2021;30:3113-3126. doi: 10.1109/TIP.2021.3058783. Epub 2021 Feb 24.
8
Automated Screening of COVID-19-Based Tongue Image on Chinese Medicine.基于中医的 COVID-19 舌象自动筛查。
Biomed Res Int. 2022 Jun 23;2022:6825576. doi: 10.1155/2022/6825576. eCollection 2022.
9
Toward data-efficient learning: A benchmark for COVID-19 CT lung and infection segmentation.迈向数据高效学习:COVID-19 CT 肺和感染分割的基准。
Med Phys. 2021 Mar;48(3):1197-1210. doi: 10.1002/mp.14676. Epub 2021 Feb 6.
10
Computer aid screening of COVID-19 using X-ray and CT scan images: An inner comparison.基于 X 射线和 CT 扫描图像的 COVID-19 计算机辅助筛查:内部比较。
J Xray Sci Technol. 2021;29(2):197-210. doi: 10.3233/XST-200784.

引用本文的文献

1
Advancements in Medical Radiology Through Multimodal Machine Learning: A Comprehensive Overview.通过多模态机器学习实现医学放射学的进展:全面概述
Bioengineering (Basel). 2025 Apr 30;12(5):477. doi: 10.3390/bioengineering12050477.
2
A multimodal multidomain multilingual medical foundation model for zero shot clinical diagnosis.一种用于零样本临床诊断的多模态、多领域、多语言医学基础模型。
NPJ Digit Med. 2025 Feb 6;8(1):86. doi: 10.1038/s41746-024-01339-7.
3
A medical multimodal large language model for future pandemics.用于应对未来大流行的医学多模态大语言模型。
NPJ Digit Med. 2023 Dec 2;6(1):226. doi: 10.1038/s41746-023-00952-2.
4
A scoping review on multimodal deep learning in biomedical images and texts.多模态深度学习在生物医学图像和文本中的应用综述
J Biomed Inform. 2023 Oct;146:104482. doi: 10.1016/j.jbi.2023.104482. Epub 2023 Aug 29.
5
A survey on automatic generation of medical imaging reports based on deep learning.基于深度学习的医学影像报告自动生成研究综述。
Biomed Eng Online. 2023 May 18;22(1):48. doi: 10.1186/s12938-023-01113-y.
6
Survey of Explainable AI Techniques in Healthcare.医疗保健领域可解释人工智能技术调查。
Sensors (Basel). 2023 Jan 5;23(2):634. doi: 10.3390/s23020634.
7
Natural language processing: using artificial intelligence to understand human language in orthopedics.自然语言处理:利用人工智能理解骨科领域的人类语言。
Knee Surg Sports Traumatol Arthrosc. 2023 Apr;31(4):1203-1211. doi: 10.1007/s00167-022-07272-0. Epub 2022 Dec 7.
8
Deep learning based model for classification of COVID -19 images for healthcare research progress.基于深度学习的模型用于COVID-19图像分类以推动医疗保健研究进展。
Mater Today Proc. 2022;62:5008-5012. doi: 10.1016/j.matpr.2022.04.884. Epub 2022 May 18.