Vision-language model for report generation and outcome prediction in CT pulmonary angiogram.

Authors

Zhong Zhusi, Wang Yuli, Wu Jing, Hsu Wen-Chi, Somasundaram Vin, Bi Lulu, Kulkarni Shreyas, Ma Zhuoqi, Collins Scott, Baird Grayson, Ahn Sun Ho, Feng Xue, Kamel Ihab, Lin Cheng Ting, Greineder Colin, Atalay Michael, Jiao Zhicheng, Bai Harrison

Affiliations

Department of Diagnostic Imaging, Brown University Health, Providence, RI, USA.

Warren Alpert Medical School of Brown University, Providence, RI, USA.

Publication information

NPJ Digit Med. 2025 Jul 12;8(1):432. doi: 10.1038/s41746-025-01807-8.

DOI:10.1038/s41746-025-01807-8
PMID:40652098
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12255762/
Abstract

Accurate and comprehensive interpretation of pulmonary embolism (PE) from Computed Tomography Pulmonary Angiography (CTPA) scans remains a clinical challenge due to the limited specificity and structure of existing AI tools. We propose an agent-based framework that integrates Vision-Language Models (VLMs) for detecting 32 PE-related abnormalities and Large Language Models (LLMs) for structured report generation. Trained on over 69,000 CTPA studies from 24,890 patients across Brown University Health (BUH), Johns Hopkins University (JHU), and the INSPECT dataset from Stanford, the model demonstrates strong performance in abnormality classification and report generation. For abnormality classification, it achieved AUROC scores of 0.788 (BUH), 0.754 (INSPECT), and 0.710 (JHU), with corresponding BERT-F1 scores of 0.891, 0.829, and 0.842. The abnormality-guided reporting strategy consistently outperformed the organ-based and holistic captioning baselines. For survival prediction, a multimodal fusion model that incorporates imaging, clinical variables, diagnostic outputs, and generated reports achieved concordance indices of 0.863 (BUH) and 0.731 (JHU), outperforming traditional PESI scores. This framework provides a clinically meaningful and interpretable solution for end-to-end PE diagnosis, structured reporting, and outcome prediction.
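
The survival results above are reported as concordance indices. As a rough illustration of what that metric measures, the sketch below computes Harrell's C-index in plain Python. It is not the authors' implementation: the function name, the handling of tied follow-up times (such pairs are simply skipped), and the toy data are all assumptions made for this example.

```python
from itertools import combinations


def concordance_index(times, events, risks):
    """Harrell's C-index: share of comparable pairs ranked correctly by risk.

    times  -- follow-up time per patient
    events -- 1 if the event (e.g. death) was observed, 0 if censored
    risks  -- predicted risk score (higher means earlier expected event)
    """
    concordant = 0.0
    comparable = 0
    for i, j in combinations(range(len(times)), 2):
        # Let a be the patient with the shorter follow-up time.
        a, b = (i, j) if times[i] < times[j] else (j, i)
        # Simplifying assumption: skip tied times; also skip pairs where the
        # earlier patient was censored, since their true order is unknown.
        if times[a] == times[b] or not events[a]:
            continue
        comparable += 1
        if risks[a] > risks[b]:
            concordant += 1.0   # earlier event had the higher predicted risk
        elif risks[a] == risks[b]:
            concordant += 0.5   # tied predictions count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Hypothetical toy cohort: four patients, the third one censored.
toy_times = [5, 10, 15, 20]
toy_events = [1, 1, 0, 1]
toy_risks = [0.9, 0.2, 0.4, 0.1]
toy_c_index = concordance_index(toy_times, toy_events, toy_risks)
```

A value of 0.5 corresponds to random ranking and 1.0 to perfect ranking, which is the scale on which the reported 0.863 (BUH) and 0.731 (JHU) should be read.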

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/000d/12255762/6993ec4b3152/41746_2025_1807_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/000d/12255762/0122d2e543e0/41746_2025_1807_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/000d/12255762/f511c352b15d/41746_2025_1807_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/000d/12255762/404682c88864/41746_2025_1807_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/000d/12255762/993880091d01/41746_2025_1807_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/000d/12255762/f4995334bd26/41746_2025_1807_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/000d/12255762/92596be99f8d/41746_2025_1807_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/000d/12255762/acfa6d81541d/41746_2025_1807_Fig8_HTML.jpg
