• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

EchoLLM:使用轻量级、开源大语言模型提取超声心动图实体。

EchoLLM: extracting echocardiogram entities with light-weight, open-source large language models.

作者信息

Chi Jonathan, Rouphail Yazan, Hillis Ethan, Ma Ningning, Nguyen An, Wang Jane, Hofford Mackenzie, Gupta Aditi, Lyons Patrick G, Wilcox Adam, Lai Albert M, Payne Philip R O, Kollef Marin H, Dreisbach Caitlin, Michelson Andrew P

机构信息

Goergen Institute for Data Science and Artificial Intelligence, University of Rochester, Rochester, NY 14627, United States.

Department of Medicine, Institute for Informatics, Data Science and Biostatistics, Washington University in St. Louis, St. Louis, MO 63110, United States.

出版信息

JAMIA Open. 2025 Aug 13;8(4):ooaf092. doi: 10.1093/jamiaopen/ooaf092. eCollection 2025 Aug.

DOI:10.1093/jamiaopen/ooaf092
PMID:40809469
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12349756/
Abstract

OBJECTIVES

Large language models (LLMs) have demonstrated high levels of performance in clinical information extraction compared to rule-based systems and traditional machine-learning approaches, offering scalability, contextualization, and easier deployment. However, most studies rely on proprietary models with privacy concerns and high costs, limiting accessibility. We aim to evaluate 14 publicly available open-source LLMs for extracting clinically relevant findings from free-text echocardiogram reports and examine the feasibility of their implementation in information extraction workflows.

MATERIALS AND METHODS

We used 14 open-source LLM models to extract clinically relevant entities from echocardiogram reports ( = 507). Each report was manually annotated by 2 independent health-care professionals and adjudicated by a third. Lexical variance and length of each echocardiogram report were collected. Precision, recall, and F1 scores were calculated for the 9 extracted entities via multiclass classification.

RESULTS

In aggregate, Gemma2:9b-instruct had the highest precision, recall, and F1 scores at 0.973 (0.962-0.983), 0.959 (0.947-0.973), and 0.965 (0.951-0.975), respectively. In comparison, Phi3:3.8b-mini-instruct had the lowest precision score at 0.831 (0.804-0.856), while Gemma:7b-instruct had the lowest recall and F1 scores at 0.382 (0.356-0.408) and 0.392 (0.356-0.428), respectively.

DISCUSSION AND CONCLUSION

Using LLMs for entity extraction for echocardiogram reports has the potential to support both clinical research and health-care delivery. Our work demonstrates the feasibility of using open-source models for more efficient computation and extraction.

摘要

目的

与基于规则的系统和传统机器学习方法相比,大语言模型(LLMs)在临床信息提取方面表现出了很高的性能,具有可扩展性、上下文感知能力且易于部署。然而,大多数研究依赖于存在隐私问题和高成本的专有模型,限制了其可及性。我们旨在评估14个公开可用的开源大语言模型,用于从自由文本超声心动图报告中提取临床相关发现,并检验其在信息提取工作流程中实施的可行性。

材料与方法

我们使用14个开源大语言模型从超声心动图报告(n = 507)中提取临床相关实体。每份报告由2名独立的医疗保健专业人员进行人工标注,并由第三名人员进行裁决。收集了每份超声心动图报告的词汇差异和长度。通过多类分类计算9个提取实体的精确率、召回率和F1分数。

结果

总体而言,Gemma2:9b-instruct的精确率、召回率和F1分数最高,分别为0.973(0.962 - 0.983)、0.959(0.947 - 0.973)和0.965(0.951 - 0.975)。相比之下,Phi3:3.8b-mini-instruct的精确率得分最低,为0.831(0.804 - 0.856),而Gemma:7b-instruct的召回率和F1分数最低,分别为0.382(0.356 - 0.408)和0.392(0.356 - 0.428)。

讨论与结论

使用大语言模型进行超声心动图报告的实体提取有潜力支持临床研究和医疗服务。我们的工作证明了使用开源模型进行更高效计算和提取的可行性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64e6/12349756/54ceed21b1cb/ooaf092f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64e6/12349756/30bdfc4fe6e9/ooaf092f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64e6/12349756/54ceed21b1cb/ooaf092f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64e6/12349756/30bdfc4fe6e9/ooaf092f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64e6/12349756/54ceed21b1cb/ooaf092f2.jpg

相似文献

1
EchoLLM: extracting echocardiogram entities with light-weight, open-source large language models.EchoLLM:使用轻量级、开源大语言模型提取超声心动图实体。
JAMIA Open. 2025 Aug 13;8(4):ooaf092. doi: 10.1093/jamiaopen/ooaf092. eCollection 2025 Aug.
2
Harnessing Moderate-Sized Language Models for Reliable Patient Data Deidentification in Emergency Department Records: Algorithm Development, Validation, and Implementation Study.利用中等规模语言模型对急诊科记录中的患者数据进行可靠去识别:算法开发、验证与实施研究。
JMIR AI. 2025 Apr 1;4:e57828. doi: 10.2196/57828.
3
Utilizing large language models for detecting hospital-acquired conditions: an empirical study on pulmonary embolism.利用大语言模型检测医院获得性疾病:关于肺栓塞的实证研究
J Am Med Inform Assoc. 2025 May 1;32(5):876-884. doi: 10.1093/jamia/ocaf048.
4
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
5
From BERT to generative AI - Comparing encoder-only vs. large language models in a cohort of lung cancer patients for named entity recognition in unstructured medical reports.从BERT到生成式人工智能——在一组肺癌患者中比较仅编码器模型与大语言模型用于非结构化医疗报告中的命名实体识别
Comput Biol Med. 2025 Sep;195:110665. doi: 10.1016/j.compbiomed.2025.110665. Epub 2025 Jun 24.
6
A dataset and benchmark for hospital course summarization with adapted large language models.一个用于医院病程总结的数据集和基准测试,采用了适配的大语言模型。
J Am Med Inform Assoc. 2025 Mar 1;32(3):470-479. doi: 10.1093/jamia/ocae312.
7
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
8
Large Language Model Symptom Identification From Clinical Text: Multicenter Study.基于临床文本的大语言模型症状识别:多中心研究。
J Med Internet Res. 2025 Jul 31;27:e72984. doi: 10.2196/72984.
9
Language Models for Multilabel Document Classification of Surgical Concepts in Exploratory Laparotomy Operative Notes: Algorithm Development Study.用于探索性剖腹手术记录中手术概念多标签文档分类的语言模型:算法开发研究
JMIR Med Inform. 2025 Jul 9;13:e71176. doi: 10.2196/71176.
10
Algorithmic Classification of Psychiatric Disorder-Related Spontaneous Communication Using Large Language Model Embeddings: Algorithm Development and Validation.使用大语言模型嵌入对精神障碍相关自发交流进行算法分类:算法开发与验证
JMIR AI. 2025 May 30;4:e67369. doi: 10.2196/67369.

本文引用的文献

1
Comparing Commercial and Open-Source Large Language Models for Labeling Chest Radiograph Reports.比较商用和开源大语言模型在标注胸部 X 光报告中的表现。
Radiology. 2024 Oct;313(1):e241139. doi: 10.1148/radiol.241139.
2
A critical assessment of using ChatGPT for extracting structured data from clinical notes.对使用ChatGPT从临床记录中提取结构化数据的批判性评估。
NPJ Digit Med. 2024 May 1;7(1):106. doi: 10.1038/s41746-024-01079-8.
3
Zero-shot information extraction from radiological reports using ChatGPT.使用 ChatGPT 从放射报告中进行零样本信息提取。
Int J Med Inform. 2024 Mar;183:105321. doi: 10.1016/j.ijmedinf.2023.105321. Epub 2023 Dec 21.
4
Challenges and best practices for digital unstructured data enrichment in health research: A systematic narrative review.健康研究中数字非结构化数据充实的挑战与最佳实践:一项系统性叙述性综述
PLOS Digit Health. 2023 Oct 11;2(10):e0000347. doi: 10.1371/journal.pdig.0000347. eCollection 2023 Oct.
5
A general text mining method to extract echocardiography measurement results from echocardiography documents.一种从超声心动图文档中提取超声心动图测量结果的通用文本挖掘方法。
Artif Intell Med. 2023 Sep;143:102584. doi: 10.1016/j.artmed.2023.102584. Epub 2023 May 20.
6
Leveraging GPT-4 for Post Hoc Transformation of Free-text Radiology Reports into Structured Reporting: A Multilingual Feasibility Study.利用GPT-4将自由文本放射学报告进行事后转换为结构化报告:一项多语言可行性研究。
Radiology. 2023 May;307(4):e230725. doi: 10.1148/radiol.230725. Epub 2023 Apr 4.
7
The Evolving Use of Electronic Health Records (EHR) for Research.电子健康记录(EHR)在研究中的应用不断发展。
Semin Radiat Oncol. 2019 Oct;29(4):354-361. doi: 10.1016/j.semradonc.2019.05.010.
8
Opportunities and challenges in developing deep learning models using electronic health records data: a systematic review.利用电子健康记录数据开发深度学习模型的机遇与挑战:系统综述。
J Am Med Inform Assoc. 2018 Oct 1;25(10):1419-1428. doi: 10.1093/jamia/ocy068.
9
Extracting Healthcare Quality Information from Unstructured Data.从非结构化数据中提取医疗质量信息。
AMIA Annu Symp Proc. 2018 Apr 16;2017:1243-1252. eCollection 2017.
10
Natural language processing systems for capturing and standardizing unstructured clinical information: A systematic review.用于捕获和标准化非结构化临床信息的自然语言处理系统:一项系统综述。
J Biomed Inform. 2017 Sep;73:14-29. doi: 10.1016/j.jbi.2017.07.012. Epub 2017 Jul 17.