• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用顺序部署的大语言模型将非结构化心血管诊断报告自动转换为结构化数据集

Automated Transformation of Unstructured Cardiovascular Diagnostic Reports into Structured Datasets Using Sequentially Deployed Large Language Models.

作者信息

Shankar Sumukh Vasisht, Dhingra Lovedeep S, Aminorroaya Arya, Adejumo Philip, Nadkarni Girish N, Xu Hua, Brandt Cynthia, Oikonomou Evangelos K, Pedroso Aline F, Khera Rohan

机构信息

Section of Cardiovascular Medicine, Department of Internal Medicine, Yale School of Medicine, New Haven, CT, USA.

The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA.

出版信息

medRxiv. 2024 Oct 8:2024.10.08.24315035. doi: 10.1101/2024.10.08.24315035.

DOI:10.1101/2024.10.08.24315035
PMID:39417094
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11482995/
Abstract

BACKGROUND

Rich data in cardiovascular diagnostic testing are often sequestered in unstructured reports, with the necessity of manual abstraction limiting their use in real-time applications in patient care and research.

METHODS

We developed a two-step process that sequentially deploys generative and interpretative large language models (LLMs; Llama2 70b and Llama2 13b). Using a Llama2 70b model, we generated varying formats of transthoracic echocardiogram (TTE) reports from 3,000 real-world echo reports with paired structured elements, leveraging temporal changes in reporting formats to define the variations. Subsequently, we fine-tuned Llama2 13b using sequentially larger batches of generated echo reports as inputs, to extract data from free-text narratives across 18 clinically relevant echocardiographic fields. This was set up as a prompt-based supervised training task. We evaluated the fine-tuned Llama2 13b model, HeartDx-LM, on several distinct echocardiographic datasets: (i) reports across the different time periods and formats at Yale New Haven Health System (YNHHS), (ii) the Medical Information Mart for Intensive Care (MIMIC) III dataset, and (iii) the MIMIC IV dataset. We used the accuracy of extracted fields and Cohen's Kappa as the metrics and have publicly released the HeartDX-LM model.

RESULTS

The HeartDX-LM model was trained on randomly selected 2,000 synthetic echo reports with varying formats and paired structured labels, with a wide range of clinical findings. We identified a lower threshold of 500 annotated reports required for fine-tuning Llama2 13b to achieve stable and consistent performance. At YNHHS, the HeartDx-LM model accurately extracted 69,144 out of 70,032 values (98.7%) across 18 clinical fields from unstructured reports in the test set from contemporary records where paired structured data were also available. In older echo reports where only unstructured reports were available, the model achieved 87.1% accuracy against expert annotations for the same 18 fields for a random sample of 100 reports. Similarly, in expert-annotated external validation sets from MIMIC-IV and MIMIC-III, HeartDx-LM correctly extracted 201 out of 220 available values (91.3%) and 615 out of 707 available values (87.9%), respectively, from 100 randomly chosen and expert annotated echo reports from each set.

CONCLUSION

We developed a novel method using paired large and moderate-sized LLMs to automate the extraction of unstructured echocardiographic reports into tabular datasets. Our approach represents a scalable strategy that transforms unstructured reports into computable elements that can be leveraged to improve cardiovascular care quality and enable research.

摘要

背景

心血管诊断测试中的丰富数据通常被隔离在非结构化报告中,由于需要人工提取,限制了它们在患者护理和研究的实时应用中的使用。

方法

我们开发了一个两步流程,依次部署生成式和解释性大语言模型(LLMs;Llama2 70b和Llama2 13b)。使用Llama2 70b模型,我们从3000份具有配对结构化元素的真实世界超声心动图(TTE)报告中生成了不同格式的报告,利用报告格式的时间变化来定义变化。随后,我们使用依次增加批次的生成式超声心动图报告作为输入对Llama2 13b进行微调,以从18个临床相关超声心动图领域的自由文本叙述中提取数据。这被设置为一个基于提示的监督训练任务。我们在几个不同的超声心动图数据集上评估了经过微调的Llama2 13b模型HeartDx-LM:(i)耶鲁纽黑文医疗系统(YNHHS)不同时间段和格式的报告,(ii)重症监护医学信息集市(MIMIC)III数据集,以及(iii)MIMIC IV数据集。我们使用提取字段的准确性和科恩kappa系数作为指标,并已公开发布HeartDX-LM模型。

结果

HeartDX-LM模型在随机选择的2000份具有不同格式和配对结构化标签的合成超声心动图报告上进行训练,这些报告具有广泛的临床发现。我们确定了将Llama2 13b微调以实现稳定和一致性能所需的较低阈值为500份注释报告。在YNHHS,HeartDx-LM模型从当代记录的测试集中的非结构化报告中准确提取了70032个值中的69144个(98.7%),涉及18个临床领域,其中也有配对的结构化数据。在只有非结构化报告的较旧超声心动图报告中,对于随机抽取的100份报告的相同18个领域,该模型相对于专家注释的准确率达到了87.1%。同样,在来自MIMIC-IV和MIMIC-III的专家注释外部验证集中,HeartDx-LM分别从每组随机选择并经专家注释的100份超声心动图报告中正确提取了220个可用值中的201个(91.3%)和707个可用值中的615个(87.9%)。

结论

我们开发了一种新颖的方法,使用配对的大型和中型LLMs将非结构化超声心动图报告自动提取到表格数据集中。我们的方法代表了一种可扩展的策略,将非结构化报告转化为可计算的元素,可用于提高心血管护理质量并推动研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/120b/11482995/14683f7169a5/nihpp-2024.10.08.24315035v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/120b/11482995/d3d60826ca54/nihpp-2024.10.08.24315035v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/120b/11482995/6d66ab82ebae/nihpp-2024.10.08.24315035v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/120b/11482995/14683f7169a5/nihpp-2024.10.08.24315035v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/120b/11482995/d3d60826ca54/nihpp-2024.10.08.24315035v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/120b/11482995/6d66ab82ebae/nihpp-2024.10.08.24315035v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/120b/11482995/14683f7169a5/nihpp-2024.10.08.24315035v1-f0003.jpg

相似文献

1
Automated Transformation of Unstructured Cardiovascular Diagnostic Reports into Structured Datasets Using Sequentially Deployed Large Language Models.使用顺序部署的大语言模型将非结构化心血管诊断报告自动转换为结构化数据集
medRxiv. 2024 Oct 8:2024.10.08.24315035. doi: 10.1101/2024.10.08.24315035.
2
Automated transformation of unstructured cardiovascular diagnostic reports into structured datasets using sequentially deployed large language models.使用顺序部署的大语言模型将非结构化心血管诊断报告自动转换为结构化数据集。
Eur Heart J Digit Health. 2025 Apr 2;6(4):783-796. doi: 10.1093/ehjdh/ztaf030. eCollection 2025 Jul.
3
A dataset and benchmark for hospital course summarization with adapted large language models.一个用于医院病程总结的数据集和基准测试,采用了适配的大语言模型。
J Am Med Inform Assoc. 2025 Mar 1;32(3):470-479. doi: 10.1093/jamia/ocae312.
4
Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.利用基础模型库进行跨设备肿瘤显微镜检查中的细胞相似性搜索。
Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.
5
Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施:系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。
Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.
6
Magnetic resonance perfusion for differentiating low-grade from high-grade gliomas at first presentation.首次就诊时磁共振灌注成像用于鉴别低级别与高级别胶质瘤
Cochrane Database Syst Rev. 2018 Jan 22;1(1):CD011551. doi: 10.1002/14651858.CD011551.pub2.
7
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
8
Regional cerebral blood flow single photon emission computed tomography for detection of Frontotemporal dementia in people with suspected dementia.用于检测疑似痴呆患者额颞叶痴呆的局部脑血流单光子发射计算机断层扫描
Cochrane Database Syst Rev. 2015 Jun 23;2015(6):CD010896. doi: 10.1002/14651858.CD010896.pub2.
9
A New Measure of Quantified Social Health Is Associated With Levels of Discomfort, Capability, and Mental and General Health Among Patients Seeking Musculoskeletal Specialty Care.一种新的量化社会健康指标与寻求肌肉骨骼专科护理的患者的不适程度、能力以及心理和总体健康水平相关。
Clin Orthop Relat Res. 2025 Apr 1;483(4):647-663. doi: 10.1097/CORR.0000000000003394. Epub 2025 Feb 5.
10
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

引用本文的文献

1
From Mutation to Prognosis: AI-HOPE-PI3K Enables Artificial Intelligence Agent-Driven Integration of PI3K Pathway Data in Colorectal Cancer Precision Medicine.从突变到预后:AI-HOPE-PI3K助力人工智能驱动的PI3K通路数据在结直肠癌精准医学中的整合
Int J Mol Sci. 2025 Jul 5;26(13):6487. doi: 10.3390/ijms26136487.

本文引用的文献

1
Automated Identification of Heart Failure With Reduced Ejection Fraction Using Deep Learning-Based Natural Language Processing.使用基于深度学习的自然语言处理自动识别射血分数降低的心力衰竭
JACC Heart Fail. 2025 Jan;13(1):75-87. doi: 10.1016/j.jchf.2024.08.012. Epub 2024 Oct 23.
2
Real-world evaluation of an algorithmic machine-learning-guided testing approach in stable chest pain: a multinational, multicohort study.稳定型胸痛中算法机器学习引导检测方法的真实世界评估:一项跨国多队列研究
Eur Heart J Digit Health. 2024 Apr 8;5(3):303-313. doi: 10.1093/ehjdh/ztae023. eCollection 2024 May.
3
A Multimodal Video-Based AI Biomarker for Aortic Stenosis Development and Progression.
基于多模态视频的 AI 生物标志物用于主动脉瓣狭窄的发生和进展。
JAMA Cardiol. 2024 Jun 1;9(6):534-544. doi: 10.1001/jamacardio.2024.0595.
4
Creation and Adoption of Large Language Models in Medicine.医学领域中大型语言模型的创建与采用。
JAMA. 2023 Sep 5;330(9):866-869. doi: 10.1001/jama.2023.14217.
5
Cardiovascular Care Innovation through Data-Driven Discoveries in the Electronic Health Record.电子健康记录中的数据驱动发现推动心血管护理创新。
Am J Cardiol. 2023 Sep 15;203:136-148. doi: 10.1016/j.amjcard.2023.06.104. Epub 2023 Jul 25.
6
Detection of Left Ventricular Systolic Dysfunction From Electrocardiographic Images.从心电图图像中检测左心室收缩功能障碍。
Circulation. 2023 Aug 29;148(9):765-777. doi: 10.1161/CIRCULATIONAHA.122.062646. Epub 2023 Jul 25.
7
Detection of left ventricular systolic dysfunction from single-lead electrocardiography adapted for portable and wearable devices.通过适用于便携式和可穿戴设备的单导联心电图检测左心室收缩功能障碍。
NPJ Digit Med. 2023 Jul 11;6(1):124. doi: 10.1038/s41746-023-00869-w.
8
MIMIC-IV, a freely accessible electronic health record dataset.MIMIC-IV,一个可自由访问的电子健康记录数据集。
Sci Data. 2023 Jan 3;10(1):1. doi: 10.1038/s41597-022-01899-x.
9
Information Extraction From Electronic Health Records to Predict Readmission Following Acute Myocardial Infarction: Does Natural Language Processing Using Clinical Notes Improve Prediction of Readmission?从电子健康记录中提取信息以预测急性心肌梗死患者的再入院情况:使用临床记录进行自然语言处理是否能提高再入院预测的准确性?
J Am Heart Assoc. 2022 Apr 5;11(7):e024198. doi: 10.1161/JAHA.121.024198. Epub 2022 Mar 24.
10
A systematic review of natural language processing applied to radiology reports.自然语言处理在放射学报告中的应用的系统评价。
BMC Med Inform Decis Mak. 2021 Jun 3;21(1):179. doi: 10.1186/s12911-021-01533-7.