使用GPT-4o进行神经血管会诊的信息提取与总结：一项临床案例研究。

Information Extraction and Summarization for Neurovascular Consultations with GPT-4o: A Clinical Case Study.

作者信息

Indrakanti Ashraya Kumar, Heierle Julian Elias, Münger Hannah, Koch Alma Teresa, Kaiser Philippe, Bach Michael, Fiehler Jens, Tsogkas Ioannis, Guzman Raphael, Mutke Matthias Anthony, Psychogios Marios

机构信息

Department of Diagnostic and Interventional Neuroradiology, Clinic of Radiology and Nuclear Medicine, University Hospital Basel, Petersgraben 4, 4031, Basel, Switzerland.

Clinic of Radiology and Nuclear Medicine, University Hospital Basel, Petersgraben 4, 4031, Basel, Switzerland.

出版信息

Clin Neuroradiol. 2025 Jul 31. doi: 10.1007/s00062-025-01538-z.

DOI:10.1007/s00062-025-01538-z

PMID:40742451

Abstract

PURPOSE

In outpatient settings, extensive patient records must frequently be reviewed under time constraints, making efficient extraction and summarization of key clinical information essential. Large language models (LLMs) are potentially useful for this task but require validation for clinical reliability. This study assesses OpenAI's GPT-4o for generating structured summaries to assist in neurovascular consultation preparation, aiming to increase efficiency by automating critical data extraction.

METHODS

A prospective study was conducted from May to August 2024 at a tertiary care hospital, involving a total of 70 patients. Structured summaries were generated by GPT-4o using a predefined template. Extracted data were categorized into aneurysm-specific details, imaging summaries, and patient-specific clinical factors. Accuracy and completeness were assessed by clinicians, with performance measured using precision, recall, specificity, and accuracy.

RESULTS

High accuracy (≥ 0.96) was measured across most categories. In aneurysm-and patient-specific data, extraction performance varied based on stability over time. Aneurysm location and other stable details were extracted consistently, while changes in aneurysm size and medication lists showed variations. In rare cases, aneurysm details were misattributed to a different aneurysm within the same patient. Imaging summaries were generally concise and clinically useful, though their effectiveness declined when summarizing multiple prior studies.

CONCLUSION

Neurovascular patient data was effectively structured by GPT-4o, demonstrating high accuracy with minimal errors. While occasional misattributions like outdated information were observed, reliable citation of sources facilitated easy verification. These findings support integrating LLM-generated summaries into neurovascular consultations, with further optimization needed for temporal data tracking and on-premise implementation to address privacy concerns.

摘要

目的

在门诊环境中，必须经常在时间限制下审查大量患者记录，因此高效提取和总结关键临床信息至关重要。大语言模型（LLMs）可能有助于完成这项任务，但需要对其临床可靠性进行验证。本研究评估了OpenAI的GPT-4o生成结构化总结以协助神经血管会诊准备的能力，旨在通过自动提取关键数据来提高效率。

方法

2024年5月至8月在一家三级护理医院进行了一项前瞻性研究，共纳入70例患者。GPT-4o使用预定义模板生成结构化总结。提取的数据分为动脉瘤特定细节、影像总结和患者特定临床因素。由临床医生评估准确性和完整性，使用精确率、召回率、特异性和准确率来衡量性能。

结果

大多数类别均测得较高的准确性（≥0.96）。在动脉瘤和患者特定数据中，提取性能因随时间的稳定性而异。动脉瘤位置和其他稳定细节被一致提取，而动脉瘤大小和用药清单的变化则存在差异。在极少数情况下，同一患者内的动脉瘤细节被错误归因于另一个动脉瘤。影像总结通常简洁且具有临床实用性，不过在总结多项既往研究时其有效性会下降。

结论

GPT-4o有效地构建了神经血管患者数据，显示出高准确性且错误极少。虽然观察到偶尔会出现像过时信息这样的错误归因，但可靠的来源引用便于轻松核实。这些发现支持将大语言模型生成的总结整合到神经血管会诊中，对于时间数据跟踪和本地实施以解决隐私问题还需要进一步优化。

相似文献

Information Extraction and Summarization for Neurovascular Consultations with GPT-4o: A Clinical Case Study.

Clin Neuroradiol. 2025 Jul 31. doi: 10.1007/s00062-025-01538-z.

Assessing the Accuracy and Reliability of Large Language Models in Psychiatry Using Standardized Multiple-Choice Questions: Cross-Sectional Study.

J Med Internet Res. 2025 May 20;27:e69910. doi: 10.2196/69910.

Evaluating a Large Language Model in Translating Patient Instructions to Spanish Using a Standardized Framework.

JAMA Pediatr. 2025 Jul 7. doi: 10.1001/jamapediatrics.2025.1729.

A comparative study of recent large language models on generating hospital discharge summaries for lung cancer patients.

J Biomed Inform. 2025 Aug;168:104867. doi: 10.1016/j.jbi.2025.104867. Epub 2025 Jun 20.

Improving Large Language Models' Summarization Accuracy by Adding Highlights to Discharge Notes: Comparative Evaluation.

JMIR Med Inform. 2025 Jul 24;13:e66476. doi: 10.2196/66476.

Data extraction from free-text stroke CT reports using GPT-4o and Llama-3.3-70B: the impact of annotation guidelines.

Eur Radiol Exp. 2025 Jun 19;9(1):61. doi: 10.1186/s41747-025-00600-2.

Evaluating Large Language Models for Drafting Emergency Department Discharge Summaries.

medRxiv. 2024 Apr 4:2024.04.03.24305088. doi: 10.1101/2024.04.03.24305088.

Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.

Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.

Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.

Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.

Intravenous magnesium sulphate and sotalol for prevention of atrial fibrillation after coronary artery bypass surgery: a systematic review and economic evaluation.

Health Technol Assess. 2008 Jun;12(28):iii-iv, ix-95. doi: 10.3310/hta12280.

本文引用的文献

Viability of Open Large Language Models for Clinical Documentation in German Health Care: Real-World Model Evaluation Study.

JMIR Med Inform. 2024 Aug 28;12:e59617. doi: 10.2196/59617.

Monitoring Patients with Glioblastoma by Using a Large Language Model: Accurate Summarization of Radiology Reports with GPT-4.

Radiology. 2024 Jul;312(1):e232640. doi: 10.1148/radiol.232640.

Adapted large language models can outperform medical experts in clinical text summarization.

Nat Med. 2024 Apr;30(4):1134-1142. doi: 10.1038/s41591-024-02855-5. Epub 2024 Feb 27.

Large Language Model-Based Chatbot vs Surgeon-Generated Informed Consent Documentation for Common Procedures.

JAMA Netw Open. 2023 Oct 2;6(10):e2336997. doi: 10.1001/jamanetworkopen.2023.36997.

ELAPSS score for prediction of risk of growth of unruptured intracranial aneurysms.

Neurology. 2017 Apr 25;88(17):1600-1606. doi: 10.1212/WNL.0000000000003865. Epub 2017 Mar 31.

The unruptured intracranial aneurysm treatment score: a multidisciplinary consensus.

Neurology. 2015 Sep 8;85(10):881-9. doi: 10.1212/WNL.0000000000001891. Epub 2015 Aug 14.

PHASES Score for Prediction of Intracranial Aneurysm Growth.

Stroke. 2015 May;46(5):1221-6. doi: 10.1161/STROKEAHA.114.008198. Epub 2015 Mar 10.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

使用GPT-4o进行神经血管会诊的信息提取与总结：一项临床案例研究。

Information Extraction and Summarization for Neurovascular Consultations with GPT-4o: A Clinical Case Study.

作者信息

机构信息

Department of Diagnostic and Interventional Neuroradiology, Clinic of Radiology and Nuclear Medicine, University Hospital Basel, Petersgraben 4, 4031, Basel, Switzerland.

Clinic of Radiology and Nuclear Medicine, University Hospital Basel, Petersgraben 4, 4031, Basel, Switzerland.

出版信息

Clin Neuroradiol. 2025 Jul 31. doi: 10.1007/s00062-025-01538-z.

DOI:10.1007/s00062-025-01538-z

PMID:40742451

Abstract

PURPOSE

METHODS

RESULTS

CONCLUSION

摘要

使用GPT-4o进行神经血管会诊的信息提取与总结：一项临床案例研究。

Information Extraction and Summarization for Neurovascular Consultations with GPT-4o: A Clinical Case Study.

作者信息

机构信息

出版信息

PURPOSE

METHODS

RESULTS

CONCLUSION

目的

方法

结果

结论

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

使用GPT-4o进行神经血管会诊的信息提取与总结：一项临床案例研究。

Information Extraction and Summarization for Neurovascular Consultations with GPT-4o: A Clinical Case Study.

作者信息

机构信息

出版信息

PURPOSE

METHODS

RESULTS

CONCLUSION

目的

方法

结果

结论

相似文献

本文引用的文献