Developing prompts from large language model for extracting clinical information from pathology and ultrasound reports in breast cancer.

Authors

Choi Hyeon Seok, Song Jun Yeong, Shin Kyung Hwan, Chang Ji Hyun, Jang Bum-Sup

Affiliations

Department of Radiation Oncology, Seoul National University Hospital, Seoul National University College of Medicine, Seoul, Korea.

Institute of Radiation Medicine, Seoul National University Medical Research Center, Seoul, Korea.

Publication information

Radiat Oncol J. 2023 Sep;41(3):209-216. doi: 10.3857/roj.2023.00633. Epub 2023 Sep 21.

DOI:10.3857/roj.2023.00633
PMID:37793630
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10556835/
Abstract

PURPOSE

We aimed to evaluate the time and cost of developing prompts with a large language model (LLM), tailored to extract clinical factors in breast cancer patients, and the accuracy of those prompts.

MATERIALS AND METHODS

We collected surgical pathology and ultrasound reports from breast cancer patients who underwent radiotherapy from 2020 to 2022. We extracted the information using the Generative Pre-trained Transformer (GPT) for Sheets and Docs extension plugin and termed this the "LLM" method. The time and cost of developing the prompts with the LLM method were assessed and compared with those spent collecting information with the "full manual" and "LLM-assisted manual" methods. To assess accuracy, 340 patients were randomly selected, and the information extracted by the LLM method was compared with that collected by the "full manual" method.
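The accuracy check described above (a random sample of patients, LLM output compared field by field with manually collected values) can be sketched as follows. This is a minimal illustration, not the authors' procedure: the function names, the exact-match rule after lowercasing and trimming, the definition of overall accuracy as the share of matching cells, and the toy records are all assumptions.

```python
import random


def normalize(value):
    """Map a raw cell to a comparable string; None becomes the empty string."""
    return "" if value is None else str(value).strip().lower()


def sample_ids(patient_ids, k, seed=0):
    """Draw k patient IDs at random without replacement (seeded so it can be repeated)."""
    rng = random.Random(seed)
    return rng.sample(sorted(patient_ids), k)


def field_accuracy(llm_rows, manual_rows, ids, fields):
    """Return (per-field accuracy, overall accuracy) using exact match after normalize().

    Overall accuracy here is matching cells / all compared cells (an assumption).
    """
    per_field = {}
    hits_total = 0
    for field in fields:
        hits = sum(
            normalize(llm_rows[pid].get(field)) == normalize(manual_rows[pid].get(field))
            for pid in ids
        )
        per_field[field] = hits / len(ids)
        hits_total += hits
    overall = hits_total / (len(ids) * len(fields))
    return per_field, overall


# Hypothetical toy records keyed by patient ID (not data from the study).
manual = {
    1: {"lvi": "Present", "grade": "2"},
    2: {"lvi": "Absent", "grade": "3"},
    3: {"lvi": "Absent", "grade": "1"},
    4: {"lvi": "Present", "grade": "2"},
}
llm = {
    1: {"lvi": "present", "grade": "2"},
    2: {"lvi": "absent", "grade": "2"},
    3: {"lvi": "absent", "grade": "1"},
    4: {"lvi": "present ", "grade": "3"},
}

ids = sample_ids(manual, k=4)
per_field, overall = field_accuracy(llm, manual, ids, ["lvi", "grade"])
print(per_field, overall)
```

In the study the sample size would be 340 rather than 4. A stricter or looser matching rule than the one assumed here would change the resulting accuracy.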

RESULTS

Data from 2,931 patients were collected. We developed 12 prompts for the Extract function and 12 for the Format function to extract and standardize the information. The overall accuracy was 87.7%; for lymphovascular invasion, it was 98.2%. Developing the prompts took 3.5 hours and processing them took 15 minutes. Using the ChatGPT application programming interface (API) cost US $65.8; when the estimated wage was factored in, the total cost was US $95.4. In an estimated comparison, the "LLM-assisted manual" and "LLM" methods were more time- and cost-efficient than the "full manual" method.
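The reported time and cost figures can be turned into per-patient quantities with simple arithmetic. The sketch below uses only the numbers stated above. The function name and the per-patient breakdown are illustrative assumptions, not figures reported by the authors.

```python
def cost_summary(api_cost_usd, total_cost_usd, develop_hours, process_minutes, n_patients):
    """Derive simple quantities from the reported time and cost figures."""
    # Wage component = total cost minus API cost (follows directly from the two totals).
    wage_usd = total_cost_usd - api_cost_usd
    return {
        "wage_usd": round(wage_usd, 2),
        "total_hours": develop_hours + process_minutes / 60,
        # Per-patient values are an illustrative breakdown, not reported in the abstract.
        "usd_per_patient": round(total_cost_usd / n_patients, 4),
        "process_seconds_per_patient": round(process_minutes * 60 / n_patients, 2),
    }


summary = cost_summary(
    api_cost_usd=65.8,
    total_cost_usd=95.4,
    develop_hours=3.5,
    process_minutes=15,
    n_patients=2931,
)
print(summary)
```

Under these figures the wage component is US $29.6 and the total cost is roughly 3 US cents per patient.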

CONCLUSION

Developing and applying prompts for an LLM to derive clinical factors was an efficient way to extract crucial information from a large volume of medical records. This study demonstrated the potential of natural language processing with an LLM in breast cancer patients. The prompts from the current study can be reused in other research to collect clinical information.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d8e/10556835/03ddcf9e120b/roj-2023-00633f1.jpg
