• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于开源大语言模型的乳腺癌治疗后患者为中心结局自动提取工具包。

Automated Extraction of Patient-Centered Outcomes After Breast Cancer Treatment: An Open-Source Large Language Model-Based Toolkit.

机构信息

Department of Radiology, Mayo Clinic, Phoenix, AZ.

Departments of Medicine and of Epidemiology & Population Health, Stanford University School of Medicine, Palo Alto, CA.

出版信息

JCO Clin Cancer Inform. 2024 Aug;8:e2300258. doi: 10.1200/CCI.23.00258.

DOI:10.1200/CCI.23.00258
PMID:39167746
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11867221/
Abstract

PURPOSE

Patient-centered outcomes (PCOs) are pivotal in cancer treatment, as they directly reflect patients' quality of life. Although multiple studies suggest that factors affecting breast cancer-related morbidity and survival are influenced by treatment side effects and adherence to long-term treatment, such data are generally only available on a smaller scale or from a single center. The primary challenge with collecting these data is that the outcomes are captured as free text in clinical narratives written by clinicians.

MATERIALS AND METHODS

Given the complexity of PCO documentation in these narratives, computerized methods are necessary to unlock the wealth of information buried in unstructured text notes that often document PCOs. Inspired by the success of large language models (LLMs), we examined the adaptability of three LLMs, GPT-2, BioGPT, and PMC-LLaMA, on PCO tasks across three institutions, Mayo Clinic, Emory University Hospital, and Stanford University. We developed an open-source framework for fine-tuning LLM that can directly extract the five different categories of PCO from the clinic notes.

RESULTS

We found that these LLMs without fine-tuning (zero-shot) struggle with challenging PCO extraction tasks, displaying almost random performance, even with some task-specific examples (few-shot learning). The performance of our fine-tuned, task-specific models is notably superior compared with their non-fine-tuned LLM models. Moreover, the fine-tuned GPT-2 model has demonstrated a significantly better performance than the other two larger LLMs.

CONCLUSION

Our discovery indicates that although LLMs serve as effective general-purpose models for tasks across various domains, they require fine-tuning when applied to the clinician domain. Our proposed approach has the potential to lead more efficient, adaptable models for PCO information extraction, reducing reliance on extensive computational resources while still delivering superior performance for specific tasks.

摘要

目的

患者为中心的结局(PCOs)在癌症治疗中至关重要,因为它们直接反映了患者的生活质量。尽管多项研究表明,影响乳腺癌相关发病率和生存率的因素受到治疗副作用和长期治疗依从性的影响,但这些数据通常仅在较小规模或来自单个中心获得。收集这些数据的主要挑战是,结局是以临床医生撰写的临床叙述中的自由文本形式捕获的。

材料和方法

鉴于这些叙述中 PCO 文档的复杂性,需要计算机化方法来解锁隐藏在非结构化文本注释中的丰富信息,这些注释通常记录了 PCOs。受大型语言模型(LLMs)成功的启发,我们检查了 GPT-2、BioGPT 和 PMC-LLaMA 这三个 LLM 在梅奥诊所、埃默里大学医院和斯坦福大学三个机构的 PCO 任务上的适应性。我们开发了一个用于微调 LLM 的开源框架,该框架可以直接从诊所记录中提取五个不同类别的 PCO。

结果

我们发现,这些未经微调的 LLM(零样本)在具有挑战性的 PCO 提取任务中表现不佳,即使提供了一些特定于任务的示例(少样本学习),性能也几乎随机。与非微调的 LLM 模型相比,我们专门针对任务进行微调的模型的性能明显更好。此外,微调后的 GPT-2 模型的性能明显优于其他两个更大的 LLM。

结论

我们的发现表明,尽管 LLM 作为跨各种领域任务的有效通用模型,但在应用于临床医生领域时需要进行微调。我们提出的方法有可能为 PCO 信息提取带来更高效、适应性更强的模型,减少对大量计算资源的依赖,同时仍能为特定任务提供卓越的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc82/11867221/3e650b2a4b26/nihms-2024965-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc82/11867221/22229ac85132/nihms-2024965-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc82/11867221/1fad4db78646/nihms-2024965-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc82/11867221/fee95fa24c0f/nihms-2024965-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc82/11867221/3e650b2a4b26/nihms-2024965-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc82/11867221/22229ac85132/nihms-2024965-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc82/11867221/1fad4db78646/nihms-2024965-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc82/11867221/fee95fa24c0f/nihms-2024965-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc82/11867221/3e650b2a4b26/nihms-2024965-f0007.jpg

相似文献

1
Automated Extraction of Patient-Centered Outcomes After Breast Cancer Treatment: An Open-Source Large Language Model-Based Toolkit.基于开源大语言模型的乳腺癌治疗后患者为中心结局自动提取工具包。
JCO Clin Cancer Inform. 2024 Aug;8:e2300258. doi: 10.1200/CCI.23.00258.
2
A dataset and benchmark for hospital course summarization with adapted large language models.一个用于医院病程总结的数据集和基准测试,采用了适配的大语言模型。
J Am Med Inform Assoc. 2025 Mar 1;32(3):470-479. doi: 10.1093/jamia/ocae312.
3
Open-Source Hybrid Large Language Model Integrated System for Extraction of Breast Cancer Treatment Pathway From Free-Text Clinical Notes.用于从自由文本临床记录中提取乳腺癌治疗路径的开源混合大语言模型集成系统
JCO Clin Cancer Inform. 2025 Jun;9:e2500002. doi: 10.1200/CCI-25-00002. Epub 2025 Jun 27.
4
Short-Term Memory Impairment短期记忆障碍
5
Toward Cross-Hospital Deployment of Natural Language Processing Systems: Model Development and Validation of Fine-Tuned Large Language Models for Disease Name Recognition in Japanese.迈向自然语言处理系统的跨医院部署:用于日语疾病名称识别的微调大语言模型的模型开发与验证
JMIR Med Inform. 2025 Jul 8;13:e76773. doi: 10.2196/76773.
6
Sexual Harassment and Prevention Training性骚扰与预防培训
7
Implementing Large Language Models in Health Care: Clinician-Focused Review With Interactive Guideline.在医疗保健中应用大语言模型:以临床医生为重点的回顾与交互式指南
J Med Internet Res. 2025 Jul 11;27:e71916. doi: 10.2196/71916.
8
Advancing entity recognition in biomedicine via instruction tuning of large language models.通过指令调整大型语言模型推进生物医学中的实体识别。
Bioinformatics. 2024 Mar 29;40(4). doi: 10.1093/bioinformatics/btae163.
9
BioInstruct: instruction tuning of large language models for biomedical natural language processing.BioInstruct:用于生物医学自然语言处理的大型语言模型的指令调整。
J Am Med Inform Assoc. 2024 Sep 1;31(9):1821-1832. doi: 10.1093/jamia/ocae122.
10
Improving Large Language Models' Summarization Accuracy by Adding Highlights to Discharge Notes: Comparative Evaluation.通过在出院小结中添加重点内容提高大语言模型的总结准确性:比较评估
JMIR Med Inform. 2025 Jul 24;13:e66476. doi: 10.2196/66476.

引用本文的文献

1
Open-Source Hybrid Large Language Model Integrated System for Extraction of Breast Cancer Treatment Pathway From Free-Text Clinical Notes.用于从自由文本临床记录中提取乳腺癌治疗路径的开源混合大语言模型集成系统
JCO Clin Cancer Inform. 2025 Jun;9:e2500002. doi: 10.1200/CCI-25-00002. Epub 2025 Jun 27.
2
The Role of Artificial Intelligence (ChatGPT-4o) in Supporting Tumor Board Decisions.人工智能(ChatGPT-4o)在辅助肿瘤专家委员会决策中的作用
J Clin Med. 2025 May 18;14(10):3535. doi: 10.3390/jcm14103535.
3
Assessing the accuracy of the GPT-4 model in multidisciplinary tumor board decision prediction.评估GPT-4模型在多学科肿瘤病例讨论决策预测中的准确性。
Clin Transl Oncol. 2025 Mar 25. doi: 10.1007/s12094-025-03905-1.
4
Large language models in cancer: potentials, risks, and safeguards.癌症领域的大语言模型:潜力、风险与保障措施
BJR Artif Intell. 2024 Dec 20;2(1):ubae019. doi: 10.1093/bjrai/ubae019. eCollection 2025 Jan.

本文引用的文献

1
A large language model-based generative natural language processing framework fine-tuned on clinical notes accurately extracts headache frequency from electronic health records.基于大型语言模型的生成式自然语言处理框架,在临床笔记上进行了微调,能够从电子健康记录中准确提取头痛频率。
Headache. 2024 Apr;64(4):400-409. doi: 10.1111/head.14702. Epub 2024 Mar 25.
2
Deep learning-based natural language processing for detecting medical symptoms and histories in emergency patient triage.基于深度学习的自然语言处理在急诊分诊中检测医疗症状和病史。
Am J Emerg Med. 2024 Mar;77:29-38. doi: 10.1016/j.ajem.2023.11.063. Epub 2023 Dec 10.
3
Leveraging Large Language Models for Decision Support in Personalized Oncology.利用大型语言模型为个性化肿瘤学提供决策支持。
JAMA Netw Open. 2023 Nov 1;6(11):e2343689. doi: 10.1001/jamanetworkopen.2023.43689.
4
Developing prompts from large language model for extracting clinical information from pathology and ultrasound reports in breast cancer.利用大语言模型开发提示,以从乳腺癌的病理学和超声报告中提取临床信息。
Radiat Oncol J. 2023 Sep;41(3):209-216. doi: 10.3857/roj.2023.00633. Epub 2023 Sep 21.
5
Empirical evaluation of language modeling to ascertain cancer outcomes from clinical text reports.从临床文本报告中评估语言模型以确定癌症结果的实证研究。
BMC Bioinformatics. 2023 Sep 2;24(1):328. doi: 10.1186/s12859-023-05439-1.
6
Decoding radiology reports: Potential application of OpenAI ChatGPT to enhance patient understanding of diagnostic reports.解读放射学报告:OpenAI ChatGPT 潜在应用于增强患者对诊断报告的理解。
Clin Imaging. 2023 Sep;101:137-141. doi: 10.1016/j.clinimag.2023.06.008. Epub 2023 Jun 8.
7
Extracting Medical Information From Free-Text and Unstructured Patient-Generated Health Data Using Natural Language Processing Methods: Feasibility Study With Real-world Data.使用自然语言处理方法从自由文本和非结构化患者生成的健康数据中提取医学信息:基于真实世界数据的可行性研究
JMIR Form Res. 2023 Mar 7;7:e43014. doi: 10.2196/43014.
8
A large language model for electronic health records.用于电子健康记录的大型语言模型。
NPJ Digit Med. 2022 Dec 26;5(1):194. doi: 10.1038/s41746-022-00742-2.
9
BioGPT: generative pre-trained transformer for biomedical text generation and mining.BioGPT:用于生物医学文本生成和挖掘的生成式预训练转换器。
Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac409.
10
Relation Extraction from Clinical Narratives Using Pre-trained Language Models.使用预训练语言模型从临床叙述中提取关系
AMIA Annu Symp Proc. 2020 Mar 4;2019:1236-1245. eCollection 2019.