• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用思维链微调的大语言模型从肺癌手术病理报告中进行自动病理TN分类预测及依据生成:算法开发与验证研究

Automated Pathologic TN Classification Prediction and Rationale Generation From Lung Cancer Surgical Pathology Reports Using a Large Language Model Fine-Tuned With Chain-of-Thought: Algorithm Development and Validation Study.

作者信息

Kim Sanghwan, Jang Sowon, Kim Borham, Sunwoo Leonard, Kim Seok, Chung Jin-Haeng, Nam Sejin, Cho Hyeongmin, Lee Donghyoung, Lee Keehyuck, Yoo Sooyoung

机构信息

ezCaretech Research & Development Center, Seoul, Republic of Korea.

Department of Radiology, Seoul National University Bundang Hospital, Seongnam, Republic of Korea.

出版信息

JMIR Med Inform. 2024 Dec 20;12:e67056. doi: 10.2196/67056.

DOI:10.2196/67056
PMID:39705675
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11699504/
Abstract

BACKGROUND

Traditional rule-based natural language processing approaches in electronic health record systems are effective but are often time-consuming and prone to errors when handling unstructured data. This is primarily due to the substantial manual effort required to parse and extract information from diverse types of documentation. Recent advancements in large language model (LLM) technology have made it possible to automatically interpret medical context and support pathologic staging. However, existing LLMs encounter challenges in rapidly adapting to specialized guideline updates. In this study, we fine-tuned an LLM specifically for lung cancer pathologic staging, enabling it to incorporate the latest guidelines for pathologic TN classification.

OBJECTIVE

This study aims to evaluate the performance of fine-tuned generative language models in automatically inferring pathologic TN classifications and extracting their rationale from lung cancer surgical pathology reports. By addressing the inefficiencies and extensive parsing efforts associated with rule-based methods, this approach seeks to enable rapid and accurate reclassification aligned with the latest cancer staging guidelines.

METHODS

We conducted a comparative performance evaluation of 6 open-source LLMs for automated TN classification and rationale generation, using 3216 deidentified lung cancer surgical pathology reports based on the American Joint Committee on Cancer (AJCC) Cancer Staging Manual8th edition, collected from a tertiary hospital. The dataset was preprocessed by segmenting each report according to lesion location and morphological diagnosis. Performance was assessed using exact match ratio (EMR) and semantic match ratio (SMR) as evaluation metrics, which measure classification accuracy and the contextual alignment of the generated rationales, respectively.

RESULTS

Among the 6 models, the Orca2_13b model achieved the highest performance with an EMR of 0.934 and an SMR of 0.864. The Orca2_7b model also demonstrated strong performance, recording an EMR of 0.914 and an SMR of 0.854. In contrast, the Llama2_7b model achieved an EMR of 0.864 and an SMR of 0.771, while the Llama2_13b model showed an EMR of 0.762 and an SMR of 0.690. The Mistral_7b and Llama3_8b models, on the other hand, showed lower performance, with EMRs of 0.572 and 0.489, and SMRs of 0.377 and 0.456, respectively. Overall, the Orca2 models consistently outperformed the others in both TN stage classification and rationale generation.

CONCLUSIONS

The generative language model approach presented in this study has the potential to enhance and automate TN classification in complex cancer staging, supporting both clinical practice and oncology data curation. With additional fine-tuning based on cancer-specific guidelines, this approach can be effectively adapted to other cancer types.

摘要

背景

电子健康记录系统中传统的基于规则的自然语言处理方法是有效的,但在处理非结构化数据时通常很耗时且容易出错。这主要是因为需要大量的人工来从各种类型的文档中解析和提取信息。大语言模型(LLM)技术的最新进展使得自动解读医学背景和支持病理分期成为可能。然而,现有的大语言模型在快速适应专门的指南更新方面面临挑战。在本研究中,我们专门针对肺癌病理分期对一个大语言模型进行了微调,使其能够纳入病理TN分类的最新指南。

目的

本研究旨在评估微调后的生成式语言模型在自动推断病理TN分类并从肺癌手术病理报告中提取其依据方面的性能。通过解决与基于规则的方法相关的低效和大量解析工作,这种方法旨在实现与最新癌症分期指南一致的快速准确的重新分类。

方法

我们使用从一家三级医院收集的3216份基于美国癌症联合委员会(AJCC)癌症分期手册第8版去识别的肺癌手术病理报告,对6个开源大语言模型进行了自动TN分类和依据生成的比较性能评估。数据集通过根据病变位置和形态学诊断对每份报告进行分割来进行预处理。使用精确匹配率(EMR)和语义匹配率(SMR)作为评估指标来评估性能,这两个指标分别衡量分类准确性和生成依据的上下文一致性。

结果

在这6个模型中,Orca2_13b模型表现最佳,EMR为0.934,SMR为0.864。Orca2_7b模型也表现出很强的性能,EMR为0.914,SMR为0.854。相比之下,Llama2_7b模型的EMR为0.864,SMR为0.771,而Llama2_13b模型的EMR为0.762,SMR为0.690。另一方面,Mistral_7b和Llama3_8b模型表现较差,EMR分别为0.572和0.489,SMR分别为0.377和0.456。总体而言,Orca2模型在TN分期分类和依据生成方面始终优于其他模型。

结论

本研究中提出的生成式语言模型方法有可能在复杂癌症分期中增强和自动化TN分类,支持临床实践和肿瘤学数据管理。通过基于癌症特定指南进行额外的微调,这种方法可以有效地应用于其他癌症类型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/665f/11699504/419b895183c4/medinform_v12i1e67056_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/665f/11699504/598ef5d1aabe/medinform_v12i1e67056_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/665f/11699504/296300bfc613/medinform_v12i1e67056_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/665f/11699504/419b895183c4/medinform_v12i1e67056_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/665f/11699504/598ef5d1aabe/medinform_v12i1e67056_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/665f/11699504/296300bfc613/medinform_v12i1e67056_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/665f/11699504/419b895183c4/medinform_v12i1e67056_fig3.jpg

相似文献

1
Automated Pathologic TN Classification Prediction and Rationale Generation From Lung Cancer Surgical Pathology Reports Using a Large Language Model Fine-Tuned With Chain-of-Thought: Algorithm Development and Validation Study.使用思维链微调的大语言模型从肺癌手术病理报告中进行自动病理TN分类预测及依据生成:算法开发与验证研究
JMIR Med Inform. 2024 Dec 20;12:e67056. doi: 10.2196/67056.
2
Extracting lung cancer staging descriptors from pathology reports: A generative language model approach.从病理报告中提取肺癌分期描述符:一种生成式语言模型方法。
J Biomed Inform. 2024 Sep;157:104720. doi: 10.1016/j.jbi.2024.104720. Epub 2024 Sep 2.
3
A dataset and benchmark for hospital course summarization with adapted large language models.一个用于医院病程总结的数据集和基准测试,采用了适配的大语言模型。
J Am Med Inform Assoc. 2025 Mar 1;32(3):470-479. doi: 10.1093/jamia/ocae312.
4
Assessing Completeness of Clinical Histories Accompanying Imaging Orders Using Adapted Open-Source and Closed-Source Large Language Models.使用适配的开源和闭源大语言模型评估影像检查申请单所附临床病史的完整性
Radiology. 2025 Feb;314(2):e241051. doi: 10.1148/radiol.241051.
5
Utilizing large language models for detecting hospital-acquired conditions: an empirical study on pulmonary embolism.利用大语言模型检测医院获得性疾病:关于肺栓塞的实证研究
J Am Med Inform Assoc. 2025 May 1;32(5):876-884. doi: 10.1093/jamia/ocaf048.
6
Using Large Language Models to Automate Data Extraction From Surgical Pathology Reports: Retrospective Cohort Study.使用大语言模型自动从外科病理报告中提取数据:回顾性队列研究。
JMIR Form Res. 2025 Apr 7;9:e64544. doi: 10.2196/64544.
7
A large language model-based generative natural language processing framework fine-tuned on clinical notes accurately extracts headache frequency from electronic health records.基于大型语言模型的生成式自然语言处理框架,在临床笔记上进行了微调,能够从电子健康记录中准确提取头痛频率。
Headache. 2024 Apr;64(4):400-409. doi: 10.1111/head.14702. Epub 2024 Mar 25.
8
Automated Transformation of Unstructured Cardiovascular Diagnostic Reports into Structured Datasets Using Sequentially Deployed Large Language Models.使用顺序部署的大语言模型将非结构化心血管诊断报告自动转换为结构化数据集
medRxiv. 2024 Oct 8:2024.10.08.24315035. doi: 10.1101/2024.10.08.24315035.
9
Large Language Model Approach for Zero-Shot Information Extraction and Clustering of Japanese Radiology Reports: Algorithm Development and Validation.用于日本放射学报告的零样本信息提取和聚类的大语言模型方法:算法开发与验证
JMIR Cancer. 2025 Jan 23;11:e57275. doi: 10.2196/57275.
10
Large Language Model Applications for Health Information Extraction in Oncology: Scoping Review.用于肿瘤学健康信息提取的大语言模型应用:范围综述
JMIR Cancer. 2025 Mar 28;11:e65984. doi: 10.2196/65984.

引用本文的文献

1
Digital and Artificial Intelligence-based Pathology: Not for Every Laboratory - A Mini-review on the Benefits and Pitfalls of Its Implementation.基于数字和人工智能的病理学:并非适用于每个实验室——关于其实施的益处与陷阱的小型综述
J Clin Transl Pathol. 2025 Jun;5(2):79-85. doi: 10.14218/jctp.2025.00007. Epub 2025 Apr 3.
2
Machine learning approaches for EGFR mutation status prediction in NSCLC: an updated systematic review.用于非小细胞肺癌中表皮生长因子受体突变状态预测的机器学习方法:一项更新的系统评价
Front Oncol. 2025 Jul 10;15:1576461. doi: 10.3389/fonc.2025.1576461. eCollection 2025.

本文引用的文献

1
Non-Small Cell Lung Cancer, Version 4.2024, NCCN Clinical Practice Guidelines in Oncology.非小细胞肺癌临床实践指南(第 4.2024 版),NCCN 肿瘤学临床实践指南
J Natl Compr Canc Netw. 2024 May;22(4):249-274. doi: 10.6004/jnccn.2204.0023.
2
The International Association for the Study of Lung Cancer Lung Cancer Staging Project: Proposals for Revision of the TNM Stage Groups in the Forthcoming (Ninth) Edition of the TNM Classification for Lung Cancer.国际肺癌研究协会肺癌分期项目:对即将发布的(第九版)肺癌 TNM 分类中 TNM 分期组的修订建议。
J Thorac Oncol. 2024 Jul;19(7):1007-1027. doi: 10.1016/j.jtho.2024.02.011. Epub 2024 Mar 4.
3
NCCN Guidelines® Insights: Non-Small Cell Lung Cancer, Version 2.2023.
美国国立综合癌症网络(NCCN)指南见解:非小细胞肺癌,2023年第2版
J Natl Compr Canc Netw. 2023 Apr;21(4):340-350. doi: 10.6004/jnccn.2023.0020.
4
Named entity recognition of Chinese electronic medical records based on a hybrid neural network and medical MC-BERT.基于混合神经网络和医学 MC-BERT 的中文电子病历命名实体识别。
BMC Med Inform Decis Mak. 2022 Dec 1;22(1):315. doi: 10.1186/s12911-022-02059-2.
5
CancerBERT: a cancer domain-specific language model for extracting breast cancer phenotypes from electronic health records.CancerBERT:一种癌症领域特定的语言模型,用于从电子健康记录中提取乳腺癌表型。
J Am Med Inform Assoc. 2022 Jun 14;29(7):1208-1216. doi: 10.1093/jamia/ocac040.
6
Automatic Extraction of Lung Cancer Staging Information From Computed Tomography Reports: Deep Learning Approach.从计算机断层扫描报告中自动提取肺癌分期信息:深度学习方法
JMIR Med Inform. 2021 Jul 21;9(7):e27955. doi: 10.2196/27955.
7
Deep Natural Language Processing to Identify Symptom Documentation in Clinical Notes for Patients With Heart Failure Undergoing Cardiac Resynchronization Therapy.深度自然语言处理在识别心力衰竭行心脏再同步治疗患者临床记录中的症状文档。
J Pain Symptom Manage. 2020 Nov;60(5):948-958.e3. doi: 10.1016/j.jpainsymman.2020.06.010. Epub 2020 Jun 22.
8
Colorectal cancer statistics, 2020.2020 年结直肠癌统计数据。
CA Cancer J Clin. 2020 May;70(3):145-164. doi: 10.3322/caac.21601. Epub 2020 Mar 5.
9
Essential TNM: a registry tool to reduce gaps in cancer staging information.基本 TNM:一种用于减少癌症分期信息差距的登记工具。
Lancet Oncol. 2019 Feb;20(2):e103-e111. doi: 10.1016/S1470-2045(18)30897-0.
10
The Eighth Edition of TNM Staging of Lung Cancer: Reference Chart and Diagrams.第八版肺癌 TNM 分期:参考图表
Oncologist. 2018 Jul;23(7):844-848. doi: 10.1634/theoncologist.2017-0659. Epub 2018 Apr 12.