• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

自然语言处理如何借助自由文本放射学报告辅助肺肿瘤学肿瘤淋巴结转移分期:算法开发与验证

How Natural Language Processing Can Aid With Pulmonary Oncology Tumor Node Metastasis Staging From Free-Text Radiology Reports: Algorithm Development and Validation.

作者信息

Puts Sander, Nobel Martijn, Zegers Catharina, Bermejo Iñigo, Robben Simon, Dekker Andre

机构信息

GROW School for Oncology and Reproduction, Maastricht University Medical Centre+, Maastricht, Netherlands.

Department of Radiation Oncology, Maastro, Maastricht, Netherlands.

出版信息

JMIR Form Res. 2023 Mar 22;7:e38125. doi: 10.2196/38125.

DOI:10.2196/38125
PMID:36947118
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10131747/
Abstract

BACKGROUND

Natural language processing (NLP) is thought to be a promising solution to extract and store concepts from free text in a structured manner for data mining purposes. This is also true for radiology reports, which still consist mostly of free text. Accurate and complete reports are very important for clinical decision support, for instance, in oncological staging. As such, NLP can be a tool to structure the content of the radiology report, thereby increasing the report's value.

OBJECTIVE

This study describes the implementation and validation of an N-stage classifier for pulmonary oncology. It is based on free-text radiological chest computed tomography reports according to the tumor, node, and metastasis (TNM) classification, which has been added to the already existing T-stage classifier to create a combined TN-stage classifier.

METHODS

SpaCy, PyContextNLP, and regular expressions were used for proper information extraction, after additional rules were set to accurately extract N-stage.

RESULTS

The overall TN-stage classifier accuracy scores were 0.84 and 0.85, respectively, for the training (N=95) and validation (N=97) sets. This is comparable to the outcomes of the T-stage classifier (0.87-0.92).

CONCLUSIONS

This study shows that NLP has potential in classifying pulmonary oncology from free-text radiological reports according to the TNM classification system as both the T- and N-stages can be extracted with high accuracy.

摘要

背景

自然语言处理(NLP)被认为是一种很有前景的解决方案,可用于以结构化方式从自由文本中提取和存储概念,以用于数据挖掘目的。放射学报告也是如此,其大部分内容仍然是自由文本。准确完整的报告对于临床决策支持非常重要,例如在肿瘤分期方面。因此,NLP可以作为一种工具来构建放射学报告的内容,从而提高报告的价值。

目的

本研究描述了一种用于肺肿瘤学的N分期分类器的实施和验证。它基于根据肿瘤、淋巴结和转移(TNM)分类的自由文本胸部计算机断层扫描报告,并已添加到现有的T分期分类器中,以创建一个组合的TN分期分类器。

方法

在设置了额外规则以准确提取N分期后,使用SpaCy、PyContextNLP和正则表达式进行适当的信息提取。

结果

训练集(N = 95)和验证集(N = 97)的总体TN分期分类器准确率分别为0.84和0.85。这与T分期分类器的结果(0.87 - 0.92)相当。

结论

本研究表明,NLP在根据TNM分类系统从自由文本放射学报告中对肺肿瘤学进行分类方面具有潜力,因为T分期和N分期都可以高精度提取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9203/10131747/ad7a4a399d9e/formative_v7i1e38125_fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9203/10131747/9252e475218f/formative_v7i1e38125_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9203/10131747/a3381a5309a9/formative_v7i1e38125_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9203/10131747/7ecc871c78ad/formative_v7i1e38125_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9203/10131747/4a4f47c4f830/formative_v7i1e38125_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9203/10131747/ad7a4a399d9e/formative_v7i1e38125_fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9203/10131747/9252e475218f/formative_v7i1e38125_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9203/10131747/a3381a5309a9/formative_v7i1e38125_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9203/10131747/7ecc871c78ad/formative_v7i1e38125_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9203/10131747/4a4f47c4f830/formative_v7i1e38125_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9203/10131747/ad7a4a399d9e/formative_v7i1e38125_fig5.jpg

相似文献

1
How Natural Language Processing Can Aid With Pulmonary Oncology Tumor Node Metastasis Staging From Free-Text Radiology Reports: Algorithm Development and Validation.自然语言处理如何借助自由文本放射学报告辅助肺肿瘤学肿瘤淋巴结转移分期:算法开发与验证
JMIR Form Res. 2023 Mar 22;7:e38125. doi: 10.2196/38125.
2
Natural Language Processing Algorithm Used for Staging Pulmonary Oncology from Free-Text Radiological Reports: "Including PET-CT and Validation Towards Clinical Use".自然语言处理算法在放射学报告中的肺肿瘤分期中的应用:“包括 PET-CT 以及向临床应用的验证”。
J Imaging Inform Med. 2024 Feb;37(1):3-12. doi: 10.1007/s10278-023-00913-x. Epub 2024 Jan 12.
3
Natural Language Processing in Dutch Free Text Radiology Reports: Challenges in a Small Language Area Staging Pulmonary Oncology.荷兰语自由文本放射学报告中的自然语言处理:小语种地区肺部肿瘤分期面临的挑战
J Digit Imaging. 2020 Aug;33(4):1002-1008. doi: 10.1007/s10278-020-00327-z.
4
T-staging pulmonary oncology from radiological reports using natural language processing: translating into a multi-language setting.使用自然语言处理从放射学报告中进行肺癌T分期:向多语言环境的转化
Insights Imaging. 2021 Jun 10;12(1):77. doi: 10.1186/s13244-021-01018-1.
5
Automatic Classification of Tumor Response From Radiology Reports With Rule-Based Natural Language Processing Integrated Into the Clinical Oncology Workflow.基于规则的自然语言处理与临床肿瘤学工作流程集成的放射学报告中肿瘤反应的自动分类。
JCO Clin Cancer Inform. 2023 Jan;7:e2200139. doi: 10.1200/CCI.22.00139.
6
Performance of a Machine Learning Classifier of Knee MRI Reports in Two Large Academic Radiology Practices: A Tool to Estimate Diagnostic Yield.在两家大型学术放射科实践中膝关节MRI报告的机器学习分类器性能:一种估计诊断率的工具
AJR Am J Roentgenol. 2017 Apr;208(4):750-753. doi: 10.2214/AJR.16.16128. Epub 2017 Jan 31.
7
Extracting lung cancer staging descriptors from pathology reports: A generative language model approach.从病理报告中提取肺癌分期描述符:一种生成式语言模型方法。
J Biomed Inform. 2024 Sep;157:104720. doi: 10.1016/j.jbi.2024.104720. Epub 2024 Sep 2.
8
Automated Outcome Classification of Computed Tomography Imaging Reports for Pediatric Traumatic Brain Injury.小儿创伤性脑损伤计算机断层扫描成像报告的自动结果分类
Acad Emerg Med. 2016 Feb;23(2):171-8. doi: 10.1111/acem.12859. Epub 2016 Jan 14.
9
Comprehensive Word-Level Classification of Screening Mammography Reports Using a Neural Network Sequence Labeling Approach.基于神经网络序列标注方法的乳腺 X 线摄影筛查报告的全面词级分类。
J Digit Imaging. 2019 Oct;32(5):685-692. doi: 10.1007/s10278-018-0141-4.
10
Rule-based natural language processing for automation of stroke data extraction: a validation study.基于规则的自然语言处理在脑卒中数据提取自动化中的应用:一项验证研究。
Neuroradiology. 2022 Dec;64(12):2357-2362. doi: 10.1007/s00234-022-03029-1. Epub 2022 Aug 1.

引用本文的文献

1
Extracting Pulmonary Embolism Diagnoses From Radiology Impressions Using GPT-4o: Large Language Model Evaluation Study.使用GPT-4o从放射学诊断印象中提取肺栓塞诊断:大语言模型评估研究
JMIR Med Inform. 2025 Apr 9;13:e67706. doi: 10.2196/67706.
2
Diagnosis extraction from unstructured Dutch echocardiogram reports using span- and document-level characteristic classification.使用跨度和文档级特征分类从非结构化荷兰语超声心动图报告中提取诊断信息。
BMC Med Inform Decis Mak. 2025 Mar 7;25(1):115. doi: 10.1186/s12911-025-02897-w.
3
Uncertainty-aware automatic TNM staging classification for [F] Fluorodeoxyglucose PET-CT reports for lung cancer utilising transformer-based language models and multi-task learning.

本文引用的文献

1
A novel deep learning approach to extract Chinese clinical entities for lung cancer screening and staging.一种用于提取肺癌筛查和分期用中文临床实体的新型深度学习方法。
BMC Med Inform Decis Mak. 2021 Jul 30;21(Suppl 2):214. doi: 10.1186/s12911-021-01575-x.
2
Automatic Extraction of Lung Cancer Staging Information From Computed Tomography Reports: Deep Learning Approach.从计算机断层扫描报告中自动提取肺癌分期信息:深度学习方法
JMIR Med Inform. 2021 Jul 21;9(7):e27955. doi: 10.2196/27955.
3
T-staging pulmonary oncology from radiological reports using natural language processing: translating into a multi-language setting.
利用基于Transformer的语言模型和多任务学习对[F]氟脱氧葡萄糖PET-CT肺癌报告进行不确定性感知自动TNM分期分类。
BMC Med Inform Decis Mak. 2024 Dec 18;24(1):396. doi: 10.1186/s12911-024-02814-7.
4
Preliminary assessment of TNM classification performance for pancreatic cancer in Japanese radiology reports using GPT-4.使用GPT-4对日本放射学报告中胰腺癌的TNM分类性能进行初步评估。
Jpn J Radiol. 2025 Jan;43(1):51-55. doi: 10.1007/s11604-024-01643-y. Epub 2024 Aug 20.
5
Natural Language Processing Algorithm Used for Staging Pulmonary Oncology from Free-Text Radiological Reports: "Including PET-CT and Validation Towards Clinical Use".自然语言处理算法在放射学报告中的肺肿瘤分期中的应用:“包括 PET-CT 以及向临床应用的验证”。
J Imaging Inform Med. 2024 Feb;37(1):3-12. doi: 10.1007/s10278-023-00913-x. Epub 2024 Jan 12.
使用自然语言处理从放射学报告中进行肺癌T分期:向多语言环境的转化
Insights Imaging. 2021 Jun 10;12(1):77. doi: 10.1186/s13244-021-01018-1.
4
A Scalable Natural Language Processing for Inferring BT-RADS Categorization from Unstructured Brain Magnetic Resonance Reports.一种可扩展的自然语言处理方法,用于从非结构化的脑部磁共振报告中推断 BT-RADS 分类。
J Digit Imaging. 2020 Dec;33(6):1393-1400. doi: 10.1007/s10278-020-00350-0.
5
Natural Language Processing in Dutch Free Text Radiology Reports: Challenges in a Small Language Area Staging Pulmonary Oncology.荷兰语自由文本放射学报告中的自然语言处理:小语种地区肺部肿瘤分期面临的挑战
J Digit Imaging. 2020 Aug;33(4):1002-1008. doi: 10.1007/s10278-020-00327-z.
6
Deep Learning for Natural Language Processing in Radiology-Fundamentals and a Systematic Review.放射学中自然语言处理的深度学习——基础与系统综述
J Am Coll Radiol. 2020 May;17(5):639-648. doi: 10.1016/j.jacr.2019.12.026. Epub 2020 Jan 28.
7
Deep learning to convert unstructured CT pulmonary angiography reports into structured reports.深度学习将非结构化CT肺血管造影报告转换为结构化报告。
Eur Radiol Exp. 2019 Sep 23;3(1):37. doi: 10.1186/s41747-019-0118-1.
8
Automated Detection of Radiology Reports that Require Follow-up Imaging Using Natural Language Processing Feature Engineering and Machine Learning Classification.使用自然语言处理特征工程和机器学习分类自动检测需要随访成像的放射学报告。
J Digit Imaging. 2020 Feb;33(1):131-136. doi: 10.1007/s10278-019-00271-7.
9
Toward Complete Structured Information Extraction from Radiology Reports Using Machine Learning.利用机器学习从放射学报告中提取完整的结构化信息。
J Digit Imaging. 2019 Aug;32(4):554-564. doi: 10.1007/s10278-019-00234-y.
10
[Natural language processing in radiology : Neither trivial nor impossible].[放射学中的自然语言处理:既非轻而易举也非不可能]
Radiologe. 2019 Sep;59(9):828-832. doi: 10.1007/s00117-019-0555-0.