• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

荷兰语自由文本放射学报告中的自然语言处理:小语种地区肺部肿瘤分期面临的挑战

Natural Language Processing in Dutch Free Text Radiology Reports: Challenges in a Small Language Area Staging Pulmonary Oncology.

作者信息

Nobel J Martijn, Puts Sander, Bakers Frans C H, Robben Simon G F, Dekker André L A J

机构信息

Department of Radiology and Nuclear Medicine, Maastricht University Medical Center+, Postbox 5800, 6202, Maastricht, AZ, Netherlands.

School of Health Professions Education, Maastricht University, Maastricht, Netherlands.

出版信息

J Digit Imaging. 2020 Aug;33(4):1002-1008. doi: 10.1007/s10278-020-00327-z.

DOI:10.1007/s10278-020-00327-z
PMID:32076924
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7522136/
Abstract

Reports are the standard way of communication between the radiologist and the referring clinician. Efforts are made to improve this communication by, for instance, introducing standardization and structured reporting. Natural Language Processing (NLP) is another promising tool which can improve and enhance the radiological report by processing free text. NLP as such adds structure to the report and exposes the information, which in turn can be used for further analysis. This paper describes pre-processing and processing steps and highlights important challenges to overcome in order to successfully implement a free text mining algorithm using NLP tools and machine learning in a small language area, like Dutch. A rule-based algorithm was constructed to classify T-stage of pulmonary oncology from the original free text radiological report, based on the items tumor size, presence and involvement according to the 8th TNM classification system. PyContextNLP, spaCy and regular expressions were used as tools to extract the correct information and process the free text. Overall accuracy of the algorithm for evaluating T-stage was 0,83 in the training set and 0,87 in the validation set, which shows that the approach in this pilot study is promising. Future research with larger datasets and external validation is needed to be able to introduce more machine learning approaches and perhaps to reduce required input efforts of domain-specific knowledge. However, a hybrid NLP approach will probably achieve the best results.

摘要

报告是放射科医生与转诊临床医生之间的标准沟通方式。人们通过引入标准化和结构化报告等方式努力改善这种沟通。自然语言处理(NLP)是另一种有前景的工具,它可以通过处理自由文本改进和完善放射学报告。NLP为报告增添了结构并揭示了信息,这些信息进而可用于进一步分析。本文描述了预处理和处理步骤,并强调了在荷兰语等小语种领域使用NLP工具和机器学习成功实施自由文本挖掘算法需要克服的重要挑战。基于第8版TNM分类系统中的肿瘤大小、存在情况和累及范围等项目,构建了一种基于规则的算法,用于从原始自由文本放射学报告中对肺肿瘤学的T分期进行分类。使用PyContextNLP、spaCy和正则表达式作为工具来提取正确信息并处理自由文本。该算法评估T分期的总体准确率在训练集中为0.83,在验证集中为0.87,这表明该初步研究中的方法很有前景。需要使用更大的数据集进行未来研究并进行外部验证,以便能够引入更多机器学习方法,并可能减少特定领域知识所需的输入工作量。然而,混合NLP方法可能会取得最佳效果。

相似文献

1
Natural Language Processing in Dutch Free Text Radiology Reports: Challenges in a Small Language Area Staging Pulmonary Oncology.荷兰语自由文本放射学报告中的自然语言处理:小语种地区肺部肿瘤分期面临的挑战
J Digit Imaging. 2020 Aug;33(4):1002-1008. doi: 10.1007/s10278-020-00327-z.
2
How Natural Language Processing Can Aid With Pulmonary Oncology Tumor Node Metastasis Staging From Free-Text Radiology Reports: Algorithm Development and Validation.自然语言处理如何借助自由文本放射学报告辅助肺肿瘤学肿瘤淋巴结转移分期:算法开发与验证
JMIR Form Res. 2023 Mar 22;7:e38125. doi: 10.2196/38125.
3
Natural Language Processing Algorithm Used for Staging Pulmonary Oncology from Free-Text Radiological Reports: "Including PET-CT and Validation Towards Clinical Use".自然语言处理算法在放射学报告中的肺肿瘤分期中的应用:“包括 PET-CT 以及向临床应用的验证”。
J Imaging Inform Med. 2024 Feb;37(1):3-12. doi: 10.1007/s10278-023-00913-x. Epub 2024 Jan 12.
4
T-staging pulmonary oncology from radiological reports using natural language processing: translating into a multi-language setting.使用自然语言处理从放射学报告中进行肺癌T分期:向多语言环境的转化
Insights Imaging. 2021 Jun 10;12(1):77. doi: 10.1186/s13244-021-01018-1.
5
Information extraction from multi-institutional radiology reports.从多机构放射学报告中提取信息。
Artif Intell Med. 2016 Jan;66:29-39. doi: 10.1016/j.artmed.2015.09.007. Epub 2015 Oct 3.
6
Performance of a Machine Learning Classifier of Knee MRI Reports in Two Large Academic Radiology Practices: A Tool to Estimate Diagnostic Yield.在两家大型学术放射科实践中膝关节MRI报告的机器学习分类器性能:一种估计诊断率的工具
AJR Am J Roentgenol. 2017 Apr;208(4):750-753. doi: 10.2214/AJR.16.16128. Epub 2017 Jan 31.
7
Practical Guide to Natural Language Processing for Radiology.实用放射医学自然语言处理指南。
Radiographics. 2021 Sep-Oct;41(5):1446-1453. doi: 10.1148/rg.2021200113.
8
Natural language processing in narrative breast radiology reporting in University Malaya Medical Centre.马来西亚大学医学中心乳腺影像学叙述性报告中的自然语言处理
Health Informatics J. 2023 Jul-Sep;29(3):14604582231203763. doi: 10.1177/14604582231203763.
9
Transformer versus traditional natural language processing: how much data is enough for automated radiology report classification?Transformer 与传统自然语言处理:自动化放射科报告分类需要多少数据?
Br J Radiol. 2023 Sep;96(1149):20220769. doi: 10.1259/bjr.20220769. Epub 2023 May 25.
10
Essential Elements of Natural Language Processing: What the Radiologist Should Know.自然语言处理的基本要素:放射科医生应该知道的内容。
Acad Radiol. 2020 Jan;27(1):6-12. doi: 10.1016/j.acra.2019.08.010. Epub 2019 Sep 17.

引用本文的文献

1
Automated Extraction of Key Entities from Non-English Mammography Reports Using Named Entity Recognition with Prompt Engineering.使用带有提示工程的命名实体识别从非英语乳腺钼靶报告中自动提取关键实体
Bioengineering (Basel). 2025 Feb 10;12(2):168. doi: 10.3390/bioengineering12020168.
2
Enhancing diagnosis of benign lesions and lung cancer through ensemble text and breath analysis: a retrospective cohort study.通过集成文本和呼吸分析提高良性病变和肺癌的诊断:一项回顾性队列研究。
Sci Rep. 2024 Apr 16;14(1):8731. doi: 10.1038/s41598-024-59474-w.
3
Natural Language Processing Algorithm Used for Staging Pulmonary Oncology from Free-Text Radiological Reports: "Including PET-CT and Validation Towards Clinical Use".自然语言处理算法在放射学报告中的肺肿瘤分期中的应用:“包括 PET-CT 以及向临床应用的验证”。
J Imaging Inform Med. 2024 Feb;37(1):3-12. doi: 10.1007/s10278-023-00913-x. Epub 2024 Jan 12.
4
Automatic Detection of Distant Metastasis Mentions in Radiology Reports in Spanish.自动检测西班牙语放射学报告中的远处转移提及。
JCO Clin Cancer Inform. 2024 Jan;8:e2300130. doi: 10.1200/CCI.23.00130.
5
The added value of text from Dutch general practitioner notes in predictive modeling.荷兰全科医生记录中文本在预测建模中的附加价值。
J Am Med Inform Assoc. 2023 Nov 17;30(12):1973-1984. doi: 10.1093/jamia/ocad160.
6
How Natural Language Processing Can Aid With Pulmonary Oncology Tumor Node Metastasis Staging From Free-Text Radiology Reports: Algorithm Development and Validation.自然语言处理如何借助自由文本放射学报告辅助肺肿瘤学肿瘤淋巴结转移分期:算法开发与验证
JMIR Form Res. 2023 Mar 22;7:e38125. doi: 10.2196/38125.
7
Quality Management of Pulmonary Nodule Radiology Reports Based on Natural Language Processing.基于自然语言处理的肺结节放射学报告质量管理
Bioengineering (Basel). 2022 Jun 1;9(6):244. doi: 10.3390/bioengineering9060244.
8
Transforming Thyroid Cancer Diagnosis and Staging Information from Unstructured Reports to the Observational Medical Outcome Partnership Common Data Model.将甲状腺癌诊断和分期信息从非结构化报告转化为观察性医疗结局伙伴关系通用数据模型。
Appl Clin Inform. 2022 May;13(3):521-531. doi: 10.1055/s-0042-1748144. Epub 2022 Jun 15.
9
T-staging pulmonary oncology from radiological reports using natural language processing: translating into a multi-language setting.使用自然语言处理从放射学报告中进行肺癌T分期:向多语言环境的转化
Insights Imaging. 2021 Jun 10;12(1):77. doi: 10.1186/s13244-021-01018-1.
10
Artificial Intelligence Applications to Improve the Treatment of Locally Advanced Non-Small Cell Lung Cancers.人工智能在改善局部晚期非小细胞肺癌治疗中的应用
Cancers (Basel). 2021 May 14;13(10):2382. doi: 10.3390/cancers13102382.

本文引用的文献

1
Extending the NegEx lexicon for multiple languages.扩展适用于多种语言的NegEx词汇表。
Stud Health Technol Inform. 2013;192:677-81.
2
Inventory of tools for Dutch clinical language processing.荷兰临床语言处理工具清单。
Stud Health Technol Inform. 2012;180:245-9.