Automatic text classification of prostate cancer malignancy scores in radiology reports using NLP models.

Affiliations

Department of Computer Science, Advanced Studies Center in ICT (CEATIC), Universidad de Jaén, Campus Las Lagunillas, Jaén, 23071, Spain.

Natural Language Processing Unit, HT Médica, Carmelo Torres, nº 2, Jaén, 23007, Spain.

Publication information

Med Biol Eng Comput. 2024 Nov;62(11):3373-3383. doi: 10.1007/s11517-024-03131-x. Epub 2024 Jun 7.

DOI:10.1007/s11517-024-03131-x

PMID:38844661

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11485118/

Abstract

This paper presents two automated text classification systems that classify prostate cancer findings according to the PI-RADS criteria. The first is a traditional machine learning model based on XGBoost; the second is a language model-based approach using RoBERTa. The study focuses on Spanish-language prostate MRI radiology reports, a setting that had not been explored before. Both systems achieve promising results, with the RoBERTa model outperforming the XGBoost model. The best-performing system was integrated into the radiology company's information systems as an API, where it operates in a real-world environment.
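As an illustration of how two such classifiers might be compared, the sketch below computes macro-averaged F1 over the five PI-RADS categories (1 to 5). The abstract does not state which metric the authors used, so the choice of macro-F1 is an assumption, and the function name `macro_f1`, the variable names, and the toy predictions are all hypothetical and invented for illustration only.

```python
from collections import Counter

# PI-RADS assessment categories run from 1 to 5.
PIRADS_LABELS = [1, 2, 3, 4, 5]


def macro_f1(gold, predicted, labels=PIRADS_LABELS):
    """Return the unweighted mean of per-class F1 scores."""
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")
    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, predicted):
        if g == p:
            tp[g] += 1          # correct prediction for class g
        else:
            fp[p] += 1          # class p was predicted wrongly
            fn[g] += 1          # class g was missed
    scores = []
    for label in labels:
        denom = 2 * tp[label] + fp[label] + fn[label]
        # A class absent from both gold and predictions scores 0 here.
        scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Toy data, invented for illustration only (not results from the paper).
gold = [1, 2, 3, 4, 5, 4, 3, 2]
system_a = [1, 2, 3, 4, 5, 4, 3, 2]  # hypothetical system with no errors
system_b = [1, 2, 2, 4, 5, 5, 3, 2]  # hypothetical system with two errors

if __name__ == "__main__":
    print("system_a macro-F1:", macro_f1(gold, system_a))
    print("system_b macro-F1:", macro_f1(gold, system_b))
```

Macro averaging weights every PI-RADS category equally, which can matter when some categories are rare in the report collection; a different metric could be substituted without changing the structure of the comparison.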

Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69b2/11485118/1ddd9c29e287/11517_2024_3131_Fig1_HTML.jpg
