基于自然语言处理的成像协议分配：使用指示文本数据进行多类分类的腹部 CT 协议的机器学习。

Natural Language Processing for Imaging Protocol Assignment: Machine Learning for Multiclass Classification of Abdominal CT Protocols Using Indication Text Data.

机构信息

Imaging Institute, Cleveland Clinic Foundation, 9500 Euclid Ave., P34, Cleveland, OH, 44195, USA.

出版信息

J Digit Imaging. 2022 Oct;35(5):1120-1130. doi: 10.1007/s10278-022-00633-8. Epub 2022 Jun 2.

DOI:10.1007/s10278-022-00633-8

PMID:35654878

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9582109/

Abstract

A correct protocol assignment is critical to high-quality imaging examinations, and its automation can be amenable to natural language processing (NLP). Assigning protocols for abdominal imaging CT scans is particularly challenging given the multiple organ specific indications and parameters. We compared conventional machine learning, deep learning, and automated machine learning builder workflows for this multiclass text classification task. A total of 94,501 CT studies performed over 4 years and their assigned protocols were obtained. Text data associated with each study including the ordering provider generated free text study indication and ICD codes were used for NLP analysis and protocol class prediction. The data was classified into one of 11 abdominal CT protocol classes before and after augmentations used to account for imbalances in the class sample sizes. Four machine learning (ML) algorithms, one deep learning algorithm, and an automated machine learning (AutoML) builder were used for the multilabel classification task: Random Forest (RF), Tree Ensemble (TE), Gradient Boosted Tree (GBT), multi-layer perceptron (MLP), Universal Language Model Fine-tuning (ULMFiT), and Google's AutoML builder (Alphabet, Inc., Mountain View, CA), respectively. On the unbalanced dataset, the manually coded algorithms all performed similarly with F1 scores of 0.811 for RF, 0.813 for TE, 0.813 for GBT, 0.828 for MLP, and 0.847 for ULMFiT. The AutoML builder performed better with a F1 score of 0.854. On the balanced dataset, the tree ensemble machine learning algorithm performed the best with an F1 score of 0.803 and a Cohen's kappa of 0.612. AutoML methods took a longer time for completion of NLP model training and evaluation, 4 h and 45 min compared to an average of 51 min for manual methods. Machine learning and natural language processing can be used for the complex multiclass classification task of abdominal imaging CT scan protocol assignment.

摘要

正确的协议分配对高质量的成像检查至关重要，其自动化可以采用自然语言处理（NLP）。由于存在多种特定于器官的适应症和参数，因此为腹部成像 CT 扫描分配协议尤其具有挑战性。我们比较了传统机器学习、深度学习和自动化机器学习构建器工作流程在这个多类别文本分类任务中的表现。总共获得了在 4 年期间进行的 94501 项 CT 研究及其分配的协议。与每个研究相关的文本数据包括生成的免费文本研究指示和 ICD 代码的订购提供程序，用于 NLP 分析和协议分类预测。该数据在进行扩充之前和之后被分类为 11 种腹部 CT 协议类别之一，以弥补类别样本量的不平衡。对于多标签分类任务，使用了四种机器学习（ML）算法、一种深度学习算法和一个自动化机器学习（AutoML）构建器：随机森林（RF）、树集成（TE）、梯度提升树（GBT）、多层感知器（MLP）、通用语言模型微调（ULMFiT）和 Google 的 AutoML 构建器（Alphabet，Inc.，Mountain View，CA）。在不平衡数据集上，手动编码算法的 F1 分数均相似，RF 为 0.811，TE 为 0.813，GBT 为 0.813，MLP 为 0.828，ULMFiT 为 0.847。AutoML 构建器的性能更好，F1 分数为 0.854。在平衡数据集上，树集成机器学习算法的表现最佳，F1 得分为 0.803，Cohen's kappa 为 0.612。AutoML 方法完成 NLP 模型训练和评估所需的时间更长，为 4 小时 45 分钟，而手动方法的平均时间为 51 分钟。机器学习和自然语言处理可用于腹部成像 CT 扫描协议分配的复杂多类别分类任务。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c552/9582109/0ee3292f9483/10278_2022_633_Fig1_HTML.jpg

相似文献

Natural Language Processing for Imaging Protocol Assignment: Machine Learning for Multiclass Classification of Abdominal CT Protocols Using Indication Text Data.基于自然语言处理的成像协议分配：使用指示文本数据进行多类分类的腹部 CT 协议的机器学习。

J Digit Imaging. 2022 Oct;35(5):1120-1130. doi: 10.1007/s10278-022-00633-8. Epub 2022 Jun 2.

Automatic medical protocol classification using machine learning approaches.使用机器学习方法进行自动医疗协议分类。

Comput Methods Programs Biomed. 2021 Mar;200:105939. doi: 10.1016/j.cmpb.2021.105939. Epub 2021 Jan 16.

A clinical text classification paradigm using weak supervision and deep representation.一种使用弱监督和深度表示的临床文本分类范式。

BMC Med Inform Decis Mak. 2019 Jan 7;19(1):1. doi: 10.1186/s12911-018-0723-6.

Comparison of an Ensemble of Machine Learning Models and the BERT Language Model for Analysis of Text Descriptions of Brain CT Reports to Determine the Presence of Intracranial Hemorrhage.基于机器学习模型集成与 BERT 语言模型的脑 CT 报告文本描述分析用于判断颅内出血的比较研究

Sovrem Tekhnologii Med. 2024;16(1):27-34. doi: 10.17691/stm2024.16.1.03. Epub 2024 Feb 28.

Machine Learning for Automation of Radiology Protocols for Quality and Efficiency Improvement.机器学习在放射学协议自动化中的应用，以提高质量和效率。

J Am Coll Radiol. 2020 Sep;17(9):1149-1158. doi: 10.1016/j.jacr.2020.03.012. Epub 2020 Apr 9.

Transformer versus traditional natural language processing: how much data is enough for automated radiology report classification?Transformer 与传统自然语言处理：自动化放射科报告分类需要多少数据？

Br J Radiol. 2023 Sep;96(1149):20220769. doi: 10.1259/bjr.20220769. Epub 2023 May 25.

Development of machine learning and natural language processing algorithms for preoperative prediction and automated identification of intraoperative vascular injury in anterior lumbar spine surgery.开发机器学习和自然语言处理算法，用于在前路腰椎手术中进行术前预测和术中血管损伤的自动识别。

Spine J. 2021 Oct;21(10):1635-1642. doi: 10.1016/j.spinee.2020.04.001. Epub 2020 Apr 12.

Integrating Natural Language Processing and Machine Learning Algorithms to Categorize Oncologic Response in Radiology Reports.将自然语言处理和机器学习算法集成到放射学报告中的肿瘤反应分类中。

J Digit Imaging. 2018 Apr;31(2):178-184. doi: 10.1007/s10278-017-0027-x.

Automatic Determination of the Need for Intravenous Contrast in Musculoskeletal MRI Examinations Using IBM Watson's Natural Language Processing Algorithm.使用 IBM Watson 的自然语言处理算法自动确定肌肉骨骼 MRI 检查中是否需要静脉造影。

J Digit Imaging. 2018 Apr;31(2):245-251. doi: 10.1007/s10278-017-0021-3.

Social Reminiscence in Older Adults' Everyday Conversations: Automated Detection Using Natural Language Processing and Machine Learning.老年人日常对话中的社会怀旧：使用自然语言处理和机器学习的自动检测。

J Med Internet Res. 2020 Sep 15;22(9):e19133. doi: 10.2196/19133.

引用本文的文献

Efficacy of Fine-Tuned Large Language Model in CT Protocol Assignment as Clinical Decision-Supporting System.微调大语言模型在CT检查方案分配中作为临床决策支持系统的有效性

J Imaging Inform Med. 2025 Feb 5. doi: 10.1007/s10278-025-01433-6.

Traditional Machine Learning, Deep Learning, and BERT (Large Language Model) Approaches for Predicting Hospitalizations From Nurse Triage Notes: Comparative Evaluation of Resource Management.用于根据护士分诊记录预测住院情况的传统机器学习、深度学习和BERT（大语言模型）方法：资源管理的比较评估

JMIR AI. 2024 Aug 27;3:e52190. doi: 10.2196/52190.

The Fine-Tuned Large Language Model for Extracting the Progressive Bone Metastasis from Unstructured Radiology Reports.用于从非结构化放射学报告中提取进行性骨转移的微调大语言模型。

J Imaging Inform Med. 2025 Apr;38(2):865-872. doi: 10.1007/s10278-024-01242-3. Epub 2024 Aug 26.

本文引用的文献

Deep Learning-Based Natural Language Processing in Radiology: The Impact of Report Complexity, Disease Prevalence, Dataset Size, and Algorithm Type on Model Performance.深度学习在放射学中的自然语言处理：报告复杂性、疾病流行率、数据集大小和算法类型对模型性能的影响。

J Med Syst. 2021 Sep 4;45(10):91. doi: 10.1007/s10916-021-01761-4.

Combat COVID-19 infodemic using explainable natural language processing models.使用可解释的自然语言处理模型应对新冠疫情信息疫情。

Inf Process Manag. 2021 Jul;58(4):102569. doi: 10.1016/j.ipm.2021.102569. Epub 2021 Mar 6.

Essential Elements of Natural Language Processing: What the Radiologist Should Know.自然语言处理的基本要素：放射科医生应该知道的内容。

Acad Radiol. 2020 Jan;27(1):6-12. doi: 10.1016/j.acra.2019.08.010. Epub 2019 Sep 17.

The Increasing Use of Emergency Department Imaging in the United States: Is It Appropriate?美国急诊部门影像学应用的增加：是否恰当？

AJR Am J Roentgenol. 2019 Oct;213(4):W180-W184. doi: 10.2214/AJR.19.21386. Epub 2019 Jun 25.

A clinical text classification paradigm using weak supervision and deep representation.一种使用弱监督和深度表示的临床文本分类范式。

BMC Med Inform Decis Mak. 2019 Jan 7;19(1):1. doi: 10.1186/s12911-018-0723-6.

Imbalanced Deep Learning by Minority Class Incremental Rectification.通过少数类增量校正实现不平衡深度学习。

IEEE Trans Pattern Anal Mach Intell. 2019 Jun;41(6):1367-1381. doi: 10.1109/TPAMI.2018.2832629. Epub 2018 May 3.

Reducing interruptions during duty radiology shifts, assessment of its benefits and review of factors affecting the radiology working environment.减少值班放射科轮班时的中断，评估其益处，并审查影响放射科工作环境的因素。

Clin Radiol. 2018 Aug;73(8):759.e19-759.e25. doi: 10.1016/j.crad.2018.04.007. Epub 2018 May 28.

Efficiency Improvement in a Busy Radiology Practice: Determination of Musculoskeletal Magnetic Resonance Imaging Protocol Using Deep-Learning Convolutional Neural Networks.繁忙放射科实践中的效率提升：使用深度学习卷积神经网络确定肌肉骨骼磁共振成像方案。

J Digit Imaging. 2018 Oct;31(5):604-610. doi: 10.1007/s10278-018-0066-y.

J Digit Imaging. 2018 Apr;31(2):245-251. doi: 10.1007/s10278-017-0021-3.

Workflow Dynamics and the Imaging Value Chain: Quantifying the Effect of Designating a Nonimage-Interpretive Task Workflow.工作流程动态与影像价值链：量化指定非影像解读任务工作流程的影响

Curr Probl Diagn Radiol. 2017 Jul-Aug;46(4):275-281. doi: 10.1067/j.cpradiol.2016.11.010. Epub 2016 Nov 15.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于自然语言处理的成像协议分配：使用指示文本数据进行多类分类的腹部 CT 协议的机器学习。

Natural Language Processing for Imaging Protocol Assignment: Machine Learning for Multiclass Classification of Abdominal CT Protocols Using Indication Text Data.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献