University Hospital Basel, Clinic of Radiology & Nuclear Medicine, University of Basel, Petersgraben 4, 4031 Basel, Switzerland.
University Hospital Basel, Clinic of Radiology & Nuclear Medicine, University of Basel, Petersgraben 4, 4031 Basel, Switzerland.
Eur J Radiol. 2020 Apr;125:108862. doi: 10.1016/j.ejrad.2020.108862. Epub 2020 Feb 6.
To design and evaluate a self-trainable natural language processing (NLP)-based procedure to classify unstructured radiology reports. The method enabling the generation of curated datasets is exemplified on CT pulmonary angiogram (CTPA) reports.
We extracted the impressions of CTPA reports created at our institution from 2016 to 2018 (n = 4397; language: German). The status (pulmonary embolism: yes/no) was manually labelled for all exams. Data from 2016/2017 (n = 2801) served as a ground truth to train three NLP architectures that only require a subset of reference datasets for training to be operative. The three architectures were as follows: a convolutional neural network (CNN), a support vector machine (SVM) and a random forest (RF) classifier. Impressions of 2018 (n = 1377) were kept aside and used for general performance measurements. Furthermore, we investigated the dependence of classification performance on the amount of training data with multiple simulations.
The classification performance of all three models was excellent (accuracies: 97 %-99 %; F1 scores 0.88-0.97; AUCs: 0.993-0.997). Highest accuracy was reached by the CNN with 99.1 % (95 % CI 98.5-99.6 %). Training with 470 labelled impressions was sufficient to reach an accuracy of > 93 % with all three NLP architectures.
Our NLP-based approaches allow for an automated and highly accurate retrospective classification of CTPA reports with manageable effort solely using unstructured impression sections. We demonstrated that this approach is useful for the classification of radiology reports not written in English. Moreover, excellent classification performance is achieved at relatively small training set sizes.
设计并评估一种基于自然语言处理(NLP)的可自我训练的方法,以对非结构化放射学报告进行分类。该方法能够生成经过精心整理的数据集,以 CT 肺动脉造影(CTPA)报告为例进行说明。
我们从 2016 年至 2018 年提取了我院创建的 CTPA 报告的印象(n=4397;语言:德语)。所有检查的状态(肺栓塞:是/否)均经过人工标记。2016/2017 年的数据(n=2801)用作训练三个 NLP 架构的基础事实,这些架构仅需要一小部分参考数据集即可进行训练。这三个架构分别为:卷积神经网络(CNN)、支持向量机(SVM)和随机森林(RF)分类器。2018 年的印象(n=1377)保留在一旁,用于进行一般性能测量。此外,我们还通过多次模拟研究了分类性能对训练数据量的依赖性。
所有三种模型的分类性能均非常出色(准确率:97%-99%;F1 分数:0.88-0.97;AUC:0.993-0.997)。CNN 的准确率最高,为 99.1%(95%CI 98.5-99.6%)。使用 470 个标记的印象进行训练,三种 NLP 架构的准确率均超过 93%。
我们的基于 NLP 的方法仅使用非结构化的印象部分,即可实现对 CTPA 报告的自动且高度准确的回顾性分类,并且所需工作量适中。我们证明了该方法对于非英语撰写的放射学报告的分类是有用的。此外,在相对较小的训练集尺寸下,也可以实现出色的分类性能。