Bering Limited, London, United Kingdom.
Emergency Department, Queen Elizabeth University Hospital, Glasgow, Scotland.
PLoS One. 2020 Mar 10;15(3):e0229963. doi: 10.1371/journal.pone.0229963. eCollection 2020.
Chest radiography (CXR) is the most commonly used imaging modality and deep neural network (DNN) algorithms have shown promise in effective triage of normal and abnormal radiograms. Typically, DNNs require large quantities of expertly labelled training exemplars, which in clinical contexts is a major bottleneck to effective modelling, as both considerable clinical skill and time is required to produce high-quality ground truths. In this work we evaluate thirteen supervised classifiers using two large free-text corpora and demonstrate that bi-directional long short-term memory (BiLSTM) networks with attention mechanism effectively identify Normal, Abnormal, and Unclear CXR reports in internal (n = 965 manually-labelled reports, f1-score = 0.94) and external (n = 465 manually-labelled reports, f1-score = 0.90) testing sets using a relatively small number of expert-labelled training observations (n = 3,856 annotated reports). Furthermore, we introduce a general unsupervised approach that accurately distinguishes Normal and Abnormal CXR reports in a large unlabelled corpus. We anticipate that the results presented in this work can be used to automatically extract standardized clinical information from free-text CXR radiological reports, facilitating the training of clinical decision support systems for CXR triage.
胸部 X 线摄影(CXR)是最常用的成像方式,深度神经网络(DNN)算法在有效分诊正常和异常射线照片方面显示出了前景。通常,DNN 需要大量经过专业标记的训练示例,这在临床环境中是有效建模的主要瓶颈,因为需要相当多的临床技能和时间来生成高质量的地面真相。在这项工作中,我们使用两个大型自由文本语料库评估了十三个有监督分类器,并证明具有注意力机制的双向长短期记忆(BiLSTM)网络可以有效地识别内部(n = 965 份手动标记报告,f1 分数 = 0.94)和外部(n = 465 份手动标记报告,f1 分数 = 0.90)测试集中的正常、异常和不明确的 CXR 报告,使用相对较少的专家标记训练观察(n = 3856 个注释报告)。此外,我们引入了一种通用的无监督方法,可以在大型未标记语料库中准确地区分正常和异常的 CXR 报告。我们预计,本工作中提出的结果可用于从自由文本 CXR 放射报告中自动提取标准化临床信息,从而为 CXR 分诊的临床决策支持系统的培训提供便利。