Li Andrew Yu, Elliot Nikki
Department of Radiology, Canterbury District Health Board, Christchurch, New Zealand.
The Canterbury Initiative, Canterbury District Health Board, Christchurch, New Zealand.
J Med Imaging Radiat Oncol. 2019 Jun;63(3):307-310. doi: 10.1111/1754-9485.12861. Epub 2019 Feb 5.
Natural language processing (NLP) is an emerging tool which has the ability to automate data extraction from large volumes of unstructured text. One of the main described uses of NLP in radiology is cohort building for epidemiological studies. This study aims to assess the accuracy of NLP in identifying a group of patients positive for ureteric stones on Computed Tomography - Kidneys, Ureter, Bladder (CT KUB) reports.
Retrospective review of all CT KUB reports in a single calendar year. A locally available NLP tool was used to automatically classify the reports based on positivity for ureteric stones. This was validated by manual review and refined to maximize the accuracy of stone detection.
A total of 1874 CT KUB reports were identified. Manual classification of ureteric stone positivity was 36% compared with 27% using NLP. The accuracy of NLP was 85% with a sensitivity of 66% and specificity of 95%. Incorrect classification was due to misspellings, variable syntax, terminology, pluralization and the inability to exclude clinical request details from the search algorithm.
Our NLP tool demonstrated high specificity but low sensitivity at identifying CT KUB reports that are positive for ureteric stones. This was attributable to the lack of feature extraction tools tailored for analysing radiology text, incompleteness of the medical lexicon database and heterogeneity of unstructured reports. Improvements in these areas will help improve data extraction accuracy.
自然语言处理(NLP)是一种新兴工具,能够自动从大量非结构化文本中提取数据。NLP在放射学中的主要应用之一是为流行病学研究构建队列。本研究旨在评估NLP在计算机断层扫描——肾脏、输尿管、膀胱(CT KUB)报告中识别输尿管结石阳性患者组的准确性。
回顾单个日历年内所有CT KUB报告。使用本地可用的NLP工具根据输尿管结石阳性情况自动对报告进行分类。通过人工审核进行验证,并进行优化以最大限度提高结石检测的准确性。
共识别出1874份CT KUB报告。输尿管结石阳性的人工分类为36%,而使用NLP的分类为27%。NLP的准确性为85%,敏感性为66%,特异性为95%。分类错误是由于拼写错误、语法多变、术语、复数形式以及搜索算法无法排除临床请求细节。
我们的NLP工具在识别输尿管结石阳性的CT KUB报告时显示出高特异性但低敏感性。这归因于缺乏专门用于分析放射学文本的特征提取工具、医学词汇数据库的不完整性以及非结构化报告的异质性。这些领域的改进将有助于提高数据提取的准确性。