Suppr超能文献

利用自然语言处理技术在放射学报告中识别输尿管结石。

Natural language processing to identify ureteric stones in radiology reports.

作者信息

Li Andrew Yu, Elliot Nikki

机构信息

Department of Radiology, Canterbury District Health Board, Christchurch, New Zealand.

The Canterbury Initiative, Canterbury District Health Board, Christchurch, New Zealand.

出版信息

J Med Imaging Radiat Oncol. 2019 Jun;63(3):307-310. doi: 10.1111/1754-9485.12861. Epub 2019 Feb 5.

Abstract

INTRODUCTION

Natural language processing (NLP) is an emerging tool which has the ability to automate data extraction from large volumes of unstructured text. One of the main described uses of NLP in radiology is cohort building for epidemiological studies. This study aims to assess the accuracy of NLP in identifying a group of patients positive for ureteric stones on Computed Tomography - Kidneys, Ureter, Bladder (CT KUB) reports.

METHODS

Retrospective review of all CT KUB reports in a single calendar year. A locally available NLP tool was used to automatically classify the reports based on positivity for ureteric stones. This was validated by manual review and refined to maximize the accuracy of stone detection.

RESULTS

A total of 1874 CT KUB reports were identified. Manual classification of ureteric stone positivity was 36% compared with 27% using NLP. The accuracy of NLP was 85% with a sensitivity of 66% and specificity of 95%. Incorrect classification was due to misspellings, variable syntax, terminology, pluralization and the inability to exclude clinical request details from the search algorithm.

CONCLUSIONS

Our NLP tool demonstrated high specificity but low sensitivity at identifying CT KUB reports that are positive for ureteric stones. This was attributable to the lack of feature extraction tools tailored for analysing radiology text, incompleteness of the medical lexicon database and heterogeneity of unstructured reports. Improvements in these areas will help improve data extraction accuracy.

摘要

引言

自然语言处理(NLP)是一种新兴工具,能够自动从大量非结构化文本中提取数据。NLP在放射学中的主要应用之一是为流行病学研究构建队列。本研究旨在评估NLP在计算机断层扫描——肾脏、输尿管、膀胱(CT KUB)报告中识别输尿管结石阳性患者组的准确性。

方法

回顾单个日历年内所有CT KUB报告。使用本地可用的NLP工具根据输尿管结石阳性情况自动对报告进行分类。通过人工审核进行验证,并进行优化以最大限度提高结石检测的准确性。

结果

共识别出1874份CT KUB报告。输尿管结石阳性的人工分类为36%,而使用NLP的分类为27%。NLP的准确性为85%,敏感性为66%,特异性为95%。分类错误是由于拼写错误、语法多变、术语、复数形式以及搜索算法无法排除临床请求细节。

结论

我们的NLP工具在识别输尿管结石阳性的CT KUB报告时显示出高特异性但低敏感性。这归因于缺乏专门用于分析放射学文本的特征提取工具、医学词汇数据库的不完整性以及非结构化报告的异质性。这些领域的改进将有助于提高数据提取的准确性。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验