Department of Radiology, University of Iowa Hospitals and Clinics, 200 Hawkins Drive, Iowa City, IA 52242, USA.
J Digit Imaging. 2011 Apr;24(2):234-42. doi: 10.1007/s10278-009-9250-4. Epub 2009 Nov 10.
Radiology reports contain information that can be mined using a search engine for teaching, research, and quality assurance purposes. Current search engines look for exact matches to the search term, but they do not differentiate between reports in which the search term appears in a positive context (i.e., being present) from those in which the search term appears in the context of negation and uncertainty. We describe RadReportMiner, a context-aware search engine, and compare its retrieval performance with a generic search engine, Google Desktop. We created a corpus of 464 radiology reports which described at least one of five findings (appendicitis, hydronephrosis, fracture, optic neuritis, and pneumonia). Each report was classified by a radiologist as positive (finding described to be present) or negative (finding described to be absent or uncertain). The same reports were then classified by RadReportMiner and Google Desktop. RadReportMiner achieved a higher precision (81%), compared with Google Desktop (27%; p < 0.0001). RadReportMiner had a lower recall (72%) compared with Google Desktop (87%; p = 0.006). We conclude that adding negation and uncertainty identification to a word-based radiology report search engine improves the precision of search results over a search engine that does not take this information into account. Our approach may be useful to adopt into current report retrieval systems to help radiologists to more accurately search for radiology reports.
放射学报告包含可通过搜索引擎挖掘的信息,可用于教学、研究和质量保证目的。当前的搜索引擎会查找与搜索词完全匹配的内容,但它们无法区分搜索词出现在肯定语境(即存在)和否定语境及不确定语境中的报告。我们描述了 RadReportMiner,这是一种上下文感知搜索引擎,并将其检索性能与通用搜索引擎 Google Desktop 进行了比较。我们创建了一个包含 464 份放射学报告的语料库,这些报告至少描述了五种发现之一(阑尾炎、肾积水、骨折、视神经炎和肺炎)。每位放射科医生对每份报告的分类为阳性(描述为存在的发现)或阴性(描述为不存在或不确定的发现)。然后,RadReportMiner 和 Google Desktop 对相同的报告进行了分类。与 Google Desktop(27%;p < 0.0001)相比,RadReportMiner 的准确率更高(81%)。与 Google Desktop(87%;p = 0.006)相比,RadReportMiner 的召回率较低(72%)。我们的结论是,在基于字词的放射学报告搜索引擎中添加否定和不确定性识别功能可提高搜索结果的准确性,而不考虑这些信息的搜索引擎则无法实现这一点。我们的方法可能有助于采用当前的报告检索系统,帮助放射科医生更准确地搜索放射学报告。