Schadow Gunther, McDonald Clement J
Regenstrief Institute and Indiana University School of Medicine, Indianapolis, IN, USA.
AMIA Annu Symp Proc. 2003;2003:584-8.
We have developed a method that extracts structured information about specimens and their related findings from free-text surgical pathology reports. Our method uses regular expressions that drive a state automaton implemented on top of XSLT and Java. The text fragments it identifies are coded against the Unified Medical Language System (UMLS). This paper describes the technical approach and reports on a preliminary evaluation study designed to guide further development. Of 275 reviewed reports, 91% were coded at least well enough that all specimens and their critical pathologic findings were represented in codes.
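The general idea of regular expressions driving a state automaton can be illustrated with a minimal sketch. It is written in Python for brevity and is not the authors' XSLT and Java implementation. The line patterns, the two states, the sample report, and the lookup table `TOY_CODES` (a stand-in for UMLS coding, with placeholder identifiers rather than real UMLS concept identifiers) are all assumptions made for illustration.

```python
import re
from dataclasses import dataclass, field
from typing import List, Optional, Tuple

# Hypothetical line patterns; the abstract does not give the actual expressions.
SPECIMEN_HEADER = re.compile(r"^\s*(?:[A-Z]|\d+)[.)]\s*(?P<specimen>.+?):\s*$")
FINDING_LINE = re.compile(r"^\s*-\s*(?P<finding>.+?)\.?\s*$")

# Toy lookup standing in for UMLS coding; identifiers are placeholders, not real codes.
TOY_CODES = {
    "skin, left arm": "T-0001",
    "lymph node, axillary": "T-0002",
    "basal cell carcinoma": "F-0001",
    "no evidence of malignancy": "F-0002",
}


@dataclass
class Specimen:
    name: str
    code: Optional[str]
    findings: List[Tuple[str, Optional[str]]] = field(default_factory=list)


def code_fragment(text: str) -> Optional[str]:
    """Return the toy code for a text fragment, or None if it cannot be coded."""
    return TOY_CODES.get(text.strip().lower())


def parse_report(text: str) -> List[Specimen]:
    """Scan the report line by line; regex matches trigger state transitions."""
    state = "OUTSIDE"  # not currently inside any specimen section
    specimens: List[Specimen] = []
    for line in text.splitlines():
        header = SPECIMEN_HEADER.match(line)
        if header:
            # A specimen header opens a new section from either state.
            name = header.group("specimen")
            specimens.append(Specimen(name, code_fragment(name)))
            state = "IN_SPECIMEN"
            continue
        if state == "IN_SPECIMEN":
            finding = FINDING_LINE.match(line)
            if finding:
                # Attach the finding to the most recently opened specimen.
                fragment = finding.group("finding")
                specimens[-1].findings.append((fragment, code_fragment(fragment)))
            elif not line.strip():
                # A blank line closes the current specimen section.
                state = "OUTSIDE"
    return specimens


# Invented sample text, not taken from the study's reports.
SAMPLE_REPORT = "\n".join([
    "FINAL DIAGNOSIS",
    "A. Skin, left arm:",
    "- Basal cell carcinoma.",
    "- Margins free of tumor.",
    "",
    "B. Lymph node, axillary:",
    "- No evidence of malignancy.",
])

if __name__ == "__main__":
    for specimen in parse_report(SAMPLE_REPORT):
        print(specimen.name, specimen.code, specimen.findings)
```

In this sketch a fragment with no entry in the lookup table, such as "Margins free of tumor", is kept with a code of `None`, so uncoded findings remain visible for review instead of being dropped.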