Zhou Li, Parsons Simon, Hripcsak George
Department of Biomedical Informatics, Columbia University, New York, NY, USA.
J Am Med Inform Assoc. 2008 Jan-Feb;15(1):99-106. doi: 10.1197/jamia.M2467. Epub 2007 Oct 18.
TimeText is a temporal reasoning system designed to represent, extract, and reason about temporal information in clinical text.
To measure the accuracy of the TimeText for processing clinical discharge summaries.
Six physicians with biomedical informatics training served as domain experts. Twenty discharge summaries were randomly selected for the evaluation. For each of the first 14 reports, 5 to 8 clinically important medical events were chosen. The temporal reasoning system generated temporal relations about the endpoints (start or finish) of pairs of medical events. Two experts (subjects) manually generated temporal relations for these medical events. The system and expert-generated results were assessed by four other experts (raters). All of the twenty discharge summaries were used to assess the system's accuracy in answering time-oriented clinical questions. For each report, five to ten clinically plausible temporal questions about events were generated. Two experts generated answers to the questions to serve as the gold standard. We wrote queries to retrieve answers from system's output.
Correctness of generated temporal relations, recall of clinically important relations, and accuracy in answering temporal questions.
The raters determined that 97% of subjects' 295 generated temporal relations were correct and that 96.5% of the system's 995 generated temporal relations were correct. The system captured 79% of 307 temporal relations determined to be clinically important by the subjects and raters. The system answered 84% of the temporal questions correctly.
The system encoded the majority of information identified by experts, and was able to answer simple temporal questions.
TimeText是一个用于表示、提取和推理临床文本中时间信息的时间推理系统。
评估TimeText处理临床出院小结的准确性。
六位接受过生物医学信息学培训的医生作为领域专家。随机选择20份出院小结进行评估。对于前14份报告中的每一份,选择5至8个具有临床重要意义的医疗事件。时间推理系统生成医疗事件对的端点(开始或结束)之间的时间关系。两位专家(受试者)手动生成这些医疗事件的时间关系。系统和专家生成的结果由另外四位专家(评估者)进行评估。所有20份出院小结都用于评估系统回答面向时间的临床问题的准确性。对于每份报告,生成5至10个关于事件的临床合理的时间问题。两位专家生成问题的答案作为金标准。我们编写查询以从系统输出中检索答案。
生成的时间关系的正确性、临床重要关系的召回率以及回答时间问题的准确性。
评估者确定受试者生成的295个时间关系中有97%是正确的,系统生成的995个时间关系中有96.5%是正确的。系统捕捉到了受试者和评估者确定为临床重要的307个时间关系中的79%。系统正确回答了84%的时间问题。
该系统编码了专家识别的大部分信息,并能够回答简单的时间问题。