Raghavan Preethi, Fosler-Lussier Eric, Lai Albert M
The Ohio State University, Columbus, OH, USA.
AMIA Annu Symp Proc. 2012;2012:1366-74. Epub 2012 Nov 3.
The manual annotation of clinical narratives is an important step for training and validating the performance of automated systems that utilize these clinical narratives. We build an annotation specification to capture medical events, and coreferences and temporal relations between medical events in clinical text. Unfortunately, the process of clinical data annotation is both time consuming and costly. Many annotation efforts have used physicians to annotate the data. We investigate using annotators that are current students or graduates from diverse clinical backgrounds with varying levels of clinical experience. In spite of this diversity, the annotation agreement across our team of annotators is high; the average inter-annotator kappa statistic for medical events, coreferences, temporal relations, and medical event concept unique identifiers was 0.843, 0.859, 0.833, and 0.806, respectively. We describe methods towards leveraging the annotations to support temporal reasoning with medical events.
临床叙述的人工标注是训练和验证利用这些临床叙述的自动化系统性能的重要步骤。我们构建了一个标注规范,以捕获医学事件以及临床文本中医学事件之间的共指关系和时间关系。不幸的是,临床数据标注过程既耗时又昂贵。许多标注工作都使用医生来标注数据。我们研究使用来自不同临床背景、具有不同临床经验水平的在校学生或毕业生作为标注人员。尽管存在这种多样性,但我们的标注团队之间的标注一致性很高;医学事件、共指关系、时间关系和医学事件概念唯一标识符的平均标注者间kappa统计量分别为0.843、0.859、0.833和0.806。我们描述了利用这些标注来支持医学事件时间推理的方法。