Asfeldt Anne Mette, Straume Bjørn, Paulssen Eyvind J
Institute of Community Medicine, University of Tromsø, Norway.
Scand J Gastroenterol. 2007 Sep;42(9):1106-12. doi: 10.1080/00365520701259240.
Endoscopy is an observer-dependent diagnostic method, which, until recently, has lacked precise guidelines for written reports. There is an increasing demand for improvement in endoscopy records, which may necessitate the supplementation of image documentation. The aim of this study was to estimate interobserver as well as intra-observer variability in the assessment of images from gastroscopy.
We designed an Internet interface presenting endoscopy images, accompanied by a multiple-choice questionnaire for assessing pathology in the images. Ten images from the distal oesophagus and 10 images from the pyloric antrum were chosen. In order to study interobserver variability, physicians with varying endoscopy experience were invited to complete the questionnaire. The physicians were re-invited 5 months later to assess the same images again, this time in order to assess intra-observer variability. Kappa statistics were used for analysis of agreement.
Initially, 13 of 20 invited physicians responded. Interobserver agreement varied between poor (kappa<0.2) and moderate (0.4<kappa<0.6). In the second part of the study, 10 of 11 invited physicians responded. Intra-observer agreement varied between moderate (0.4<kappa<0.6) and good (0.6<kappa<0.8). A higher level of experience does not imply either better interobserver or better intra-observer agreement. Images of concise endoscopy findings, such as the presence of an ulcer, resulted in better agreement than did the assessment of images of less definable findings.
The variability in the interpretation of endoscopy images is large. We therefore believe that systematic inclusion of a set of images into endoscopy reports will improve their quality.
内镜检查是一种依赖观察者的诊断方法,直到最近,它仍缺乏关于书面报告的精确指南。对改进内镜检查记录的需求日益增加,这可能需要补充图像文档。本研究的目的是评估胃镜图像评估中的观察者间以及观察者内的变异性。
我们设计了一个呈现内镜图像的互联网界面,并附带一份用于评估图像中病理情况的多项选择题问卷。从食管远端选取了10张图像,从幽门窦选取了10张图像。为了研究观察者间的变异性,邀请了具有不同内镜检查经验的医生完成问卷。5个月后再次邀请这些医生评估相同的图像,这次是为了评估观察者内的变异性。使用kappa统计量分析一致性。
最初,20名受邀医生中有13名做出了回应。观察者间的一致性在差(kappa<0.2)到中等(0.4<kappa<0.6)之间变化。在研究的第二部分,11名受邀医生中有10名做出了回应。观察者内的一致性在中等(0.4<kappa<0.6)到良好(0.6<kappa<0.8)之间变化。较高的经验水平并不意味着观察者间或观察者内的一致性更好。简洁的内镜检查结果图像,如溃疡的存在,比难以明确的检查结果图像评估的一致性更好。
内镜图像解读的变异性很大。因此,我们认为在内镜检查报告中系统地纳入一组图像将提高其质量。