Lagali Neil, Poletti Enea, Patel Dipika V, McGhee Charles N J, Hamrah Pedram, Kheirkhah Ahmad, Tavakoli Mitra, Petropoulos Ioannis N, Malik Rayaz A, Utheim Tor Paaske, Zhivov Andrey, Stachs Oliver, Falke Karen, Peschel Sabine, Guthoff Rudolf, Chao Cecilia, Golebiowski Blanka, Stapleton Fiona, Ruggeri Alfredo
Department of Ophthalmology Institute for Clinical and Experimental Medicine, Faculty of Health Sciences, Linköping University, Linköping, Sweden.
Department of Information Engineering, University of Padova, Padova, Italy.
Invest Ophthalmol Vis Sci. 2015 Aug;56(9):5102-9. doi: 10.1167/iovs.15-17284.
We examined agreement among experts in the assessment of corneal subbasal nerve tortuosity.
Images of corneal subbasal nerves were obtained from investigators at seven sites (Auckland, Boston, Linköping, Manchester, Oslo, Rostock, and Sydney) using laser-scanning in vivo confocal microscopy. A set of 30 images was assembled and ordered by increasing tortuosity by 10 expert graders from the seven sites. In a first experiment, graders assessed tortuosity without a specific definition and performed grading three times, with at least 1 week between sessions. In a second experiment, graders assessed the same image set using four focused tortuosity definitions. Intersession and intergrader repeatability for the experiments were determined using the Spearman rank correlation.
Expert graders without a specific tortuosity definition had high intersession (Spearman correlation coefficient 0.80), but poor intergrader (0.62) repeatability. Specific definitions improved intergrader repeatability to 0.79. In particular, tortuosity defined by frequent small-amplitude directional changes (short range tortuosity) or by infrequent large-amplitude directional changes (long range tortuosity), indicated largely independent measures and resulted in improved repeatability across the graders. A further refinement, grading only the most tortuous nerve in a given image, improved the average correlation of a given grader's ordering of images with the group average to 0.86 to 0.90.
Definitions of tortuosity specifying short or long-range tortuosity and considering only the most tortuous nerve in an image improved the agreement in tortuosity grading among a group of expert observers. These definitions could improve accuracy and consistency in quantifying subbasal nerve tortuosity in clinical studies.
我们研究了角膜基质下神经弯曲度评估方面专家之间的一致性。
通过激光扫描活体共聚焦显微镜从七个地点(奥克兰、波士顿、林雪平、曼彻斯特、奥斯陆、罗斯托克和悉尼)的研究人员处获取角膜基质下神经的图像。由来自七个地点的10位专家分级员将一组30张图像按弯曲度递增顺序排列。在第一个实验中,分级员在没有特定定义的情况下评估弯曲度,并进行三次分级,每次分级之间至少间隔1周。在第二个实验中,分级员使用四种聚焦的弯曲度定义评估同一组图像。使用Spearman等级相关性确定实验的分级员间和分级员内重复性。
没有特定弯曲度定义的专家分级员分级员内重复性较高(Spearman相关系数0.80),但分级员间重复性较差(0.62)。特定定义将分级员间重复性提高到0.79。特别是,由频繁的小幅度方向变化定义的弯曲度(短程弯曲度)或由不频繁的大幅度方向变化定义的弯曲度(长程弯曲度),显示出很大程度上是独立的测量方法,并提高了分级员间的重复性。进一步的改进是仅对给定图像中最弯曲的神经进行分级,这将给定分级员对图像的排序与组平均排序的平均相关性提高到0.86至0.90。
指定短程或长程弯曲度并仅考虑图像中最弯曲神经的弯曲度定义提高了一组专家观察者在弯曲度分级方面的一致性。这些定义可以提高临床研究中量化基质下神经弯曲度的准确性和一致性。