University of A Coruña, Department of Computer Science, Campus de Elviña, A Coruña, 15071, Spain.
CITIC-Research Center of Information and Communication Technologies, University of A Coruña, A Coruña, Spain.
BMC Med Res Methodol. 2018 Nov 20;18(1):144. doi: 10.1186/s12874-018-0598-3.
The retinal vascular tortuosity can be a potential indicator of relevant vascular and non-vascular diseases. However, the lack of a precise and standard guide for the tortuosity evaluation hinders its use for diagnostic and treatment purposes. This work aims to advance in the standardization of the retinal vascular tortuosity as a clinical biomarker with diagnostic potential, allowing, thereby, the validation of objective computational measurements on the basis of the entire spectrum of the expert knowledge.
This paper describes a multi-expert validation process of the computational vascular tortuosity measurements of reference. A group of five experts, covering the different clinical profiles of an ophthalmological service, and a four-grade scale from non-tortuous to severe tortuosity as well as non-tortuous / tortuous and asymptomatic / symptomatic binary classifications are considered for the analysis of the the multi-expert validation procedure. The specialists rating process comprises two rounds involving all the experts and a joint round to establish consensual rates. The expert agreement is analyzed throughout the rating procedure and, then, the consensual rates are set as the reference to validate the prognostic performance of four computational tortuosity metrics of reference.
The Kappa indexes for the intra-rater agreement analysis were obtained between 0.35 and 0.83 whereas for the inter-rater agreement in the asymptomatic / symptomatic classification were between 0.22 and 0.76. The Area Under the Curve (AUC) for each expert against the consensual rates were placed between 0.61 and 0.83 whereas the prognostic performance of the best objective tortuosity metric was 0.80.
There is a high inter and intra-rater variability, especially for the case of the four grade scale. The prognostic performance of the tortuosity measurements is close to the experts' performance, especially for Grisan measurement. However, there is a gap between the automatic effectiveness and the expert perception given the lack of clinical criteria in the computational measurements.
视网膜血管迂曲度可以作为相关血管和非血管疾病的潜在指标。然而,缺乏精确和标准的迂曲度评估指南,限制了其在诊断和治疗中的应用。本研究旨在推进视网膜血管迂曲度作为具有诊断潜力的临床生物标志物的标准化,从而能够基于专家知识的整体范围验证客观的计算测量。
本文描述了一种参考计算血管迂曲度的多专家验证过程。一组五名专家涵盖了眼科服务的不同临床特征,采用四级评分标准(从不迂曲到严重迂曲)以及非迂曲/迂曲和无症状/有症状的二元分类进行分析。专家评分过程包括两轮,涉及所有专家,以及一轮联合评分以建立共识评分。分析评分过程中的专家一致性,并将共识评分作为参考,验证四种参考计算迂曲度指标的预后性能。
内部观察者一致性分析的 Kappa 指数在 0.35 到 0.83 之间,而无症状/有症状分类的外部观察者一致性指数在 0.22 到 0.76 之间。每位专家与共识评分的曲线下面积(AUC)在 0.61 到 0.83 之间,而最佳客观迂曲度指标的预后性能为 0.80。
存在较高的内部和外部观察者变异性,尤其是在四级评分标准的情况下。迂曲度测量的预后性能接近专家的表现,尤其是对于 Grisan 测量。然而,由于计算测量中缺乏临床标准,自动有效性与专家感知之间存在差距。