Verhoeven G, Coopman F, Duchateau L, Saunders J H, van Rijssen B, van Bree H
Department of Medical Imaging, Ghent University, Salisburylaan 133, 9130 Merelbeke, Belgium.
J Small Anim Pract. 2007 Jul;48(7):387-93. doi: 10.1111/j.1748-5827.2007.00364.x.
To determine the agreement between observers and to investigate the effect of observer experience in diagnosing canine hip dysplasia and providing final scoring of hips using the standard ventrodorsal hip-extended radiographic method. The agreement of the final scoring, with a presumed correct assessment based on the Norberg angle, is also investigated.
Thirty observers were asked to read 50 ventrodorsal hip-extended radiographs of 25 dogs according to Fédération Cynologique Internationale criteria. The observers were divided into an experienced group (nine members) and an inexperienced group (21 members).
For the distinction between dysplastic and non-dysplastic dogs, the average interobserver agreement was 72 per cent, significantly higher (P<0.0001) than the agreement expected by chance alone. For the final score (A, B, C, D or E), the average interobserver agreement was 43.6 per cent. In the experienced group, agreement was 76 per cent for the distinction between AB and non-AB and 81 per cent for the distinction between C and non-C. In all cases, agreement was significantly higher (P<0.0001) in the experienced group than in the inexperienced group. Agreement between the presumed correct assessment based on the Norberg angle and the observers' evaluations was low (P=0.35), irrespective of whether the observers were experienced (71.8 per cent correct assessments) or inexperienced (69 per cent correct assessments).
Although interobserver agreement is low, observer experience increases agreement.
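The agreement percentages reported above can be illustrated with a minimal Python sketch. It assumes that "average interobserver agreement" means the mean proportion of radiographs on which each pair of observers gave the same label; the abstract does not state the exact statistic used, so this is only one plausible reading. The function names, the example scores, and the choice of C, D and E as the dysplastic grades are hypothetical and are not taken from the study.

```python
from itertools import combinations


def mean_pairwise_agreement(ratings):
    """Mean proportion of items on which each pair of observers agrees.

    ratings: one list of labels per observer, all of the same length.
    (Assumed definition; the abstract does not specify the statistic.)
    """
    n_items = len(ratings[0])
    if any(len(r) != n_items for r in ratings):
        raise ValueError("every observer must score the same radiographs")
    pair_scores = []
    for a, b in combinations(ratings, 2):
        matches = sum(x == y for x, y in zip(a, b))
        pair_scores.append(matches / n_items)
    return sum(pair_scores) / len(pair_scores)


def dichotomise(ratings, positive=frozenset({"C", "D", "E"})):
    """Collapse final scores to dysplastic (True) versus non-dysplastic (False).

    Treating C, D and E as dysplastic is an assumption made for illustration.
    """
    return [[label in positive for label in r] for r in ratings]


# Hypothetical scores from three observers on four radiographs.
ratings = [
    ["A", "B", "C", "D"],
    ["A", "C", "C", "E"],
    ["B", "B", "C", "D"],
]

final_score_agreement = mean_pairwise_agreement(ratings)
binary_agreement = mean_pairwise_agreement(dichotomise(ratings))

print(f"final score agreement: {final_score_agreement:.3f}")  # 0.500
print(f"dysplastic vs non-dysplastic: {binary_agreement:.3f}")  # 0.833
```

In this toy example, agreement on the five-level final score is lower than agreement on the two-level distinction, which matches the direction of the reported results (43.6 per cent versus 72 per cent), since observers who differ by one grade can still agree on the coarser classification.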