Schmitter M, Kress B, Hähnel S, Rammelsberg P
Poliklinik für zahnärztliche Prothetik, Im Neuenheimer Feld 400, D-69120 Heidelberg, Germany.
Dentomaxillofac Radiol. 2004 Jul;33(4):253-8. doi: 10.1259/dmfr/60552229.
Effects of calibration on interrater agreement in evaluating magnetic resonance (MR) images of the temporomandibular joint (TMJ) have already been examined. The objectives of the present study were to assess to what extent the quality of MR images of the TMJ influences interrater agreement and to evaluate interrater agreement with respect to image quality assessment.
Two non-calibrated medical radiologists and two general dentists evaluated sagittal images of 100 TMJs, rating the image quality and performing five diagnostic tasks. The agreement between these raters with respect to the diagnoses was evaluated. In addition, two further raters, calibrated during a 5 h training session that included the evaluation of 70 MR images, assessed the same diagnostic aspects and the image quality on the basis of objective criteria. The agreement between the subjective diagnoses of the non-calibrated raters and the objective diagnoses of the calibrated raters was evaluated. Finally, the subjective and objective quality assessments were compared using kappa statistics.
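As an illustration of the kappa statistic mentioned above, the following minimal sketch computes Cohen's kappa for two raters who each assign one category per image. It assumes the two-rater Cohen variant, which the abstract does not specify; the function name `cohen_kappa` and the ratings are hypothetical and are not data from the study.

```python
from collections import Counter


def cohen_kappa(ratings_a, ratings_b):
    """Cohen's kappa for two raters who rated the same items (assumed variant)."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("Both raters must rate the same non-empty set of items.")
    n = len(ratings_a)

    # Observed agreement: fraction of items given the same category by both raters.
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n

    # Chance agreement: sum over categories of the product of marginal proportions.
    count_a = Counter(ratings_a)
    count_b = Counter(ratings_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)

    if expected == 1.0:
        # Both raters used a single identical category; agreement is trivially perfect.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Hypothetical image-quality ratings for ten images (illustrative only).
rater_1 = ["good", "good", "poor", "good", "poor", "poor", "good", "good", "poor", "good"]
rater_2 = ["good", "good", "poor", "poor", "poor", "poor", "good", "good", "good", "good"]

kappa = cohen_kappa(rater_1, rater_2)
print(f"kappa = {kappa:.2f}")  # observed 0.80, chance 0.52, so kappa is about 0.58
```

Here the raters agree on 8 of 10 images, but because 52% agreement would be expected by chance from their marginal rating frequencies, kappa is noticeably lower than the raw agreement.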
When good quality images were evaluated, higher kappa values were obtained for all diagnostic categories by the non-calibrated raters (mean Δκ for making diagnoses > 0.1). This finding was confirmed by the value obtained for the agreement between the non-calibrated and the calibrated raters. The non-calibrated raters were in good agreement (κ = 0.67, standard error ±0.09) with the calibrated raters for assessment of image quality.
The present study shows that, even without calibration, better interrater agreement can be obtained when higher quality MR images of the TMJ are evaluated.