School of Medicine, The University of Jordan, Amman, Jordan.
Department of Radiology, School of Medicine, The University of Jordan, Amman, Jordan.
Sci Rep. 2019 Dec 5;9(1):18384. doi: 10.1038/s41598-019-54764-0.
Anatomists and radiologists use the Zaidi-Dayal and Richards-Jabbour scales to study the shape of the foramen magnum. Our aim was to measure the interrater and intrarater agreement and reliability of ratings made using the two scales. We invited 16 radiology residents to attend two sessions, four weeks apart. During each session, we asked the residents to classify the shape of the foramen magnum in 35 images using both scales. We used Fleiss' κ to measure interrater reliability and Cohen's κ to measure intrarater reliability. For the Zaidi-Dayal scale, the interrater reliability was 0.34 (0.26-0.46) in session one and 0.30 (0.24-0.39) in session two, and the intrarater reliability was 0.39 (0.34-0.44). For the Richards-Jabbour scale, the interrater reliability was 0.14 (0.10-0.19) in session one and 0.12 (0.09-0.17) in session two, and the intrarater reliability was 0.11 (0.07-0.15). In conclusion, the interrater and intrarater agreement and reliability of ratings made using the Zaidi-Dayal and Richards-Jabbour scales are inadequate. We recommend the objective method described by Zdilla et al. to researchers interested in studying the shape of the foramen magnum.
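For readers unfamiliar with the two statistics named above, the following is a minimal sketch of how Fleiss' κ (several raters, one session) and Cohen's κ (two sets of ratings, for example one rater's labels from two sessions) can be computed from categorical labels. It is not the authors' analysis code: the function names `fleiss_kappa` and `cohen_kappa`, the data layout, and the placeholder labels "A", "B", "C" are assumptions made for illustration, and the sketch does not reproduce the intervals reported in the abstract.

```python
from collections import Counter


def cohen_kappa(ratings_a, ratings_b):
    """Cohen's kappa for two equal-length sequences of category labels.

    Hypothetical helper: e.g. one rater's labels in session one vs. session two.
    """
    n = len(ratings_a)
    if n == 0 or n != len(ratings_b):
        raise ValueError("need two non-empty sequences of equal length")
    # Observed agreement: share of items given the same label both times.
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    # Chance agreement: product of the two marginal label proportions.
    count_a, count_b = Counter(ratings_a), Counter(ratings_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    if expected == 1:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (observed - expected) / (1 - expected)


def fleiss_kappa(ratings):
    """Fleiss' kappa for a list of items, each a list with one label per rater.

    Hypothetical helper: every item must be rated by the same number of raters.
    """
    n_items = len(ratings)
    n_raters = len(ratings[0])
    if n_items == 0 or n_raters < 2:
        raise ValueError("need at least one item and two raters")
    totals = Counter()      # how often each category was used overall
    agreement_sum = 0.0     # sum of per-item agreement P_i
    for item in ratings:
        if len(item) != n_raters:
            raise ValueError("every item needs the same number of raters")
        counts = Counter(item)
        totals.update(counts)
        # P_i: proportion of rater pairs that agree on this item.
        pairs_agreeing = sum(c * c for c in counts.values()) - n_raters
        agreement_sum += pairs_agreeing / (n_raters * (n_raters - 1))
    p_bar = agreement_sum / n_items
    # Chance agreement: sum of squared overall category proportions.
    p_e = sum((t / (n_items * n_raters)) ** 2 for t in totals.values())
    if p_e == 1:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (p_bar - p_e) / (1 - p_e)


# Toy data with placeholder labels (not taken from the study).
session_one = [["A", "A", "B"], ["B", "B", "B"], ["C", "A", "C"]]
print(fleiss_kappa(session_one))
print(cohen_kappa(["A", "B", "C", "A"], ["A", "B", "A", "A"]))
```

In a design like the one described, `fleiss_kappa` would be applied once per session to a 35-item table with 16 labels per item, and `cohen_kappa` to each resident's pair of sessions; how the per-resident values were pooled into a single intrarater figure is not stated in the abstract.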