Tzavellas A N, Kenanidis E, Potoupnis M, Pellios S, Tsiridis E, Sayegh F
3 Academic Orthopaedic Unit, Medical School, Aristotle University of Thessaloniki, "Papageorgiou" General Hospital, Thessaloniki, Greece.
Hippokratia. 2016 Jul-Sep;20(3):222-226.
Prognostic value of Salter-Harris (SH) classification is well established. Its reliability, however, can be questioned. We aim to evaluate the interobserver and intraobserver reliability of SH classification and to correlate the level of rater's experience with the correct scoring for each SH subclass.
Twenty-eight independent raters stratified in three levels of seniority evaluated 50 randomly selected radiographs of physeal injuries. The interval for intraobserver reliability was 12 weeks. The overall agreement between raters was assessed using kappa statistics. Student's t-test and Spearman correlation coefficient used to compare results between groups.
Overall kappa for interobserver reliability was 0.45. The mean kappa difference between specialists and residents was significant (p <0.001). The mean kappa difference was also significant between senior and junior residents (p <0.001), favoring senior residents. Intraobserver kappa differs between specialists (0.55) and residents (0.49), but this did not reach statistical significance (p =0.34). SH type II and III demonstrated the highest category-specific kappa coefficient. Seniority was correlated significantly with the number of correct answers (Spearman rho =0.6 p =0.001).
Moderate interobserver reliability that was improved with greater rater's experience was found. Type II and III are the best scored regardless rater's experience. Type I, IV, and V when in doubt, require additional imaging. Hippokratia 2016, 20(3): 222-226.
Salter-Harris(SH)分类的预后价值已得到充分证实。然而,其可靠性可能受到质疑。我们旨在评估SH分类的观察者间和观察者内可靠性,并将评估者的经验水平与每个SH亚类的正确评分相关联。
28名独立评估者按资历分为三个级别,对随机选择的50张骨骺损伤X线片进行评估。观察者内可靠性评估的间隔时间为12周。使用kappa统计量评估评估者之间的总体一致性。采用学生t检验和Spearman相关系数比较组间结果。
观察者间可靠性的总体kappa值为0.45。专家与住院医师之间的平均kappa差异具有统计学意义(p<0.001)。高级住院医师与初级住院医师之间的平均kappa差异也具有统计学意义(p<0.001),高级住院医师表现更优。观察者内kappa值在专家(0.55)和住院医师(0.49)之间有所不同,但未达到统计学意义(p =0.34)。SH II型和III型显示出最高的类别特异性kappa系数。资历与正确答案数量显著相关(Spearman相关系数rho =0.6,p =0.001)。
发现观察者间可靠性为中等水平,且随着评估者经验的增加而提高。无论评估者经验如何,II型和III型的评分最佳。对于I型、IV型和V型,如有疑问,需要额外的影像学检查。《希波克拉底》2016年,20(3): 222 - 226。