Herring John A, Kim Hui Taek, Browne Richard
Texas Scottish Rite Hospital for Children, 2222 Welborn Street, Dallas, TX 75129, USA.
J Bone Joint Surg Am. 2004 Oct;86(10):2103-20.
Accurate and reliable radiographic classifications of the relative severity and outcome of Legg-Calve-Perthes disease are essential in the study of that disease. As part of a prospective multicenter study, we sought to define more clearly the lateral pillar classification of severity and the Stulberg classification of outcome; we sought especially to define the borderlines between classification groups.
We performed interobserver and intraobserver trials of the lateral pillar and Stulberg classifications using sets of twenty radiographs chosen from a prospective study of 345 hips. To establish reliable definitions of the lateral pillar classification, we added a new, intermediate group termed the B/C border group, which includes femoral heads with a thin or poorly ossified lateral pillar and those with a loss of exactly 50% of the original height of the lateral pillar. The resulting classification consists of four groups: A, B, B/C border, and C. In our application of the classification system of Stulberg et al., we defined a class-II femoral head as round and fitting within 2 mm of a circle on both anteroposterior and frog-leg lateral radiographs. We defined a Stulberg class-III femoral head as out of round by more than 2 mm on either view and a Stulberg class-IV femoral head as one with at least 1 cm of flattening of the weight-bearing articular surface. To assess interobserver and intraobserver agreement, we performed two trials of each classification with six orthopaedic surgeons reviewing twenty radiographs or pairs of radiographs.
In the first trial of the lateral pillar classification, there was 81% agreement per radiograph and the average weighted kappa was 0.71. In the second trial, there was 85% agreement per radiograph and the weighted kappa averaged 0.79. Intraobserver reliability testing showed a 77% match between Trials 1 and 2, an average weighted kappa of 0.81, and an average generalizability coefficient of 0.91. In Trial 1 of the Stulberg classification, there was 91% agreement per radiograph and an average weighted kappa of 0.82. In Trial 2, there was 92% agreement per radiograph and an average weighted kappa of 0.82. Intraobserver reliability testing showed an 89% match between Trials 1 and 2, an average weighted kappa value of 0.88, and an average generalizability coefficient of 0.92.
The interobserver and intraobserver trials of these classifications produced kappa values and generalizability coefficients in the excellent range. The modified lateral pillar classification and the redefined Stulberg classification are sufficiently reliable and accurate for use in studies of Legg-Calve-Perthes disease.
准确可靠的关于Legg-Calve-Perthes病相对严重程度和预后的放射学分类对于该病的研究至关重要。作为一项前瞻性多中心研究的一部分,我们试图更明确地界定严重程度的外侧柱分类和预后的Stulberg分类;我们尤其试图界定分类组之间的边界。
我们使用从345例髋关节的前瞻性研究中选取的20套X线片,对外侧柱和Stulberg分类进行了观察者间和观察者内试验。为了确立外侧柱分类的可靠定义,我们增加了一个新的中间组,称为B/C边界组,其中包括外侧柱薄或骨化不良的股骨头以及外侧柱高度正好损失原始高度50%的股骨头。最终的分类包括四组:A、B、B/C边界和C。在我们应用Stulberg等人的分类系统时,我们将II级股骨头定义为在前后位和蛙式侧位X线片上均呈圆形且与圆形的拟合度在2mm以内。我们将Stulberg III级股骨头定义为在任一视图上圆形度超过2mm,将Stulberg IV级股骨头定义为负重关节面至少有1cm扁平的股骨头。为了评估观察者间和观察者内的一致性,我们对每种分类进行了两次试验,由六位骨科医生查看20张X线片或成对的X线片。
在外侧柱分类的第一次试验中,每张X线片的一致性为81%,平均加权kappa值为0.71。在第二次试验中,每张X线片的一致性为85%,加权kappa值平均为0.79。观察者内可靠性测试显示试验1和试验2之间的匹配率为77%,平均加权kappa值为0.81,平均可推广系数为0.91。在Stulberg分类的试验1中,每张X线片的一致性为91%,平均加权kappa值为0.82。在试验2中,每张X线片的一致性为92%,平均加权kappa值为0.82。观察者内可靠性测试显示试验1和试验2之间的匹配率为89%,平均加权kappa值为0.88,平均可推广系数为0.92。
这些分类的观察者间和观察者内试验产生的kappa值和可推广系数处于优秀范围。改良的外侧柱分类和重新定义的Stulberg分类在Legg-Calve-Perthes病的研究中使用足够可靠和准确。