Çelebi Süleyman, Özaydın Seyithan, Baştaş Cemile Beşik, Kuzdan Özgür, Erdoğan Cankat, Yazıcı Mehmet, Caymaz İsmail, Sander Serdar
Department of Pediatric Urology, Kanuni Sultan Suleyman Education and Research Hospital, 34300 Istanbul, Turkey.
Department of Pediatric Surgery, Kanuni Sultan Suleyman Education and Research Hospital, 34300 Istanbul, Turkey.
Adv Urol. 2016;2016:1684190. doi: 10.1155/2016/1684190. Epub 2016 Mar 16.
Aim. Vesicoureteral reflux (VUR) is one of the most common conditions seen in pediatric urology. Fortunately, there are many treatment options for this disorder. The grading system for VUR varies among doctors, and the literature on its reliability is sparse. Here, we assessed the effectiveness of the current VUR grading system. Methods. A series of 40 voiding cystourethrogram (VCUG) studies were selected. Four pediatric urologists (PU) and four pediatric radiologists (PR) independently graded each VCUG and then agreed on a uniform interpretation. For statistical analysis the intraclass correlation coefficient (ICC) was applied to assess interrater agreement. Results. ICC values ranging from 0.82 to 0.88 reflected the strong reliability of VCUG for grading cases of VUR among pediatric urologists and radiologists as separate groups, and the reliability between the two groups was also good, as indicated by an ICC of 0.89. Despite the high ICC, disagreement existed between raters; the lowest agreement was associated with middle grades (III and IV). Conclusions. The interrater reliability of the international grading system for VUR was high but imperfect. Thus, grading differences at middle grades can profoundly influence the type of treatment pursued.
目的。膀胱输尿管反流(VUR)是小儿泌尿外科最常见的病症之一。幸运的是,针对这种病症有多种治疗选择。VUR的分级系统在医生之间存在差异,且关于其可靠性的文献较少。在此,我们评估了当前VUR分级系统的有效性。方法。选取了40例排尿性膀胱尿道造影(VCUG)研究。四位小儿泌尿外科医生(PU)和四位小儿放射科医生(PR)各自独立对每例VCUG进行分级,然后达成统一的解读。为进行统计分析,应用组内相关系数(ICC)来评估评分者间的一致性。结果。ICC值在0.82至0.88之间,反映出VCUG在小儿泌尿外科医生和放射科医生作为独立组对VUR病例进行分级时具有较强的可靠性,两组之间的可靠性也良好,ICC为0.89表明了这一点。尽管ICC较高,但评分者之间仍存在分歧;最低的一致性与中级(III级和IV级)相关。结论。VUR国际分级系统的评分者间可靠性较高但并不完美。因此,中级的分级差异会对所采用的治疗类型产生深远影响。