Lockwood Gina M, Leach Alexander J, Shelton Jude, Carter Knute D, Hansen Jacob, Edwards Angelena, Storm Douglas W, Zee Rebecca S, Ebert Kristin, Wojcik Louis, Cannon Shannon T, Chamberlin Joshua D, Sheth Kunj, Fox Janelle, Braga Luis H, Welch Valre, Khoury Antoine E, Kern Nora G, Davis-Dao Carol, Dudley Anne G, Wehbi Elias, Williamson Sarah, Herndon Cd Anthony, Cooper Christopher S
Department of Urology, University of Iowa, Iowa City, IA, USA.
Department of Urology, University of Iowa, Iowa City, IA, USA.
J Pediatr Urol. 2025 Jun 23. doi: 10.1016/j.jpurol.2025.06.025.
The ideal parameters to predict significant antenatal hydronephrosis remain controversial. Given the subjectivity of the Society of Fetal Urology (SFU) and Urinary Tract Dilation (UTD) grading systems, more objective measurements like medullary pyramidal thickness (PT) and parenchymal thickness (ParT) may be useful.
We sought to assess the interrater reliability of objective measures of hydronephrosis and the UTD grading system among pediatric urologists at multiple institutions through the Societies of Pediatric Urology (SPU) Hydronephrosis Task Force Registry.
Fifteen renal sonograms of infants from a single center were chosen from patients enrolled in the registry's prospective database. Images were shared confidentially with pediatric urologists at participating institutions. Reviewers were taught standardized measurement techniques. Eight reviewers from five institutions analyzed each study and recorded anterior posterior renal pelvis diameter (APD), PT, ParT, and UTD grade. Interrater reliability was analyzed using Intraclass Correlation Coefficient (ICC) with 95 % CI, one-way random effects, consistency model for continuous variables and percent agreement for binary variables. Light's kappa with 95 % CI was calculated for reliability of UTD grade.
Reviewers collected data on fifteen renal sonograms for a total of 30 units. APD had excellent reliability, PT moderate to good reliability, and ParT poor to moderate reliability. One reviewer was found to be an outlier with respect to ParT measurements. When assessed as a binary variable (>3 mm vs. <3 mm) the percent agreement between reviewers for PT was 70 %. UTD grade was considered to have weak to moderate reliability with a Light's Kappa of 0.43. The most common discrepancy between graders was the distinction between UTD P2 and P3.
DISCUSSION/CONCLUSION: APD demonstrated the highest interrater reliability among objective assessments of antenatal hydronephrosis. PT had moderate to good reliability and was more reliable than ParT. This analysis highlights the need for incorporation of more reliable methods to characterize UTD.
预测显著产前肾积水的理想参数仍存在争议。鉴于胎儿泌尿外科学会(SFU)和泌尿道扩张(UTD)分级系统的主观性,像髓质锥体厚度(PT)和实质厚度(ParT)这样更客观的测量方法可能会有所帮助。
我们试图通过小儿泌尿外科学会(SPU)肾积水特别工作组登记处,评估多个机构的小儿泌尿科医生对肾积水客观测量指标及UTD分级系统的评分者间信度。
从登记处前瞻性数据库中的患者里选取了来自单一中心的15例婴儿肾脏超声图像。图像被保密地分享给参与机构的小儿泌尿科医生。对审阅者进行标准化测量技术培训。来自五个机构的八名审阅者分析每项研究,并记录肾肾盂前后径(APD)、PT、ParT和UTD分级。使用组内相关系数(ICC)及95%置信区间、单向随机效应、连续变量的一致性模型和二元变量的百分比一致性来分析评分者间信度。计算Light's kappa及95%置信区间以评估UTD分级的可靠性。
审阅者收集了15例肾脏超声图像的数据,共30个单位。APD具有极好的信度,PT具有中等至良好的信度,ParT具有较差至中等的信度。发现一名审阅者在ParT测量方面是离群值。当作为二元变量(>3毫米与<3毫米)评估时,审阅者之间PT的百分比一致性为70%。UTD分级被认为具有弱至中等的可靠性,Light's Kappa为0.43。分级者之间最常见的差异是UTD P2和P3之间的区分。
讨论/结论:在产前肾积水的客观评估中,APD显示出最高的评分者间信度。PT具有中等至良好的信度,并且比ParT更可靠。该分析强调需要采用更可靠的方法来描述UTD。