Chung Kee, Wu Shaoju, Jeanne Chow, Tsai Andy
Department of Radiology, Boston Children's Hospital, 300 Longwood Ave, Boston, MA, 02115, USA.
Pediatr Radiol. 2025 Aug;55(9):1846-1856. doi: 10.1007/s00247-025-06311-5. Epub 2025 Jul 4.
Urinary tract dilation (UTD) is a frequent problem in infants. Automated and objective classification of UTD from renal ultrasounds would streamline their interpretations.
To develop and evaluate the performance of different deep learning models in predicting UTD classifications from renal ultrasound images.
We searched our image archive to identify renal ultrasounds performed in infants ≤ 3-months-old for the clinical indications of prenatal UTD and urinary tract infection (9/2023-8/2024). An expert pediatric uroradiologist provided the ground truth UTD labels for representative sagittal sonographic renal images. Three different deep learning models trained with cross-entropy loss were adapted with four-fold cross-validation experiments to determine the overall performance.
Our curated database included 492 right and 487 left renal ultrasounds (mean age ± standard deviation = 1.2 ± 0.1 months for both cohorts, with 341 boys/151 girls and 339 boys/148 girls, respectively). The model prediction accuracies for the right and left kidneys were 88.7% (95% confidence interval [CI], [85.8%, 91.5%]) and 80.5% (95% CI, [77.6%, 82.9%]), with weighted kappa scores of 0.90 (95% CI, [0.88, 0.91]) and 0.87 (95% CI, [0.82, 0.92]), respectively. When predictions were binarized into mild (normal/P1) and severe (UTD P2/P3) dilation, accuracies of the right and left kidneys increased to 96.3% (95% CI, [94.9%, 97.8%]) and 91.3% (95% CI, [88.5%, 94.2%]), but agreements decreased to 0.78 (95% CI, [0.73, 0.82]) and 0.75 (95% CI, [0.68, 0.82]), respectively.
Deep learning models demonstrated high accuracy and agreement in classifying UTD from infant renal ultrasounds, supporting their potential as decision-support tools in clinical workflows.
尿路扩张(UTD)是婴儿常见的问题。通过肾脏超声对UTD进行自动、客观的分类将简化其解读过程。
开发并评估不同深度学习模型在根据肾脏超声图像预测UTD分类方面的性能。
我们在图像存档中搜索,以识别在≤3个月大的婴儿中进行的肾脏超声检查,用于产前UTD和尿路感染的临床指征(2023年9月 - 2024年8月)。一位专业的儿科泌尿放射科医生为代表性的矢状面肾脏超声图像提供UTD的真实标签。使用交叉熵损失训练的三种不同深度学习模型通过四折交叉验证实验进行调整,以确定整体性能。
我们精心策划的数据库包括492例右侧和487例左侧肾脏超声检查(两个队列的平均年龄±标准差均为1.2±0.1个月,分别有341名男孩/151名女孩和339名男孩/148名女孩)。右侧和左侧肾脏的模型预测准确率分别为88.7%(95%置信区间[CI],[85.8%,91.5%])和80.5%(95%CI,[77.6%,82.9%]),加权kappa评分分别为0.90(95%CI,[0.88,0.91])和0.87(95%CI,[0.82,0.92])。当预测结果二分类为轻度(正常/P1)和重度(UTD P2/P3)扩张时,右侧和左侧肾脏的准确率分别提高到96.3%(95%CI,[94.9%,97.8%])和91.3%(95%CI,[88.5%,94.2%]),但一致性分别降至0.78(95%CI,[0.73,0.82])和0.75(95%CI,[0.68,0.82])。
深度学习模型在根据婴儿肾脏超声对UTD进行分类方面表现出高准确率和一致性,支持其作为临床工作流程中决策支持工具的潜力。