Department of Radiology, Peking University First Hospital, Beijing, China.
School of Basic Medical Sciences, Capital Medical University Beijing, Beijing, China.
Med Phys. 2024 Aug;51(8):5457-5467. doi: 10.1002/mp.17025. Epub 2024 Mar 13.
Accurate measurement of ureteral diameters plays a pivotal role in diagnosing and monitoring urinary tract obstruction (UTO). While three-dimensional magnetic resonance urography (3D MRU) represents a significant advancement in imaging, the traditional manual methods for assessing ureteral diameters are characterized by labor-intensive procedures and inherent variability. In the realm of medical image analysis, deep learning has led to a paradigm shift, yet the development of a comprehensive automated tool for the precise segmentation and measurement of ureters in MR images is an unaddressed challenge.
The ureter was quantitatively measured on 3D MRU images using a deep learning model.
A retrospective cohort of 445 3D MRU scans (443 patients, 52 ± 18 years; 217 female patients) was collected and split into training, validation, and internal testing cohorts. A 3D V-Net model was trained for urinary tract segmentation, and a post-processing algorithm was developed for ureteral measurements. The accuracy of the segmentation was evaluated using the Dice similarity coefficient (DSC) and volume intraclass correlation coefficient (ICC), with ground truth segmentations provided by experienced radiologists. The external cohort comprised 50 scans (50 patients, 55 ± 21 years; 30 female patients), and the model-predicted ureteral diameter measurements were compared with manual measurements to assess system performance. The various diameter parameters of ureter among the different measurement methods (ground truth, auto-segmentation with automatic diameter extraction, and manual segmentation with automatic diameter extraction) were assessed with Friedman tests and post hoc Dunn test. The effectiveness of the UTO diagnosis was assessed by receiver operating characteristic (ROC) curves and their respective areas under the curve (AUC) between different methods.
In both the internal test and external cohorts, the mean DSC values for bilateral ureters exceeded 0.70. The ICCs for the bilateral ureter volume obtained by comparing the model and manual segmentation were all greater than 0.96 (p < 0.05), except for the right ureter in the internal test cohort, for which the ICC was 0.773 (p < 0.05). The mean DSCs for interobserver and intraobserver reliability were all above 0.97. The maximum diameter of the ureter exhibited no statistically significant differences either in the dilated (p = 0.08) or in the non-dilated (p = 0.32) ureters across the three measurement methods. The AUCs of ground truth, auto-segmentation with automatic diameter extraction, and manual segmentation with automatic diameter extraction in diagnosing UTO were 0.988 (95% CI: 0.934, 1.000), 0.961 (95% CI: 0.893, 0.991), and 0.979 (95% CI: 0.919, 0.998), respectively. There was no statistical difference between AUCs of the different methods (p > 0.05).
The proposed deep learning model and post-processing algorithm provide an effective means for the quantitative evaluation of urinary diseases using 3D MRU images.
输尿管直径的精确测量在诊断和监测尿路梗阻(UTO)方面起着关键作用。虽然三维磁共振尿路成像(3D MRU)代表了成像技术的重大进展,但传统的手动评估输尿管直径的方法存在劳动强度大且存在固有变异性的问题。在医学图像分析领域,深度学习带来了范式转变,但开发用于精确分割和测量 MR 图像中输尿管的全面自动化工具仍然是一个未解决的挑战。
使用深度学习模型对 3D MRU 图像中的输尿管进行定量测量。
回顾性收集了 445 例 3D MRU 扫描(443 例患者,52±18 岁;217 例女性患者),并将其分为训练、验证和内部测试队列。训练了一个 3D V-Net 模型进行尿路分割,并开发了一个后处理算法进行输尿管测量。使用经验丰富的放射科医生提供的真实分割来评估分割的准确性,使用 Dice 相似系数(DSC)和体积内类间相关系数(ICC)进行评估。外部队列包括 50 例扫描(50 例患者,55±21 岁;30 例女性患者),将模型预测的输尿管直径测量值与手动测量值进行比较,以评估系统性能。使用 Friedman 检验和事后 Dunn 检验评估不同测量方法(真实值、自动分割和自动直径提取、手动分割和自动直径提取)之间输尿管的各种直径参数。使用受试者工作特征(ROC)曲线及其各自的曲线下面积(AUC)评估不同方法对 UTO 诊断的有效性。
在内部测试和外部队列中,双侧输尿管的平均 DSC 值均超过 0.70。模型和手动分割之间双侧输尿管体积的 ICC 值均大于 0.96(p<0.05),除内部测试队列中的右侧输尿管外,其 ICC 为 0.773(p<0.05)。观察者间和观察者内可靠性的平均 DSCs 均高于 0.97。在三种测量方法中,扩张(p=0.08)或非扩张(p=0.32)输尿管的最大直径均无统计学差异。在诊断UTO 方面,真实值、自动分割和自动直径提取、手动分割和自动直径提取的 AUC 分别为 0.988(95%CI:0.934,1.000)、0.961(95%CI:0.893,0.991)和 0.979(95%CI:0.919,0.998)。不同方法的 AUC 之间没有统计学差异(p>0.05)。
提出的深度学习模型和后处理算法为使用 3D MRU 图像对尿路疾病进行定量评估提供了一种有效的方法。