Huang Zeping, Qu Enze, Meng Yishuang, Zhang Man, Wei Qiuwen, Bai Xianghui, Zhang Xinling
Department of Ultrasound, the Third Affiliated Hospital of Sun Yat-sen University, 600 Tianhe Road, Guangzhou 510630, China.
Philips (China) Investment Co. Ltd, 6F, Building A2, 718 Lingshi Road, Shanghai 200072, China.
Eur J Radiol Open. 2022 Mar 24;9:100412. doi: 10.1016/j.ejro.2022.100412. eCollection 2022.
To automatically segment and measure the levator hiatus with a deep learning approach and evaluate the performance between algorithms, sonographers, and different devices.
Three deep learning models (UNet-ResNet34, HR-Net, and SegNet) were trained with 360 images and validated with 42 images. The trained models were tested with two test sets. The first set included 138 images to evaluate the performance between the algorithms and sonographers. An independent dataset including 679 images assessed the performances of algorithms between different ultrasound devices. Four metrics were used for evaluation: DSC, HDD, the relative error of segmentation area, and the absolute error of segmentation area.
The UNet model outperformed HR-Net and SegNet. It could achieve a mean DSC of 0.964 for the first test set and 0.952 for the independent test set. UNet was creditable compared with three senior sonographers with a noninferiority test in the first test set and equivalent in the two test sets collected by different devices. On average, it took two seconds to process one case with a GPU and 2.4 s with a CPU.
The deep learning approach has good performance for levator hiatus segmentation and good generalization ability on independent test sets. This automatic levator hiatus segmentation approach could help shorten the clinical examination time and improve consistency.
采用深度学习方法自动分割和测量肛提肌裂孔,并评估算法、超声检查医师和不同设备之间的性能。
使用360幅图像训练三种深度学习模型(UNet-ResNet34、HR-Net和SegNet),并使用42幅图像进行验证。使用两个测试集对训练好的模型进行测试。第一组包括138幅图像,用于评估算法和超声检查医师之间的性能。一个包含679幅图像的独立数据集评估了不同超声设备之间算法的性能。使用四个指标进行评估:DSC、HDD、分割面积相对误差和分割面积绝对误差。
UNet模型优于HR-Net和SegNet。在第一个测试集中,其平均DSC为0.964,在独立测试集中为0.952。在第一个测试集中通过非劣效性测试与三位资深超声检查医师相比,UNet是可信的,并且在不同设备收集的两个测试集中是等效的。平均而言,使用GPU处理一个病例需要两秒,使用CPU需要2.4秒。
深度学习方法在肛提肌裂孔分割方面具有良好的性能,并且在独立测试集上具有良好的泛化能力。这种自动肛提肌裂孔分割方法有助于缩短临床检查时间并提高一致性。