Ma Yuanyuan, Mao Jingfang, Liu Xinguo, Dai Zhongying, Zhang Hui, Zhang Xinyang, Li Qiang
Institute of Modern Physics, Chinese Academy of Sciences, Lanzhou, China.
Key Laboratory of Heavy Ion Radiation Biology and Medicine of Chinese Academy of Sciences, Lanzhou, China.
Med Phys. 2023 Apr;50(4):2303-2316. doi: 10.1002/mp.16106. Epub 2022 Nov 25.
Contouring of internal gross target volume (iGTV) is an essential part of treatment planning in radiotherapy to mitigate the impact of intra-fractional target motion. However, it is usually time-consuming and easily subjected to intra-observer and inter-observer variability. So far, few studies have been explored to directly predict iGTV by deep learning technique, because the iGTV contains not only the gross target volume (GTV) but also the motion information of the GTV.
This work was an exploratory study to present a deep learning-based framework to segment iGTV rapidly and accurately in 4D CT images for lung cancers.
Five models, including 3D UNet, mmUNet with point-wise add merging approach (mmUNet-add), mmUNet with concatenate fusion strategy (mmUNet-cat), gruUNet with point-wise add fusion approach (gruUNet-add), and gruUNet with concatenate method (gruUNet-cat), were adopted for iGTV segmentation. All the models originated from the 3D UNet network, with multi-channel multi-path and convolutional gated recurrent unit (GRU) added in the mmUNet and gruUNet networks, respectively. Seventy patients with lung cancers were collected and 55 cases were randomly selected as the training set, and 15 cases as the testing set. In addition, the segmentation results of the five models were compared with the ground truths qualitatively and quantitatively.
In terms of Dice Similarity Coefficient (DSC), the proposed four networks (mmUNet-add, mmUNet-cat, gruUNet-add, and gruUNet-cat) increased the DSC score of 3D UNet from 0.6945 to 0.7342, 0.7253, 0.7405, and 0.7365, respectively. However, the differences were not statistically significant (p > 0.05). After a simple post-processing to remove the small isolated connected regions, the mean 95th percentile Hausdorff distances (HD_95s) of the 3D UNet, mmUNet-add, mmUNet-cat, gruUNet-add, and gruUNet-cat networks were 19.70, 15.75, 15.84, 15.61, and 15.83 mm, respectively, corresponding to 25.35, 25.96, 25.11, 28.23, and 24.47 mm before the post-processing. With regard to runtime, significant elapsed time growths (about 70s and 230s) were observed both in the mmUNet and gruUNet architectures due to the increasing parameters. But the mmUNet structure showed less growth.
Our study demonstrated the ability of the deep learning technique to predict iGTVs directly. With the introduction of multi-channel multi-path and convolutional GRU, the segmentation accuracy was improved under certain conditions with a reduced segmentation efficiency and a further research topic when the 3D UNet network would lead to poor performance is elicited. Less efficiency degradation was observed in the mmUNet structure. Besides, the element-wise add fusing strategy was favorable to increase DSC, whereas HD_95 benefited from the concentrate merging approach. Nevertheless, the segmentation accuracy by deep learning still remains to be improved.
勾画内部大体肿瘤靶区(iGTV)是放射治疗中治疗计划的重要组成部分,以减轻分次内靶区运动的影响。然而,这通常很耗时,并且容易受到观察者内和观察者间差异的影响。到目前为止,很少有研究探索通过深度学习技术直接预测iGTV,因为iGTV不仅包含大体肿瘤靶区(GTV),还包含GTV的运动信息。
本研究是一项探索性研究,旨在提出一种基于深度学习的框架,用于在肺癌的4D CT图像中快速准确地分割iGTV。
采用5种模型进行iGTV分割,包括3D UNet、采用逐点相加合并方法的mmUNet(mmUNet-add)、采用拼接融合策略的mmUNet(mmUNet-cat)、采用逐点相加融合方法的gruUNet(gruUNet-add)和采用拼接方法的gruUNet(gruUNet-cat)。所有模型均源自3D UNet网络,mmUNet和gruUNet网络分别添加了多通道多路径和卷积门控循环单元(GRU)。收集了70例肺癌患者,随机选择55例作为训练集,15例作为测试集。此外,对5种模型的分割结果与真实情况进行了定性和定量比较。
在骰子相似系数(DSC)方面,所提出的4种网络(mmUNet-add、mmUNet-cat、gruUNet-add和gruUNet-cat)分别将3D UNet的DSC分数从0.6945提高到0.7342、0.7253、0.7405和0.7365。然而,差异无统计学意义(p>0.05)。经过简单的后处理以去除小的孤立连接区域后,3D UNet、mmUNet-add、mmUNet-cat、gruUNet-add和gruUNet-cat网络的平均第95百分位数豪斯多夫距离(HD_95s)分别为19.70、15.75、15.84、15.61和15.83mm,后处理前分别对应25.35、25.96、25.11、28.23和24.47mm。在运行时间方面,由于参数增加,mmUNet和gruUNet架构均观察到显著的运行时间增长(约70秒和230秒)。但mmUNet结构的增长较小。
我们的研究证明了深度学习技术直接预测iGTV的能力。通过引入多通道多路径和卷积GRU,在一定条件下提高了分割精度,但分割效率降低,并引出了一个进一步的研究课题,即3D UNet网络何时会导致性能不佳。mmUNet结构的效率下降较少。此外,逐元素相加融合策略有利于提高DSC,而HD_95受益于集中合并方法。然而,深度学习的分割精度仍有待提高。