Department of Radiation Oncology, The Netherlands Cancer Institute, Amsterdam, The Netherlands.
Med Phys. 2022 Mar;49(3):1701-1711. doi: 10.1002/mp.15421. Epub 2022 Feb 1.
Automatic cervix-uterus segmentation of the clinical target volume (CTV) on CT and cone-beam CT (CBCT) scans is challenged by the limited visibility and the non-anatomical definition of certain border regions. We study the potential performance gain of convolutional neural networks by regulating the segmentation predictions as diffeomorphic deformations of a segmentation prior.
We introduce a 3D convolutional neural network that segments the target scan by joint voxel-wise classification and the registration of a given prior. We compare this network to two other 3D baseline models: One treating segmentation as a classification problem (segmentation-only), the other as a registration problem (deformation-only). For reference and to highlight the benefits of a 3D model, these models are also benchmarked against a 2D segmentation model. Network performances are reported for CT and CBCT segmentation of the cervix-uterus CTV. We train the networks on the data of 84 patients. The prior is provided by the CTV segmentation of a planning CT. Repeat CT or CBCT scans constitute the target scans to be segmented.
All 3D models outperformed the 2D segmentation model. For CT segmentation, combining classification and registration in the proposed joint model proved beneficial, achieving a Dice score of 0.87 and a mean squared error (MSE) of the surface distance below 1.7 mm. No such synergy was observed for CBCT segmentation, for which the joint and the deformation-only model performed similarly, achieving a Dice score of about 0.80 and an MSE surface distance of 2.5 mm. However, the segmentation-only model performed notably worse in this low contrast regime. Visual inspection revealed that this performance drop translated into geometric inconsistencies between the prior and target segmentation. Such inconsistencies were not observed for the deformation-based models.
Constraining the solution space of admissible segmentation predictions to those reachable by a diffeomorphic deformation of the prior proved beneficial as it improved geometric consistency. Especially for CBCT, with its poor soft-tissue contrast, this type of regularization becomes important as shown by quantitative and qualitative evaluation.
在 CT 和锥形束 CT(CBCT)扫描上,由于某些边界区域的可见度有限和非解剖定义,自动对临床靶区(CTV)的子宫颈-子宫进行分割具有挑战性。我们通过将分割预测调节为分割先验的可变形变形,来研究卷积神经网络的潜在性能增益。
我们引入了一种 3D 卷积神经网络,该网络通过对给定先验的体素分类和配准来分割目标扫描。我们将该网络与其他两个 3D 基线模型进行了比较:一个将分割视为分类问题(仅分割),另一个将分割视为配准问题(仅变形)。为了参考并突出 3D 模型的优势,这些模型也针对 2D 分割模型进行了基准测试。报告了用于 CT 和 CBCT 分割子宫颈-子宫 CTV 的网络性能。我们在 84 名患者的数据上训练了网络。先验由计划 CT 的 CTV 分割提供。重复 CT 或 CBCT 扫描构成要分割的目标扫描。
所有 3D 模型均优于 2D 分割模型。对于 CT 分割,在拟议的联合模型中结合分类和配准被证明是有益的,其 Dice 评分达到 0.87,表面距离的均方误差(MSE)低于 1.7mm。对于 CBCT 分割,未观察到这种协同作用,联合模型和变形模型的性能相似,Dice 评分约为 0.80,表面距离的 MSE 为 2.5mm。然而,在这种低对比度的情况下,仅分割模型的性能明显下降。视觉检查表明,这种性能下降导致了先验和目标分割之间的几何不一致。变形模型没有观察到这种不一致。
将可接受的分割预测的解空间限制为通过先验的可变形变形可达到的空间,这被证明是有益的,因为它提高了几何一致性。特别是对于 CBCT,由于其软组织对比度差,这种类型的正则化变得很重要,如定量和定性评估所示。