Department of Radiology, Graduate School of Medical Science, Kyoto Prefectural University of Medicine, Kyoto, Japan.
Department of Radiation Oncology, Tohoku University Graduate School of Medicine, Sendai, Japan.
Med Phys. 2022 Jul;49(7):4353-4364. doi: 10.1002/mp.15697. Epub 2022 May 17.
This study aimed to evaluate the accuracy of deep learning (DL)-based computed tomography (CT) ventilation imaging (CTVI).
A total of 71 cases that underwent single-photon emission CT Kr-gas ventilation (SPECT V) and CT imaging were included. Sixty cases were assigned to the training and validation sets, and the remaining 11 cases were assigned to the test set. To directly transform three-dimensional (3D) CT (free-breathing CT) images to SPECT V images, a DL-based model was implemented based on the U-Net architecture. The input and output data were 3DCT- and SPECT V-masked, respectively, except for whole-lung volumes. These data were rearranged in voxel size, registered rigidly, cropped, and normalized in preprocessing. In addition to a standard estimation method (i.e., without dropout during the estimation process), a Monte Carlo dropout (MCD) method (i.e., with dropout during the estimation process) was used to calculate prediction uncertainty. To evaluate the two models' (CTVI , CTVI ) performance, we used fivefold cross-validation for the training and validation sets. To test the final model performances for both approaches, we applied the test set to each trained model and averaged the test prediction results from the five trained models to acquire the mean test result (bagging) for each approach. For the MCD method, the models were predicted repeatedly (sample size = 200), and the average and standard deviation (SD) maps were calculated in each voxel from the predicted results: The average maps were defined as test prediction results in each fold. As an evaluation index, the voxel-wise Spearman rank correlation coefficient (Spearman r ) and Dice similarity coefficient (DSC) were calculated. The DSC was calculated for three functional regions (high, moderate, and low) separated by an almost equal volume. The coefficient of variation was defined as prediction uncertainty, and these average values were calculated within three functional regions. The Wilcoxon signed-rank test was used to test for a significant difference between the two DL-based approaches.
The average indexes with one SD (1SD) between CTVI and SPECT V were 0.76 ± 0.06, 0.69 ± 0.07, 0.51 ± 0.06, and 0.75 ± 0.04 for Spearman r , DSC , DSC , and DSC , respectively. The average indexes with 1SD between CTVI and SPECT V were 0.72 ± 0.05, 0.66 ± 0.04, 0.48 ± 0.04, and 0.74 ± 0.06 for Spearman r , DSC , DSC , and DSC , respectively. These indexes between CTVI and CTVI showed no significance difference (Spearman r , p = 0.175; DSC , p = 0.123; DSC , p = 0.278; DSC , p = 0.520). The average coefficient of variations with 1SD were 0.27 ± 0.00, 0.27 ± 0.01, and 0.36 ± 0.03 for the high-, moderate-, and low-functional regions, respectively, and the low-functional region showed a tendency to exhibit larger uncertainties than the others.
We evaluated DL-based framework for estimating lung-functional ventilation images only from CT images. The results indicated that the DL-based approach could potentially be used for lung-ventilation estimation.
本研究旨在评估基于深度学习(DL)的计算机断层扫描(CT)通气成像(CTVI)的准确性。
共纳入 71 例接受单光子发射 CT Kr-气体通气(SPECT V)和 CT 成像的患者。其中 60 例被分配到训练集和验证集,其余 11 例被分配到测试集。为了直接将三维(3D)CT(自由呼吸 CT)图像转换为 SPECT V 图像,我们基于 U-Net 架构实施了基于 DL 的模型。输入和输出数据分别为 3DCT 和 SPECT V 掩模,除了全肺体积外。这些数据以体素大小进行重新排列,刚性注册,裁剪和标准化预处理。除了标准估计方法(即在估计过程中不使用随机失活)之外,还使用蒙特卡罗随机失活(MCD)方法(即在估计过程中使用随机失活)来计算预测不确定性。为了评估两种模型(CTVI,CTVI)的性能,我们使用五折交叉验证对训练集和验证集进行评估。为了测试两种方法的最终模型性能,我们将测试集应用于每个训练模型,并对五个训练模型的测试预测结果进行平均,以获得每个方法的平均测试结果(袋装)。对于 MCD 方法,模型会重复进行预测(样本量=200),并从预测结果中计算每个体素的平均和标准偏差(SD)图:平均图被定义为每个折叠的测试预测结果。作为评估指标,计算了体素水平的斯皮尔曼等级相关系数(Spearman r)和 Dice 相似系数(DSC)。根据几乎相等的体积将三个功能区域(高、中、低)分开,并计算 DSC。变异系数被定义为预测不确定性,并在三个功能区域内计算这些平均值。使用 Wilcoxon 符号秩检验检验两种基于 DL 的方法之间是否存在显著差异。
CTVI 与 SPECT V 之间的 1SD 平均指标分别为 0.76 ± 0.06、0.69 ± 0.07、0.51 ± 0.06 和 0.75 ± 0.04 的 Spearman r、DSC、DSC 和 DSC。CTVI 与 SPECT V 之间的 1SD 平均指标分别为 0.72 ± 0.05、0.66 ± 0.04、0.48 ± 0.04 和 0.74 ± 0.06 的 Spearman r、DSC、DSC 和 DSC。CTVI 与 CTVI 之间的这些指标没有显著差异(Spearman r,p=0.175;DSC,p=0.123;DSC,p=0.278;DSC,p=0.520)。1SD 的平均变异系数分别为 0.27 ± 0.00、0.27 ± 0.01 和 0.36 ± 0.03,分别用于高、中和低功能区域,并且低功能区域表现出比其他区域更大的不确定性的趋势。
我们评估了基于 DL 的框架,仅从 CT 图像估计肺功能通气图像。结果表明,基于 DL 的方法可能可用于肺通气估计。