一种基于临床结构未对齐配对数据集的用于CT去噪的两阶段深度学习框架。

A two-stage deep-learning framework for CT denoising based on a clinically structure-unaligned paired data set.

作者信息

Hu Ruibao, Xie Yongsheng, Zhang Lulu, Liu Lijian, Luo Honghong, Wu Ruodai, Luo Dehong, Liu Zhou, Hu Zhanli

机构信息

Lauterbur Research Center for Biomedical Imaging, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China.

School of Computer and Information, Anhui Normal University, Wuhu, China.

出版信息

Quant Imaging Med Surg. 2024 Jan 3;14(1):335-351. doi: 10.21037/qims-23-403. Epub 2024 Jan 2.

DOI:10.21037/qims-23-403

PMID:38223072

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10784028/

Abstract

BACKGROUND

In low-dose computed tomography (LDCT) lung cancer screening, soft tissue is hardly appreciable due to high noise levels. While deep learning-based LDCT denoising methods have shown promise, they typically rely on structurally aligned synthesized paired data, which lack consideration of the clinical reality that there are no aligned LDCT and normal-dose CT (NDCT) images available. This study introduces an LDCT denoising method using clinically structure-unaligned but paired data sets (LDCT and NDCT scans from the same patients) to improve lesion detection during LDCT lung cancer screening.

METHODS

A cohort of 64 patients undergoing both LDCT and NDCT was randomly divided into training (n=46) and testing (n=18) sets. A two-stage training approach was adopted. First, Gaussian noise was added to NDCT data to create simulated LDCT data for generator training. Then, the model was trained on a clinically structure-unaligned paired data set using a Wasserstein generative adversarial network (WGAN) framework with the initial generator weights obtained during the first stage of training. An attention mechanism was also incorporated into the network.

RESULTS

Validated on a clinical CT data set, our proposed method outperformed other available methods [CycleGAN, Pixel2Pixel, block-matching and three-dimensional filtering (BM3D)] in noise removal and detail retention tasks in terms of the peak signal-to-noise ratio (PSNR), structural similarity index measure (SSIM), and root mean square error (RMSE) metrics. Compared with the results produced by BM3D, our method yielded an average improvement of approximately 7% in terms of the three evaluation indicators. The probability density profile of the denoised CT output produced using our method best fit the reference NDCT scan. Additionally, our two-stage model outperformed the one-stage WGAN-based model in both objective and subjective evaluations, further demonstrating the higher effectiveness of our two-stage training approach.

CONCLUSIONS

The proposed method performed the best in removing noise from LDCT scans and exhibited good detail retention, which could potentially enhance the lesion detection and characterization effects obtained for soft tissues in the scanning scope of LDCT lung cancer screening.

摘要

背景

在低剂量计算机断层扫描（LDCT）肺癌筛查中，由于噪声水平高，软组织很难被清晰观察到。虽然基于深度学习的LDCT去噪方法已显示出前景，但它们通常依赖于结构对齐的合成配对数据，而没有考虑到临床实际情况，即没有对齐的LDCT和正常剂量CT（NDCT）图像可用。本研究引入一种使用临床结构未对齐但配对的数据集（来自同一患者的LDCT和NDCT扫描）的LDCT去噪方法，以改善LDCT肺癌筛查期间的病变检测。

方法

将64例同时接受LDCT和NDCT检查的患者队列随机分为训练组（n = 46）和测试组（n = 18）。采用两阶段训练方法。首先，向NDCT数据中添加高斯噪声以创建用于生成器训练的模拟LDCT数据。然后，使用Wasserstein生成对抗网络（WGAN）框架在临床结构未对齐的配对数据集上训练模型，初始生成器权重在训练的第一阶段获得。还将注意力机制纳入网络。

结果

在临床CT数据集上进行验证，我们提出的方法在去噪和细节保留任务方面，在峰值信噪比（PSNR）、结构相似性指数测量（SSIM）和均方根误差（RMSE）指标上优于其他现有方法[循环生成对抗网络（CycleGAN）、像素到像素（Pixel2Pixel）、块匹配和三维滤波（BM3D）]。与BM3D产生的结果相比，我们的方法在三个评估指标方面平均提高了约7%。使用我们的方法生成的去噪CT输出的概率密度分布与参考NDCT扫描最佳拟合。此外，我们的两阶段模型在客观和主观评估中均优于基于单阶段WGAN的模型，进一步证明了我们两阶段训练方法的更高有效性。