Clark Darin P, Cao Joseph Y, Badea Cristian T
Quantitative Imaging and Analysis Lab, Department of Radiology, Duke University, Durham, North Carolina, USA.
Center for Virtual Imaging Trials, Carl E. Ravin Advanced Imaging Laboratories, Department of Radiology, Duke University, Durham, North Carolina, USA.
Med Phys. 2025 Jul;52(7):e17918. doi: 10.1002/mp.17918.
The judicious use of CT in pediatric cardiac applications is warranted because young patients face the need for repeated imaging and increased lifetime cancer risk after ionizing radiation exposure. The quality of pediatric cardiac CT scans is variable because of limited protocols optimizations for pediatric patients, the common presence of metallic implants following treatment, and disparities in denoising algorithm performance between adult and pediatric scans. Two recent technological developments promise to improve the average quality of pediatric CT scans at fixed or reduced dose: clinical photon-counting CT (PCCT) and deep learning (DL) algorithms for CT image denoising. Given advancements to accommodate variable image quality, these technologies will deliver improved spatial resolution, noise performance, and contrast resolution for pediatric cardiac CT imaging.
To advance self-supervised DL denoising methods to accommodate variable image quality in pediatric cardiac CT data.
Starting with the popular Vision Transformer (ViT) DL architecture, two targeted architectural changes were made: (1) the multi-layer perceptrons (MLPs) were modified to allow cross-token recombination of encoded image data following attention computations (parallels patch-wise weighting and averaging in non-local means [NLM]), and (2) the network head was replaced with the equivalent of an overcomplete dictionary to perform dictionary sparse coding (SC). This modified, 3D ViT (mViT) was then trained in a dynamic fashion: the balance between data fidelity and representation sparsity was adjusted during training such that the average fidelity error remained consistent with localized estimates of image noise. To demonstrate the newly proposed method, the mViT was trained with pediatric cardiac photon-counting x-ray CT data with variable levels of image noise (NAEOTOM Alpha PCCT scanner; retrospective data from 20 patients scanned at Duke University; ages: 1-18 years; iterative reconstruction noise level in the left ventricle: 20-55 HU). Data from one patient with the highest levels of noise was reserved for validation. Testing data included Alpha data from three additional Duke patients (2 < 1 year old) and a murine cardiac PCCT data set acquired on a preclinical system.
The validation denoising results demonstrate that SC with the mViT preserves anatomic structures relevant to the diagnosis and treatment of congenital heart defects (coronary artery origins; valve leaflets; left ventricle boundaries) while achieving similar intensity bias and lower intensity variance values than competing denoising methods (bilateral filtration [BF], NLM, dictionary SC, block matching 4D, orthogonal matching pursuit, Noise2Void). Applying the trained mViT network to preclinical PCCT demonstrated robust generalization performance to high levels of image noise (∼230 HU) and differing image contrast; however, applying the network to clinical PCCT data in younger patients (< 1 year old) demonstrated some smoothing of image details in data already heavily denoised during reconstruction.
This work demonstrates robust, self-supervised denoising of pediatric cardiac PCCT data through data adaptation during network training based on local noise estimates. The trained network generalizes to data sets with high levels of noise and differing image contrast relative to the training data, suggesting that self-supervised fine tuning may allow the trained network to address related CT denoising problems.
由于年轻患者需要反复成像,且电离辐射暴露后终生患癌风险增加,因此在儿科心脏应用中合理使用CT是必要的。儿科心脏CT扫描的质量参差不齐,原因包括针对儿科患者的扫描方案优化有限、治疗后常见金属植入物的存在,以及成人和儿科扫描在去噪算法性能上的差异。最近的两项技术进展有望在固定剂量或降低剂量的情况下提高儿科CT扫描的平均质量:临床光子计数CT(PCCT)和用于CT图像去噪的深度学习(DL)算法。鉴于在适应可变图像质量方面取得的进展,这些技术将为儿科心脏CT成像带来更高的空间分辨率、噪声性能和对比度分辨率。
推进自监督DL去噪方法,以适应儿科心脏CT数据中的可变图像质量。
从流行的视觉Transformer(ViT)DL架构开始,进行了两项有针对性的架构更改:(1)修改多层感知器(MLP),以便在注意力计算后允许对编码图像数据进行跨令牌重组(类似于非局部均值(NLM)中的逐块加权和平均),以及(2)用相当于一个超完备字典的结构替换网络头部,以执行字典稀疏编码(SC)。然后以动态方式训练这种经过修改的3D ViT(mViT):在训练过程中调整数据保真度和表示稀疏性之间的平衡,以使平均保真度误差与图像噪声的局部估计保持一致。为了演示新提出的方法,使用具有可变图像噪声水平的儿科心脏光子计数X射线CT数据(NAEOTOM Alpha PCCT扫描仪;来自杜克大学扫描的20名患者的回顾性数据;年龄:1 - 18岁;左心室的迭代重建噪声水平:20 - 55 HU)对mViT进行训练。将一名噪声水平最高的患者的数据留作验证。测试数据包括来自另外三名杜克患者(2名<1岁)的Alpha数据以及在临床前系统上获取的小鼠心脏PCCT数据集。
验证去噪结果表明,使用mViT进行的SC在保留与先天性心脏缺陷诊断和治疗相关的解剖结构(冠状动脉起源;瓣膜小叶;左心室边界)的同时,与竞争去噪方法(双边滤波[BF]、NLM、字典SC、块匹配4D、正交匹配追踪、Noise2Void)相比,实现了相似的强度偏差和更低的强度方差值。将训练好的mViT网络应用于临床前PCCT显示出对高水平图像噪声(约230 HU)和不同图像对比度的强大泛化性能;然而,将该网络应用于较年轻患者(<1岁)的临床PCCT数据时,显示出在重建过程中已经大量去噪的数据中图像细节有些平滑。
这项工作通过基于局部噪声估计在网络训练期间进行数据自适应,展示了对儿科心脏PCCT数据进行强大的自监督去噪。训练好的网络相对于训练数据能够泛化到具有高水平噪声和不同图像对比度的数据集,这表明自监督微调可能使训练好的网络能够解决相关的CT去噪问题。