Kong Fanning, Shi Zaifeng, Cao Huaisheng, Hao Yudong, Cao Qingjie
School of Microelectronics, Tianjin University, Tianjin 300072, People's Republic of China.
School of Mathematical Sciences, Tianjin Normal University, Tianjin 300387, People's Republic of China.
Phys Med Biol. 2025 Mar 10;70(6). doi: 10.1088/1361-6560/adbaae.
. Metal artifacts severely damaged human tissue information from the computed tomography (CT) image, posing significant challenges to disease diagnosis. Deep learning has been widely explored for the metal artifact reduction (MAR) task. Nevertheless, paired metal artifact CT datasets suitable for training do not exist in reality. Although the synthetic CT image dataset provides additional training data, the trained networks still generalize poorly to real metal artifact data.A self-supervised U-shaped transformer network is proposed to focus on model generalizability enhancement in MAR tasks. This framework consists of a self-supervised mask reconstruction pre-text task and a down-stream task. In the pre-text task, the CT images are randomly corrupted by masks. They are recovered with themselves as the label, aiming at acquiring the artifacts and tissue structure of the actual physical situation. Down-stream task fine-tunes MAR target through labeled images. Utilizing the multi-layer long-range feature extraction capabilities of the Transformer efficiently captures features of metal artifacts. The incorporation of the MAR bottleneck allows for the distinction of metal artifact features through cross-channel self-attention.. Experiments demonstrate that the framework maintains strong generalization ability in the MAR task, effectively preserving tissue details while suppressing metal artifacts. The results achieved a peak signal-to-noise ratio of 43.86 dB and a structural similarity index of 0.9863 while ensuring the efficiency of the model inference. In addition, the Dice coefficient and mean intersection over union are improved by 11.70% and 9.51% in the segmentation of the MAR image, respectively.The combination of unlabeled real-artifact CT images and labeled synthetic-artifact CT images facilitates a self-supervised learning process that positively contributes to model generalizability.
金属伪影严重破坏了计算机断层扫描(CT)图像中的人体组织信息,给疾病诊断带来了重大挑战。深度学习已被广泛用于金属伪影减少(MAR)任务。然而,现实中不存在适合训练的配对金属伪影CT数据集。尽管合成CT图像数据集提供了额外的训练数据,但训练后的网络对真实金属伪影数据的泛化能力仍然很差。提出了一种自监督U型变压器网络,以专注于MAR任务中模型泛化能力的增强。该框架由一个自监督掩码重建预训练任务和一个下游任务组成。在预训练任务中,CT图像被掩码随机破坏。以自身作为标签进行恢复,旨在获取实际物理情况的伪影和组织结构。下游任务通过标记图像对MAR目标进行微调。利用Transformer的多层远程特征提取能力有效地捕获金属伪影的特征。MAR瓶颈的引入允许通过跨通道自注意力区分金属伪影特征。实验表明,该框架在MAR任务中保持了强大的泛化能力,在抑制金属伪影的同时有效地保留了组织细节。结果在确保模型推理效率的同时,实现了43.86 dB的峰值信噪比和0.9863的结构相似性指数。此外,在MAR图像分割中,Dice系数和平均交并比分别提高了11.70%和9.51%。未标记的真实伪影CT图像和标记的合成伪影CT图像的结合促进了自监督学习过程,对模型泛化能力有积极贡献。