Alaoui Abdalaoui Slimani Faiçal, Bentourkia M'hamed
Department of Nuclear Medicine and Radiobiology, 12th Avenue North, 3001, Sherbrooke, QC, J1H5N4, Canada.
Phys Eng Sci Med. 2025 Mar;48(1):59-73. doi: 10.1007/s13246-024-01489-8. Epub 2024 Nov 4.
Since its introduction in 2015, the U-Net architecture used in Deep Learning has played a crucial role in medical imaging. Recognized for its ability to accurately discriminate small structures, the U-Net has received more than 2600 citations in academic literature, which motivated continuous enhancements to its architecture. In hospitals, chest radiography is the primary diagnostic method for pulmonary disorders, however, accurate lung segmentation in chest X-ray images remains a challenging task, primarily due to the significant variations in lung shapes and the presence of intense opacities caused by various diseases. This article introduces a new approach for the segmentation of lung X-ray images. Traditional max-pooling operations, commonly employed in conventional U-Net++ models, were replaced with the discrete wavelet transform (DWT), offering a more accurate down-sampling technique that potentially captures detailed features of lung structures. Additionally, we used attention gate (AG) mechanisms that enable the model to focus on specific regions in the input image, which improves the accuracy of the segmentation process. When compared with current techniques like Atrous Convolutions, Improved FCN, Improved SegNet, U-Net, and U-Net++, our method (U-Net++-DWT) showed remarkable efficacy, particularly on the Japanese Society of Radiological Technology dataset, achieving an accuracy of 99.1%, specificity of 98.9%, sensitivity of 97.8%, Dice Coefficient of 97.2%, and Jaccard Index of 96.3%. Its performance on the Montgomery County dataset further demonstrated its consistent effectiveness. Moreover, when applied to additional datasets of Chest X-ray Masks and Labels and COVID-19, our method maintained high performance levels, achieving up to 99.3% accuracy, thereby underscoring its adaptability and potential for broad applications in medical imaging diagnostics.
自2015年推出以来,深度学习中使用的U-Net架构在医学成像中发挥了关键作用。U-Net因其能够准确区分小结构的能力而受到认可,在学术文献中已获得超过2600次引用,这促使其架构不断得到改进。在医院中,胸部X线摄影是肺部疾病的主要诊断方法,然而,胸部X线图像中的准确肺部分割仍然是一项具有挑战性的任务,主要原因是肺形状的显著变化以及各种疾病导致的强烈不透明度的存在。本文介绍了一种用于肺X线图像分割的新方法。传统U-Net++模型中常用的传统最大池化操作被离散小波变换(DWT)所取代,它提供了一种更准确的下采样技术,有可能捕捉肺结构的详细特征。此外,我们使用了注意力门(AG)机制,使模型能够专注于输入图像中的特定区域,从而提高了分割过程的准确性。与当前的技术如空洞卷积、改进的全卷积网络(FCN)、改进的SegNet、U-Net和U-Net++相比,我们的方法(U-Net++-DWT)显示出显著的效果,特别是在日本放射技术学会数据集上,准确率达到99.1%,特异性为98.9%,灵敏度为97.8%,骰子系数为97.2%,杰卡德指数为96.3%。其在蒙哥马利县数据集上的表现进一步证明了其持续有效性。此外,当应用于胸部X线掩码和标签以及新冠肺炎的其他数据集时,我们的方法保持了高性能水平,准确率高达99.3%,从而突出了其在医学成像诊断中的适应性和广泛应用潜力。