School of Biomedical Engineering, Southern Medical University, Guangzhou, 510515, China; Guangdong Provincial Key Laboratory of Medical Image Processing, Southern Medical University, Guangzhou, 510515, China; Guangdong Province Engineering Laboratory for Medical Imaging and Diagnostic Technology, Southern Medical University, Guangzhou, 510515, China.
School of Biomedical Engineering, Southern Medical University, Guangzhou, 510515, China; Guangdong Provincial Key Laboratory of Medical Image Processing, Southern Medical University, Guangzhou, 510515, China; Guangdong Province Engineering Laboratory for Medical Imaging and Diagnostic Technology, Southern Medical University, Guangzhou, 510515, China.
Comput Med Imaging Graph. 2022 Dec;102:102137. doi: 10.1016/j.compmedimag.2022.102137. Epub 2022 Oct 18.
Automatic chest X-ray (CXR) disease classification has drawn increasing public attention as CXR is widely used in thoracic disease diagnosis. Existing classification networks typically employ a global average pooling layer to produce the final feature for the subsequent classifier. This limits the classification performance owing to the characteristics of lesions in CXR images, including small relative sizes, varied absolute sizes, and different occurrence locations. In this study, we propose a pixel-wise classification and attention network (PCAN) to simultaneously perform disease classification and weakly supervised localization, which provides interpretability for disease classification. The PCAN comprises a backbone network for extracting mid-level features, a pixel-wise classification branch (pc-branch) for generating pixel-wise diagnoses, and a pixel-wise attention branch (pa-branch) for producing pixel-wise weights. The pc-branch is capable of explicitly detecting small lesions, and the pa-branch is capable of adaptively focusing on different regions when classifying different thoracic diseases. Then, the pixel-wise diagnoses are multiplied with the pixel-wise weights to obtain the disease localization map, which provides the sizes and locations of lesions in a manner of weakly supervised learning. The final image-wise diagnosis is obtained by summing up the disease localization map at the spatial dimension. Comprehensive experiments conducted on the ChestX-ray14 and CheXpert datasets demonstrate the effectiveness of the proposed PCAN, which has great potential for thoracic disease diagnosis and treatment. The source codes are available at https://github.com/fzfs/PCAN.
自动胸部 X 射线 (CXR) 疾病分类作为 CXR 在胸部疾病诊断中的广泛应用而受到越来越多的关注。现有的分类网络通常采用全局平均池化层来生成后续分类器的最终特征。这限制了分类性能,因为 CXR 图像中的病变具有以下特点:相对大小小、绝对大小不同和发生位置不同。在这项研究中,我们提出了一种像素分类和注意网络 (PCAN),以同时进行疾病分类和弱监督定位,为疾病分类提供可解释性。PCAN 包括一个用于提取中级特征的骨干网络、一个用于生成像素级诊断的像素分类分支 (pc-分支) 和一个用于生成像素级权重的像素注意分支 (pa-分支)。pc-分支能够明确检测到小病变,而 pa-分支在对不同的胸部疾病进行分类时能够自适应地关注不同的区域。然后,将像素级诊断乘以像素级权重,以获得疾病定位图,以弱监督学习的方式提供病变的大小和位置。最后通过在空间维度上对疾病定位图求和得到图像级诊断。在 ChestX-ray14 和 CheXpert 数据集上进行的综合实验证明了所提出的 PCAN 的有效性,它在胸部疾病诊断和治疗方面具有很大的潜力。源代码可在 https://github.com/fzfs/PCAN 上获得。