Liu Yiwen, Xing Wenyu, Zhao Mingbo, Lin Mingquan
College of Information Science and Technology, Donghua University, Shanghai, People's Republic of China.
School of Information Science and Technology, Fudan University, Shanghai, People's Republic of China.
Neural Comput Appl. 2023 Apr 28:1-13. doi: 10.1007/s00521-023-08604-y.
During the past three years, the coronavirus disease 2019 (COVID-19) has swept the world. The rapid and accurate recognition of covid-19 pneumonia are ,therefore, of great importance. To handle this problem, we propose a new pipeline of deep learning framework for diagnosing COVID-19 pneumonia via chest X-ray images from normal, COVID-19, and other pneumonia patients. In detail, the self-trained YOLO-v4 network was first used to locate and segment the thoracic region, and the output images were scaled to the same size. Subsequently, the pre-trained convolutional neural network was adopted to extract the features of X-ray images from 13 convolutional layers, which were fused with the original image to form a 14-dimensional image matrix. It was then put into three parallel pyramid multi-layer perceptron (MLP)-Mixer modules for comprehensive feature extraction through spatial fusion and channel fusion based on different scales so as to grasp more extensive feature correlation. Finally, by combining all image features from the 14-channel output, the classification task was achieved using two fully connected layers as well as Softmax classifier for classification. Extensive simulations based on a total of 4099 chest X-ray images were conducted to verify the effectiveness of the proposed method. Experimental results indicated that our proposed method can achieve the best performance in almost all cases, which is good for auxiliary diagnosis of COVID-19 and has great clinical application potential.
在过去三年里,2019冠状病毒病(COVID-19)席卷全球。因此,快速准确地识别COVID-19肺炎至关重要。为解决这一问题,我们提出了一种新的深度学习框架流程,用于通过来自正常、COVID-19和其他肺炎患者的胸部X光图像诊断COVID-19肺炎。具体而言,首先使用自训练的YOLO-v4网络定位并分割胸部区域,然后将输出图像缩放到相同大小。随后,采用预训练的卷积神经网络从13个卷积层提取X光图像的特征,并将这些特征与原始图像融合,形成一个14维的图像矩阵。接着将其放入三个并行的金字塔多层感知器(MLP)-Mixer模块中,通过基于不同尺度的空间融合和通道融合进行全面的特征提取,以便掌握更广泛的特征相关性。最后,通过组合来自14通道输出的所有图像特征,使用两个全连接层以及Softmax分类器完成分类任务。基于总共4099张胸部X光图像进行了广泛的模拟,以验证所提方法的有效性。实验结果表明,我们提出的方法在几乎所有情况下都能取得最佳性能,这有利于COVID-19的辅助诊断,具有很大的临床应用潜力。