Sun Hongfei, Ren Ge, Teng Xinzhi, Song Liming, Li Kang, Yang Jianhua, Hu Xiaofei, Zhan Yuefu, Wan Shiu Bun Nelson, Wong Man Fung Esther, Chan King Kwong, Tsang Hoi Ching Hailey, Xu Lu, Wu Tak Chiu, Kong Feng-Ming Spring, Wang Yi Xiang J, Qin Jing, Chan Wing Chi Lawrence, Ying Michael, Cai Jing
Department of Health Technology and Informatics, The Hong Kong Polytechnic University, Hong Kong, China.
School of Automation, Northwestern Polytechnical University, Xi'an, China.
Quant Imaging Med Surg. 2023 Jan 1;13(1):394-416. doi: 10.21037/qims-22-610. Epub 2022 Nov 10.
The coronavirus disease 2019 (COVID-19) led to a dramatic increase in the number of cases of patients with pneumonia worldwide. In this study, we aimed to develop an AI-assisted multistrategy image enhancement technique for chest X-ray (CXR) images to improve the accuracy of COVID-19 classification.
Our new classification strategy consisted of 3 parts. First, the improved U-Net model with a variational encoder segmented the lung region in the CXR images processed by histogram equalization. Second, the residual net (ResNet) model with multidilated-rate convolution layers was used to suppress the bone signals in the 217 lung-only CXR images. A total of 80% of the available data were allocated for training and validation. The other 20% of the remaining data were used for testing. The enhanced CXR images containing only soft tissue information were obtained. Third, the neural network model with a residual cascade was used for the super-resolution reconstruction of low-resolution bone-suppressed CXR images. The training and testing data consisted of 1,200 and 100 CXR images, respectively. To evaluate the new strategy, improved visual geometry group (VGG)-16 and ResNet-18 models were used for the COVID-19 classification task of 2,767 CXR images. The accuracy of the multistrategy enhanced CXR images was verified through comparative experiments with various enhancement images. In terms of quantitative verification, 8-fold cross-validation was performed on the bone suppression model. In terms of evaluating the COVID-19 classification, the CXR images obtained by the improved method were used to train 2 classification models.
Compared with other methods, the CXR images obtained based on the proposed model had better performance in the metrics of peak signal-to-noise ratio and root mean square error. The super-resolution CXR images of bone suppression obtained based on the neural network model were also anatomically close to the real CXR images. Compared with the initial CXR images, the classification accuracy rates of the internal and external testing data on the VGG-16 model increased by 5.09% and 12.81%, respectively, while the values increased by 3.51% and 18.20%, respectively, for the ResNet-18 model. The numerical results were better than those of the single-enhancement, double-enhancement, and no-enhancement CXR images.
The multistrategy enhanced CXR images can help to classify COVID-19 more accurately than the other existing methods.
2019年冠状病毒病(COVID-19)导致全球肺炎患者病例数急剧增加。在本研究中,我们旨在开发一种用于胸部X光(CXR)图像的人工智能辅助多策略图像增强技术,以提高COVID-19分类的准确性。
我们的新分类策略由三部分组成。首先,具有变分编码器的改进型U-Net模型对经直方图均衡化处理的CXR图像中的肺区域进行分割。其次,使用具有多扩张率卷积层的残差网络(ResNet)模型来抑制217张仅含肺部的CXR图像中的骨骼信号。总共80%的可用数据用于训练和验证。其余20%的数据用于测试。获得了仅包含软组织信息的增强型CXR图像。第三,使用具有残差级联的神经网络模型对低分辨率骨骼抑制的CXR图像进行超分辨率重建。训练和测试数据分别由1200张和100张CXR图像组成。为了评估新策略,使用改进的视觉几何组(VGG)-16和ResNet-18模型对2767张CXR图像进行COVID-19分类任务。通过与各种增强图像的对比实验验证了多策略增强型CXR图像的准确性。在定量验证方面,对骨骼抑制模型进行了8折交叉验证。在评估COVID-19分类方面,使用改进方法获得的CXR图像训练了2个分类模型。
与其他方法相比,基于所提出模型获得的CXR图像在峰值信噪比和均方根误差指标上具有更好的性能。基于神经网络模型获得的骨骼抑制超分辨率CXR图像在解剖结构上也与真实CXR图像接近。与初始CXR图像相比,VGG-16模型上内部和外部测试数据的分类准确率分别提高了5.09%和12.81%,而ResNet-18模型的该值分别提高了3.51%和18.20%。数值结果优于单增强、双增强和无增强的CXR图像。
多策略增强型CXR图像比其他现有方法能更准确地对COVID-19进行分类。