Suppr超能文献

PneuNet:使用 Vision Transformer 进行胸部 X 射线图像分析的 COVID-19 肺炎诊断的深度学习。

PneuNet: deep learning for COVID-19 pneumonia diagnosis on chest X-ray image analysis using Vision Transformer.

机构信息

Department of Mechanical Engineering, Tsinghua University, Beijing, 100084, China.

State Key Laboratory of Tribology in Advanced Equipment, Tsinghua University, Beijing, 100084, China.

出版信息

Med Biol Eng Comput. 2023 Jun;61(6):1395-1408. doi: 10.1007/s11517-022-02746-2. Epub 2023 Jan 31.

Abstract

A long-standing challenge in pneumonia diagnosis is recognizing the pathological lung texture, especially the ground-glass appearance pathological texture. One main difficulty lies in precisely extracting and recognizing the pathological features. The patients, especially those with mild symptoms, show very little difference in lung texture, neither conventional computer vision methods nor convolutional neural networks perform well on pneumonia diagnosis based on chest X-ray (CXR) images. In the meanwhile, the Coronavirus Disease 2019 (COVID-19) pandemic continues wreaking havoc around the world, where quick and accurate diagnosis backed by CXR images is in high demand. Rather than simply recognizing the patterns, extracting feature maps from the original CXR image is what we need in the classification process. Thus, we propose a Vision Transformer (VIT)-based model called PneuNet to make an accurate diagnosis backed by channel-based attention through X-ray images of the lung, where multi-head attention is applied on channel patches rather than feature patches. The techniques presented in this paper are oriented toward the medical application of deep neural networks and VIT. Extensive experiment results show that our method can reach 94.96% accuracy in the three-categories classification problem on the test set, which outperforms previous deep learning models.

摘要

肺炎诊断中的一个长期挑战是识别病理肺纹理,特别是磨玻璃样外观的病理纹理。主要的困难之一在于精确地提取和识别病理特征。患者,特别是那些症状较轻的患者,肺纹理几乎没有差异,无论是传统的计算机视觉方法还是卷积神经网络,都无法很好地基于胸部 X 光(CXR)图像进行肺炎诊断。与此同时,2019 年冠状病毒病(COVID-19)大流行继续在全球肆虐,对 CXR 图像支持的快速准确诊断的需求很高。在分类过程中,我们需要的不是简单地识别模式,而是从原始 CXR 图像中提取特征图。因此,我们提出了一种基于 Vision Transformer(VIT)的模型,称为 PneuNet,通过肺部的 X 射线图像进行基于通道注意力的准确诊断,其中多头注意力应用于通道补丁而不是特征补丁。本文提出的技术针对的是深度神经网络和 VIT 在医学中的应用。广泛的实验结果表明,我们的方法在测试集上的三分类问题中可以达到 94.96%的准确率,优于以前的深度学习模型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e7d/9887581/fd8c65e4265c/11517_2022_2746_Fig1_HTML.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验