Zhang Ruize, Wang Liejun, Cheng Shuli, Song Shiji
College of Information Science and Engineering, Xinjiang University, Urumqi, 830046, Xinjiang, China.
Department of Automation, Tsinghua University, Beijing, 100084, China.
Expert Syst Appl. 2023 Oct 15;228:120389. doi: 10.1016/j.eswa.2023.120389. Epub 2023 May 10.
Recent years have witnessed a growing interest in neural network-based medical image classification methods, which have demonstrated remarkable performance in this field. Typically, convolutional neural network (CNN) architectures have been commonly employed to extract local features. However, the transformer, a newly emerged architecture, has gained popularity due to its ability to explore the relevance of remote elements in an image through a self-attention mechanism. Despite this, it is crucial to establish not only local connectivity but also remote relationships between lesion features and capture the overall image structure to improve image classification accuracy. Therefore, to tackle the aforementioned issues, this paper proposes a network based on multilayer perceptrons (MLPs) that can learn the local features of medical images on the one hand and capture the overall feature information in both spatial and channel dimensions on the other hand, thus utilizing image features effectively. This paper has been extensively validated on COVID19-CT dataset and ISIC 2018 dataset, and the results show that the method in this paper is more competitive and has higher performance in medical image classification compared with existing methods. This shows that the use of MLP to capture image features and establish connections between lesions is expected to provide novel ideas for medical image classification tasks in the future.
近年来,基于神经网络的医学图像分类方法越来越受到关注,这些方法在该领域已展现出卓越的性能。通常,卷积神经网络(CNN)架构被广泛用于提取局部特征。然而,Transformer这一新兴架构因其能够通过自注意力机制探索图像中远程元素的相关性而受到欢迎。尽管如此,为提高图像分类准确率,不仅要建立局部连通性,还要建立病变特征之间的远程关系并捕捉整体图像结构,这一点至关重要。因此,为解决上述问题,本文提出一种基于多层感知器(MLP)的网络,该网络一方面能够学习医学图像的局部特征,另一方面能够在空间和通道维度上捕捉整体特征信息,从而有效利用图像特征。本文在COVID19-CT数据集和ISIC 2018数据集上进行了广泛验证,结果表明,与现有方法相比,本文方法在医学图像分类中更具竞争力且性能更高。这表明使用MLP捕捉图像特征并建立病变之间的联系有望为未来的医学图像分类任务提供新思路。