

ResMLP: Feedforward Networks for Image Classification With Data-Efficient Training.

Publication Information

IEEE Trans Pattern Anal Mach Intell. 2023 Apr;45(4):5314-5321. doi: 10.1109/TPAMI.2022.3206148. Epub 2023 Mar 7.

Abstract

We present ResMLP, an architecture built entirely upon multi-layer perceptrons for image classification. It is a simple residual network that alternates (i) a linear layer in which image patches interact, independently and identically across channels, and (ii) a two-layer feed-forward network in which channels interact independently per patch. When trained with a modern training strategy using heavy data-augmentation and optionally distillation, it attains surprisingly good accuracy/complexity trade-offs on ImageNet. We also train ResMLP models in a self-supervised setup, to further remove priors from employing a labelled dataset. Finally, by adapting our model to machine translation we achieve surprisingly good results. We share pre-trained models and our code based on the Timm library.
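The block structure described above can be sketched in a few lines. The following is a minimal, hypothetical NumPy sketch of one ResMLP residual block, written only from the abstract's description: it omits details of the full architecture (e.g. normalization, learned scaling, and patch embedding), and all names and shapes are assumptions for illustration, not the authors' implementation.

```python
import numpy as np

def gelu(x):
    # Tanh approximation of the GELU activation.
    return 0.5 * x * (1.0 + np.tanh(np.sqrt(2.0 / np.pi) * (x + 0.044715 * x**3)))

def resmlp_block(x, w_patch, w1, w2):
    """One simplified ResMLP residual block.

    x       : (num_patches, channels) patch representations
    w_patch : (num_patches, num_patches) cross-patch mixing weights
    w1, w2  : weights of the per-patch two-layer feed-forward network

    (i)  Cross-patch sublayer: a linear layer in which image patches
         interact; it is applied identically to every channel.
    (ii) Cross-channel sublayer: a two-layer feed-forward network in
         which channels interact independently for each patch.
    Both sublayers use residual connections.
    """
    # (i) patches interact: linear map over the patch dimension
    x = x + w_patch @ x
    # (ii) channels interact per patch: two-layer MLP over channels
    x = x + gelu(x @ w1) @ w2
    return x
```

Stacking such blocks, preceded by a patch-embedding layer and followed by average pooling and a linear classifier, yields the kind of all-MLP classifier the abstract describes.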

