Compact Neural Architecture Designs by Tensor Representations

Authors

Su Jiahao, Li Jingling, Liu Xiaoyu, Ranadive Teresa, Coley Christopher, Tuan Tai-Ching, Huang Furong

Affiliations

Department of Electrical and Computer Engineering, University of Maryland, College Park, MD, United States.

Department of Computer Science, University of Maryland, College Park, MD, United States.

Publication

Front Artif Intell. 2022 Mar 8;5:728761. doi: 10.3389/frai.2022.728761. eCollection 2022.

DOI: 10.3389/frai.2022.728761
PMID: 35355829
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8959219/
Abstract

We propose a framework of tensorial neural networks (TNNs) extending existing linear layers on low-order tensors to multilinear operations on higher-order tensors. TNNs have three advantages over existing networks: First, TNNs naturally apply to higher-order data without flattening, which preserves their multi-dimensional structures. Second, compressing a pre-trained network into a TNN results in a model with similar expressive power but fewer parameters. Finally, TNNs interpret advanced compact designs of network architectures, such as bottleneck modules and interleaved group convolutions. To learn TNNs, we derive their backpropagation rules using a novel suite of generalized tensor algebra. With backpropagation, we can learn TNNs either from scratch or from pre-trained models using knowledge distillation. Experiments on VGG, ResNet, and Wide-ResNet demonstrate that TNNs outperform state-of-the-art low-rank methods on a wide range of backbone networks and datasets.
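The abstract's compression claim can be made concrete with a small sketch. The snippet below is a rough illustration, not the authors' implementation: the layer size, rank, and core shapes are assumptions chosen for readability, and the paper's TNNs cover a broader family of multilinear layers than this two-core example. It reshapes a dense layer's weight matrix into a higher-order tensor and replaces it with two small tensor-train-style cores, which is where the parameter savings in such factorized designs come from:

```python
# A minimal sketch of the parameter-savings idea behind factorized layers.
# All shapes here are hypothetical, not taken from the paper.
import numpy as np

# A dense 1024 -> 1024 layer stores 1024 * 1024 = 1,048,576 parameters.
W = np.random.randn(1024, 1024)

# View the weight matrix as an order-4 tensor by factoring each mode:
# 1024 = 32 * 32 on both the input and output sides.
T = W.reshape(32, 32, 32, 32)

# Two tensor-train-style cores with a small internal rank r.
r = 8
G1 = np.random.randn(32, 32, r)   # (input mode 1, output mode 1, rank)
G2 = np.random.randn(r, 32, 32)   # (rank, input mode 2, output mode 2)

# Contracting the cores yields a full order-4 tensor of the same shape:
# T_approx[i1, o1, i2, o2] = sum_k G1[i1, o1, k] * G2[k, i2, o2]
T_approx = np.einsum('abk,kcd->abcd', G1, G2)
assert T_approx.shape == T.shape

dense_params = W.size             # 1,048,576
core_params = G1.size + G2.size   # 2 * (32 * 32 * 8) = 16,384
print(f"dense: {dense_params}, cores: {core_params}, "
      f"ratio: {dense_params // core_params}x")  # 64x fewer parameters
```

Here the cores are random, so T_approx does not actually approximate T; in the setting the abstract describes, the cores would be fitted to the pre-trained weights (e.g., via knowledge distillation) or trained from scratch, and the forward pass would contract the input against the cores directly rather than rebuilding the dense matrix.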

Figures 1-9 (full-size images, PMC):
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/97a4/8959219/298ea684a9f4/frai-05-728761-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/97a4/8959219/d773e1a51c4e/frai-05-728761-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/97a4/8959219/625e38dd6033/frai-05-728761-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/97a4/8959219/a07891593174/frai-05-728761-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/97a4/8959219/64f6b0093c36/frai-05-728761-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/97a4/8959219/d2e3b0bd7be6/frai-05-728761-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/97a4/8959219/e09a65047b62/frai-05-728761-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/97a4/8959219/d57a76c01986/frai-05-728761-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/97a4/8959219/a4191044dba6/frai-05-728761-g0009.jpg

Similar Articles

1. Compact Neural Architecture Designs by Tensor Representations.
Front Artif Intell. 2022 Mar 8;5:728761. doi: 10.3389/frai.2022.728761. eCollection 2022.
2. Stable tensor neural networks for efficient deep learning.
Front Big Data. 2024 May 30;7:1363978. doi: 10.3389/fdata.2024.1363978. eCollection 2024.
3. Multi-way backpropagation for training compact deep neural networks.
Neural Netw. 2020 Jun;126:250-261. doi: 10.1016/j.neunet.2020.03.001. Epub 2020 Mar 26.
4. Taylor Neural Network for Real-World Image Super-Resolution.
IEEE Trans Image Process. 2023;32:1942-1951. doi: 10.1109/TIP.2023.3255107. Epub 2023 Mar 24.
5. ADA-Tucker: Compressing deep neural networks via adaptive dimension adjustment tucker decomposition.
Neural Netw. 2019 Feb;110:104-115. doi: 10.1016/j.neunet.2018.10.016. Epub 2018 Nov 13.
6. Balanced Unfolding Induced Tensor Nuclear Norms for High-Order Tensor Completion.
IEEE Trans Neural Netw Learn Syst. 2025 Mar;36(3):4724-4737. doi: 10.1109/TNNLS.2024.3373384. Epub 2025 Feb 28.
7. Nonlinear tensor train format for deep neural network compression.
Neural Netw. 2021 Dec;144:320-333. doi: 10.1016/j.neunet.2021.08.028. Epub 2021 Sep 8.
8. Deep Neural Network Self-Distillation Exploiting Data Representation Invariance.
IEEE Trans Neural Netw Learn Syst. 2022 Jan;33(1):257-269. doi: 10.1109/TNNLS.2020.3027634. Epub 2022 Jan 5.
9. Improving efficiency in convolutional neural networks with multilinear filters.
Neural Netw. 2018 Sep;105:328-339. doi: 10.1016/j.neunet.2018.05.017. Epub 2018 Jun 7.
10. Nonparametric tensor ring decomposition with scalable amortized inference.
Neural Netw. 2024 Jan;169:431-441. doi: 10.1016/j.neunet.2023.10.031. Epub 2023 Oct 27.
