Baldi Pierre, Sadowski Peter, Lu Zhiqin
Department of Computer Science, University of California, Irvine.
Department of Mathematics, University of California, Irvine.
Artif Intell. 2018 Jul;260:1-35. doi: 10.1016/j.artint.2018.03.003. Epub 2018 Apr 3.
Random backpropagation (RBP) is a variant of the backpropagation algorithm for training neural networks, in which the transposes of the forward matrices are replaced by fixed random matrices in the calculation of the weight updates. It is remarkable both because of its effectiveness, in spite of using random matrices to communicate error information, and because it completely removes the taxing requirement of maintaining symmetric weights in a physical neural system. To better understand random backpropagation, we first connect it to the notions of local learning and learning channels. Through this connection, we derive several alternatives to RBP, including skipped RBP (SRBP), adaptive RBP (ARBP), sparse RBP, and their combinations (e.g. ASRBP), and analyze their computational complexity. We then study their behavior through simulations using the MNIST and CIFAR-10 benchmark datasets. These simulations show that most of these variants work robustly, almost as well as backpropagation, and that multiplication by the derivatives of the activation functions is important. As a follow-up, we also study the low end of the number of bits required to communicate error information over the learning channel. We then provide partial intuitive explanations for some of the remarkable properties of RBP and its variations. Finally, we prove several mathematical results, including the convergence to fixed points of linear chains of arbitrary length, the convergence to fixed points of linear autoencoders with decorrelated data, the long-term existence of solutions for linear systems with a single hidden layer and convergence in special cases, and the convergence to fixed points of non-linear chains, when the derivative of the activation functions is included.
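To make the abstract's core idea concrete, the following is a minimal NumPy sketch (not from the paper) of the RBP weight update for a one-hidden-layer network on a toy regression task: the error is propagated backwards through a fixed random matrix `B` instead of the transpose of the forward matrix `W2`, while the multiplication by the derivative of the activation function is kept, as the abstract says is important. All names and hyperparameters here are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy regression task: learn a random linear map with a one-hidden-layer net.
n_in, n_hid, n_out, n_samples = 8, 16, 4, 200
X = rng.standard_normal((n_in, n_samples))
T = rng.standard_normal((n_out, n_in)) @ X          # targets

W1 = rng.standard_normal((n_hid, n_in)) * 0.1       # forward weights, layer 1
W2 = rng.standard_normal((n_out, n_hid)) * 0.1      # forward weights, layer 2
B  = rng.standard_normal((n_hid, n_out)) * 0.1      # fixed random feedback matrix (replaces W2.T)

tanh_prime = lambda a: 1.0 - np.tanh(a) ** 2
lr = 0.01

def loss():
    return 0.5 * np.mean((W2 @ np.tanh(W1 @ X) - T) ** 2)

loss_before = loss()

for step in range(500):
    A1 = W1 @ X                 # hidden pre-activations
    H  = np.tanh(A1)            # hidden activations
    Y  = W2 @ H                 # linear output layer
    E  = Y - T                  # output error

    # RBP: error travels backwards through the fixed random matrix B,
    # but is still multiplied by the derivative of the activation function.
    # (Standard backpropagation would use W2.T here instead of B.)
    delta1 = (B @ E) * tanh_prime(A1)

    W2 -= lr * (E @ H.T) / n_samples
    W1 -= lr * (delta1 @ X.T) / n_samples

loss_after = loss()
```

Despite the feedback matrix carrying no information about the forward weights, the training loss still decreases, which is the "remarkable effectiveness" the abstract refers to; swapping `B @ E` for `W2.T @ E` recovers ordinary backpropagation.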