Center for Brain Science, Harvard University, Cambridge, Massachusetts 02138, USA.
Department of Physics, Harvard University, Cambridge, Massachusetts 02138, USA.
Phys Rev E. 2021 Feb;103(2-1):022404. doi: 10.1103/PhysRevE.103.022404.
Many sensory pathways in the brain include sparsely active populations of neurons downstream from the input stimuli. The biological purpose of this expanded structure is unclear, but it may be beneficial due to the increased expressive power of the network. In this work, we show that certain ways of expanding a neural network can improve its generalization performance even when the expanded structure is pruned after the learning period. To study this setting, we use a teacher-student framework in which a perceptron teacher network generates labels corrupted by a small amount of noise. We then train a student network that is structurally matched to the teacher and can therefore achieve optimal accuracy if given the teacher's synaptic weights. We find that sparsely expanding the input layer of the student perceptron both increases its capacity and improves its generalization performance when learning a noisy rule from the teacher, even when the expansion is pruned after learning. We find similar behavior when the expanded units are stochastic and uncorrelated with the input, and we analyze this network in the mean-field limit. By solving the mean-field equations, we show that the generalization error of the stochastically expanded student network continues to drop as the size of the network increases. This improvement in generalization performance occurs despite the increased complexity of the student network relative to the teacher it is trying to learn. We show that this effect is closely related to the addition of slack variables in artificial neural networks and suggest possible implications for artificial and biological neural networks.
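A standard formalization of the noisy teacher-student setup described above, written here in notation that is our assumption rather than the paper's, has a teacher perceptron with weights $\mathbf{w}^{*}$ assign each input $\mathbf{x}^{\mu}$ a binary label corrupted by a small noise term $\eta^{\mu}$:

$$
y^{\mu} = \operatorname{sign}\!\left(\mathbf{w}^{*}\cdot\mathbf{x}^{\mu} + \eta^{\mu}\right), \qquad \eta^{\mu}\sim\mathcal{N}(0,\sigma^{2}),
$$

with the student's generalization error measured as the probability of disagreeing with the noiseless teacher on a fresh input, $\varepsilon_{g} = \Pr\!\big[\operatorname{sign}(\mathbf{w}\cdot\mathbf{x}) \neq \operatorname{sign}(\mathbf{w}^{*}\cdot\mathbf{x})\big]$.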
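As a concrete illustration of the expand-train-prune experiment, the following is a minimal finite-size simulation sketch in Python with NumPy. All names and parameter values (input dimension N, expansion size M, the sparse random projection used for the expansion, the classical perceptron learning rule, and the label-noise level) are illustrative assumptions, not the authors' exact protocol; the paper's quantitative results are obtained in the mean-field limit.

```python
import numpy as np

rng = np.random.default_rng(0)

N = 100         # input dimension (teacher size) -- assumed value
M = 400         # size of the sparse expansion layer -- assumed value
P = 600         # number of training examples
sparsity = 0.1  # fraction of nonzero expansion weights -- assumed
noise = 0.1     # label-noise amplitude -- assumed

# Teacher perceptron: generates labels corrupted by a small noise term.
w_teacher = rng.standard_normal(N)
X = rng.standard_normal((P, N))
y = np.sign(X @ w_teacher + noise * rng.standard_normal(P))

# Fixed sparse random expansion of the input layer (one assumed choice).
mask = rng.random((N, M)) < sparsity
F = rng.standard_normal((N, M)) * mask
H = np.sign(X @ F)              # expanded representation of the inputs

# The student sees the original inputs plus the expanded units.
Z = np.hstack([X, H])

# Train the student with the classical perceptron update rule.
w_student = np.zeros(N + M)
for _ in range(200):            # training epochs
    for mu in rng.permutation(P):
        if np.sign(Z[mu] @ w_student) != y[mu]:
            w_student += y[mu] * Z[mu]

# Prune the expansion after learning: keep only the input-layer weights.
w_pruned = w_student[:N]

# Generalization error of the pruned student against the noiseless teacher.
X_test = rng.standard_normal((5000, N))
y_test = np.sign(X_test @ w_teacher)
err = np.mean(np.sign(X_test @ w_pruned) != y_test)
print(f"pruned-student generalization error: {err:.3f}")
```

Comparing `err` against the same pipeline run with `M = 0` (no expansion) is one way to probe, at finite size, the effect the abstract describes: whether training with the expansion and discarding it afterward still leaves the pruned student closer to the teacher.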