Frenkel Charlotte, Lefebvre Martin, Bol David
Institute of Neuroinformatics, University of Zürich and ETH Zürich, Zurich, Switzerland.
ICTEAM Institute, Université catholique de Louvain, Louvain-la-Neuve, Belgium.
Front Neurosci. 2021 Feb 10;15:629892. doi: 10.3389/fnins.2021.629892. eCollection 2021.
While the backpropagation of error algorithm enables deep neural network training, it implies (i) bidirectional synaptic weight transport and (ii) update locking until the forward and backward passes are completed. Not only do these constraints preclude biological plausibility, but they also hinder the development of low-cost adaptive smart sensors at the edge, as they severely constrain memory accesses and entail buffering overhead. In this work, we show that the one-hot-encoded labels provided in supervised classification problems, denoted as targets, can be viewed as a proxy for the error sign. Therefore, their fixed random projections enable a layerwise feedforward training of the hidden layers, thus solving the weight transport and update locking problems while relaxing the computational and memory requirements. Based on these observations, we propose the direct random target projection (DRTP) algorithm and demonstrate that it provides a tradeoff between accuracy and computational cost that is suitable for adaptive edge computing devices.
虽然误差反向传播算法能够实现深度神经网络的训练,但它意味着(i)双向突触权重传输以及(ii)在正向和反向传播完成之前进行更新锁定。这些限制不仅排除了生物学上的合理性,还阻碍了低成本自适应智能传感器在边缘设备的发展,因为它们严重限制了内存访问并带来了缓冲开销。在这项工作中,我们表明,在监督分类问题中提供的独热编码标签(表示为目标)可以被视为误差符号的代理。因此,它们的固定随机投影能够对隐藏层进行逐层前馈训练,从而解决权重传输和更新锁定问题,同时放宽计算和内存要求。基于这些观察结果,我们提出了直接随机目标投影(DRTP)算法,并证明它在准确性和计算成本之间提供了一种适合自适应边缘计算设备的权衡。