Chen Xing, Liu Dongshu, Laydevant Jérémie, Grollier Julie
Laboratoire Albert Fert, CNRS, Thales, Université Paris-Saclay, Palaiseau, France.
School of Applied and Engineering Physics, Cornell University, Ithaca, NY, USA.
Nat Commun. 2025 Jul 1;16(1):5978. doi: 10.1038/s41467-025-61037-0.
Agents that operate autonomously benefit from lifelong learning capabilities. However, compatible training algorithms must comply with the decentralized nature of these systems which imposes constraints on both the parameters counts and the computational resources. The Forward-Forward (FF) algorithm is one of these. FF relies only on feedforward operations, the same used for inference, for optimizing layer-wise objectives. This purely forward approach eliminates the need for transpose operations required in traditional backpropagation. Despite its potential, FF has failed to reach state-of-the-art performance on most standard benchmark tasks, in part due to unreliable negative data generation methods for unsupervised learning. In this work, we propose Self-Contrastive Forward-Forward (SCFF) algorithm, a competitive training method aimed at closing this performance gap. Inspired by standard self-supervised contrastive learning for vision tasks, SCFF generates positive and negative inputs applicable across various datasets. The method demonstrates superior performance compared to existing unsupervised local learning algorithms on several benchmark datasets, including MNIST, CIFAR-10, STL-10 and Tiny ImageNet. We extend FF's application to training recurrent neural networks, expanding its utility to sequential data tasks. These findings pave the way for high-accuracy, real-time learning on resource-constrained edge devices.
自主运行的智能体受益于终身学习能力。然而,与之兼容的训练算法必须符合这些系统的分散特性,这对参数数量和计算资源都施加了限制。前向-前向(FF)算法就是其中之一。FF仅依靠用于推理的前馈操作来优化逐层目标。这种纯粹的前向方法消除了传统反向传播中所需的转置操作。尽管FF有其潜力,但在大多数标准基准任务上,它未能达到当前的最优性能,部分原因是无监督学习中不可靠的负数据生成方法。在这项工作中,我们提出了自对比前向-前向(SCFF)算法,这是一种旨在弥合这种性能差距的竞争性训练方法。受视觉任务中标准的自监督对比学习启发,SCFF生成适用于各种数据集的正输入和负输入。与现有的无监督局部学习算法相比,该方法在包括MNIST、CIFAR-10、STL-10和Tiny ImageNet在内的几个基准数据集上表现出卓越的性能。我们将FF的应用扩展到训练循环神经网络,将其效用扩展到序列数据任务。这些发现为在资源受限的边缘设备上进行高精度实时学习铺平了道路。