Faculty of Science, Radboud University, Nijmegen, The Netherlands.
Trinity College Institute of Neuroscience, Trinity College Dublin, Dublin, Ireland.
PLoS One. 2024 Apr 16;19(4):e0290706. doi: 10.1371/journal.pone.0290706. eCollection 2024.
In many applications, artificial neural networks are best trained for a task by following a curriculum, in which simpler concepts are learned before more complex ones. This curriculum can be hand-crafted by the engineer or optimised like other hyperparameters, by evaluating many curricula. However, this is computationally intensive and the hyperparameters are unlikely to generalise to new datasets. An attractive alternative, demonstrated in influential prior works, is that the network could choose its own curriculum by monitoring its learning. This would be particularly beneficial for continual learning, in which the network must learn from an environment that is changing over time, relevant both to practical applications and in the modelling of human development. In this paper we test the generality of this approach using a proof-of-principle model, training a network on two sequential tasks under static and continual conditions, and investigating both the benefits of a curriculum and the handicap induced by continuous learning. Additionally, we test a variety of prior task-switching metrics, and find that in some cases even in this simple scenario the a network is often unable to choose the optimal curriculum, as the benefits are sometimes only apparent with hindsight, at the end of training. We discuss the implications of the results for network engineering and models of human development.
在许多应用中,通过遵循课程来训练人工神经网络以完成任务效果最佳,在此过程中,先学习简单的概念,然后再学习更复杂的概念。课程可以由工程师手工设计,也可以像其他超参数一样进行优化,通过评估许多课程来实现。然而,这种方法计算量很大,超参数不太可能推广到新的数据集。一个有吸引力的替代方案是,网络可以通过监控其学习过程来选择自己的课程,这在持续学习中尤其有益,因为网络必须从随时间变化的环境中学习,这与实际应用和人类发展的建模都相关。在本文中,我们使用一个原理验证模型来测试这种方法的通用性,该模型在静态和持续条件下对两个连续任务进行训练,并研究了课程的好处和连续学习带来的障碍。此外,我们还测试了各种先前的任务切换指标,发现即使在这种简单的情况下,网络也常常无法选择最佳课程,因为在训练结束时,收益有时只有事后才能看出来。我们讨论了这些结果对网络工程和人类发展模型的影响。