Singh Parantak, Li You, Sikarwar Ankur, Lei Weixian, Gao Difei, Talbot Morgan B, Sun Ying, Shou Mike Zheng, Kreiman Gabriel, Zhang Mengmi
Nanyang Technological University (NTU), Singapore.
CFAR and I2R, Agency for Science, Technology and Research, Singapore.
IEEE Int Conf Comput Vis Workshops. 2023 Oct;2023:11674-11685. doi: 10.1109/iccv51070.2023.01075. Epub 2024 Jan 15.
Curriculum design is a fundamental component of education. For example, when we learn mathematics at school, we build upon our knowledge of addition to learn multiplication. These and other concepts must be mastered before our first algebra lesson, which also reinforces our addition and multiplication skills. Designing a curriculum for teaching either a human or a machine shares the underlying goal of maximizing knowledge transfer from earlier to later tasks, while also minimizing forgetting of learned tasks. Prior research on curriculum design for image classification focuses on the ordering of training examples during a single offline task. Here, we investigate the effect of the order in which multiple distinct tasks are learned in a sequence. We focus on the online class-incremental continual learning setting, where algorithms or humans must learn image classes one at a time during a single pass through a dataset. We find that curriculum consistently influences learning outcomes for humans and for multiple continual machine learning algorithms across several benchmark datasets. We introduce a novel-object recognition dataset for human curriculum learning experiments and observe that curricula that are effective for humans are highly correlated with those that are effective for machines. As an initial step towards automated curriculum design for online class-incremental learning, we propose a novel algorithm, dubbed Curriculum Designer (CD), that designs and ranks curricula based on inter-class feature similarities. We find significant overlap between curricula that are empirically highly effective and those that are highly ranked by our CD. Our study establishes a framework for further research on teaching humans and machines to learn continuously using optimized curricula. Our code and data are available through this link.
课程设计是教育的一个基本组成部分。例如,当我们在学校学习数学时,我们在加法知识的基础上学习乘法。在我们上第一节代数课之前,必须掌握这些以及其他概念,而代数课也会强化我们的加法和乘法技能。设计针对人类或机器的教学课程有着共同的基本目标,即最大限度地将知识从早期任务转移到后期任务,同时尽量减少对所学任务的遗忘。先前关于图像分类课程设计的研究集中在单个离线任务中训练示例的排序上。在这里,我们研究依次学习多个不同任务的顺序所产生的影响。我们关注在线类增量持续学习设置,即在单次遍历数据集的过程中,算法或人类必须一次学习一个图像类别。我们发现,在几个基准数据集上,课程始终会影响人类以及多种持续机器学习算法的学习成果。我们引入了一个用于人类课程学习实验的新颖目标识别数据集,并观察到对人类有效的课程与对机器有效的课程高度相关。作为在线类增量学习自动化课程设计的第一步,我们提出了一种新颖的算法,称为课程设计器(CD),它基于类间特征相似度来设计课程并进行排序。我们发现,经验上非常有效的课程与我们的CD高度排序的课程之间存在显著重叠。我们的研究为进一步研究如何使用优化课程来教导人类和机器持续学习建立了一个框架。我们的代码和数据可通过此链接获取。