Department of Cognitive and Neural Systems, Boston University, Boston, Massachusetts 02215, USA.
Neural Netw. 2010 Mar;23(2):265-82. doi: 10.1016/j.neunet.2009.07.026. Epub 2009 Jul 23.
Computational models of learning typically train on labeled input patterns (supervised learning), unlabeled input patterns (unsupervised learning), or a combination of the two (semi-supervised learning). In each case input patterns have a fixed number of features throughout training and testing. Human and machine learning contexts present additional opportunities for expanding incomplete knowledge from formal training, via self-directed learning that incorporates features not previously experienced. This article defines a new self-supervised learning paradigm to address these richer learning contexts, introducing a neural network called self-supervised ARTMAP. Self-supervised learning integrates knowledge from a teacher (labeled patterns with some features), knowledge from the environment (unlabeled patterns with more features), and knowledge from internal model activation (self-labeled patterns). Self-supervised ARTMAP learns about novel features from unlabeled patterns without destroying partial knowledge previously acquired from labeled patterns. A category selection function bases system predictions on known features, and distributed network activation scales unlabeled learning to prediction confidence. Slow distributed learning on unlabeled patterns focuses on novel features and confident predictions, defining classification boundaries that were ambiguous in the labeled patterns. Self-supervised ARTMAP improves test accuracy on illustrative low-dimensional problems and on high-dimensional benchmarks. Model code and benchmark data are available from: http://techlab.eu.edu/SSART/.
学习的计算模型通常在有标签的输入模式(监督学习)、无标签的输入模式(无监督学习)或两者的组合(半监督学习)上进行训练。在每种情况下,输入模式在整个训练和测试过程中都具有固定数量的特征。人类和机器学习环境提供了从正式培训中扩展不完整知识的额外机会,通过自我指导的学习来整合以前没有经验过的特征。本文定义了一种新的自监督学习范例来解决这些更丰富的学习环境问题,引入了一种称为自监督 ARTMAP 的神经网络。自监督学习整合了来自教师的知识(带有一些特征的有标签模式)、来自环境的知识(带有更多特征的无标签模式)和来自内部模型激活的知识(自我标记的模式)。自监督 ARTMAP 可以从无标签模式中学习新特征,而不会破坏以前从有标签模式中获得的部分知识。类别选择函数基于已知特征对系统预测进行预测,分布式网络激活将无标签学习扩展到预测置信度。无标签模式上的缓慢分布式学习专注于新特征和有信心的预测,从而定义了在有标签模式中不明确的分类边界。自监督 ARTMAP 提高了在说明性低维问题和高维基准上的测试准确性。模型代码和基准数据可从以下网址获得:http://techlab.eu.edu/SSART/。