Department of Statistics and Computer Science, University of Toronto, Toronto, ON M5S 3G3, Canada.
IEEE Trans Pattern Anal Mach Intell. 2013 Aug;35(8):1958-71. doi: 10.1109/TPAMI.2012.269.
We introduce HD (or “Hierarchical-Deep”) models, a new compositional learning architecture that integrates deep learning models with structured hierarchical Bayesian (HB) models. Specifically, we show how we can learn a hierarchical Dirichlet process (HDP) prior over the activities of the top-level features in a deep Boltzmann machine (DBM). This compound HDP-DBM model learns to learn novel concepts from very few training example by learning low-level generic features, high-level features that capture correlations among low-level features, and a category hierarchy for sharing priors over the high-level features that are typical of different kinds of concepts. We present efficient learning and inference algorithms for the HDP-DBM model and show that it is able to learn new concepts from very few examples on CIFAR-100 object recognition, handwritten character recognition, and human motion capture datasets.
我们介绍了 HD(或“层次深”)模型,这是一种新的组合学习架构,将深度学习模型与结构化层次贝叶斯(HB)模型集成在一起。具体来说,我们展示了如何在深度玻尔兹曼机(DBM)的顶级特征的活动中学习层次狄利克雷过程(HDP)先验。这个复合 HDP-DBM 模型通过学习低级通用特征、捕捉低级特征之间相关性的高级特征以及用于共享高级特征先验的类别层次结构来学习从极少数训练示例中学习新的概念,这些高级特征是不同种类概念的典型特征。我们提出了用于 HDP-DBM 模型的高效学习和推理算法,并表明它能够在 CIFAR-100 目标识别、手写字符识别和人体运动捕获数据集上从极少数示例中学习新的概念。