Department of Information and Computer Science, Aalto University School of Science, Espoo, Uusimaa 02150, Finland.
Neural Comput. 2013 Mar;25(3):805-31. doi: 10.1162/NECO_a_00397. Epub 2012 Nov 13.
Restricted Boltzmann machines (RBMs) are often used as building blocks in greedy learning of deep networks. However, training this simple model can be laborious. Traditional learning algorithms often converge only with the right choice of metaparameters that specify, for example, learning rate scheduling and the scale of the initial weights. They are also sensitive to specific data representation. An equivalent RBM can be obtained by flipping some bits and changing the weights and biases accordingly, but traditional learning rules are not invariant to such transformations. Without careful tuning of these training settings, traditional algorithms can easily get stuck or even diverge. In this letter, we present an enhanced gradient that is derived to be invariant to bit-flipping transformations. We experimentally show that the enhanced gradient yields more stable training of RBMs both when used with a fixed learning rate and an adaptive one.
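To make the bit-flipping equivalence concrete, below is a minimal NumPy sketch (not from the letter; the tiny RBM size, the variable names, and the check by exact enumeration are illustrative assumptions). It flips one visible unit, negates the corresponding weight row and visible bias, folds that row into the hidden biases, and verifies that the two parameterizations define the same distribution up to relabelling of the flipped bit.

```python
# Minimal sketch (illustrative, not from the letter): an RBM and its
# bit-flipped reparameterization define the same distribution P(v).
import itertools
import numpy as np

rng = np.random.default_rng(0)
n_v, n_h, k = 3, 2, 1             # RBM small enough to enumerate; flip visible unit k

W = rng.normal(size=(n_v, n_h))   # weights
b = rng.normal(size=n_v)          # visible biases
c = rng.normal(size=n_h)          # hidden biases

def exact_pv(W, b, c):
    """Exact marginal P(v) of a binary RBM, summing exp(-E(v, h)) over h."""
    vs = np.array(list(itertools.product([0, 1], repeat=n_v)))
    hs = np.array(list(itertools.product([0, 1], repeat=n_h)))
    log_unnorm = vs @ W @ hs.T + (vs @ b)[:, None] + (hs @ c)[None, :]
    p = np.exp(log_unnorm).sum(axis=1)
    return p / p.sum(), vs

# Flip visible unit k (v_k -> 1 - v_k): the equivalent parameters are
# obtained by folding row k of W into the hidden biases and negating
# that row and the corresponding visible bias.
W2, b2, c2 = W.copy(), b.copy(), c.copy()
c2 += W[k]
W2[k] = -W[k]
b2[k] = -b[k]

p_orig, vs = exact_pv(W, b, c)
p_flip, _ = exact_pv(W2, b2, c2)

# Compare: P_orig(v) should equal P_flip(v with bit k flipped).
flipped = vs.copy()
flipped[:, k] = 1 - flipped[:, k]
index = {tuple(v): i for i, v in enumerate(vs)}
p_flip_relabelled = p_flip[[index[tuple(v)] for v in flipped]]

print(np.allclose(p_orig, p_flip_relabelled))   # -> True
```

Note that under such a relabelling the raw statistics <v_i h_j> used by the traditional update rules change, which is why those rules are not invariant to the transformation, whereas the enhanced gradient is derived to be.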