Chen Hao, Barthel Thomas
IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):7825-7832. doi: 10.1109/TPAMI.2024.3396386. Epub 2024 Nov 6.
Tensor networks developed in the context of condensed matter physics try to approximate order- N tensors with a reduced number of degrees of freedom that is only polynomial in N and arranged as a network of partially contracted smaller tensors. As we have recently demonstrated in the context of quantum many-body physics, computation costs can be further substantially reduced by imposing constraints on the canonical polyadic (CP) rank of the tensors in such networks. Here, we demonstrate how tree tensor networks (TTN) with CP rank constraints and tensor dropout can be used in machine learning. The approach is found to outperform other tensor-network-based methods in Fashion-MNIST image classification. A low-rank TTN classifier with branching ratio b=4 reaches a test set accuracy of 90.3% with low computation costs. Consisting of mostly linear elements, tensor network classifiers avoid the vanishing gradient problem of deep neural networks. The CP rank constraints have additional advantages: The number of parameters can be decreased and tuned more freely to control overfitting, improve generalization properties, and reduce computation costs. They allow us to employ trees with large branching ratios, substantially improving the representation power.
在凝聚态物理背景下发展起来的张量网络试图用自由度数量减少的情况来近似 N 阶张量,该自由度数量仅是 N 的多项式形式,并排列成由部分收缩的较小张量组成的网络。正如我们最近在量子多体物理背景下所证明的,通过对这类网络中张量的典范多adic(CP)秩施加约束,计算成本可以进一步大幅降低。在这里,我们展示了具有 CP 秩约束和张量随机失活的树张量网络(TTN)如何用于机器学习。结果发现,该方法在时尚 MNIST 图像分类中优于其他基于张量网络的方法。具有分支比 b = 4 的低秩 TTN 分类器以低计算成本达到了 90.3%的测试集准确率。张量网络分类器主要由线性元素组成,避免了深度神经网络的梯度消失问题。CP 秩约束还有其他优点:参数数量可以减少并更自由地调整,以控制过拟合、改善泛化特性并降低计算成本。它们使我们能够采用具有大分支比的树,从而大幅提高表示能力。