Convy Ian, Huggins William, Liao Haoran, Whaley K Birgitta
Department of Chemistry, University of California, Berkeley, CA 94720, USA.
Berkeley Quantum Information and Computation Center, University of California, Berkeley, CA 94720, USA.
Mach Learn Sci Technol. 2022 Mar;3(1). doi: 10.1088/2632-2153/ac44a9. Epub 2022 Jan 20.
Tensor networks have emerged as promising tools for machine learning, inspired by their widespread use as variational ansatze in quantum many-body physics. It is well known that the success of a given tensor network ansatz depends in part on how well it can reproduce the underlying entanglement structure of the target state, with different network designs favoring different scaling patterns. We demonstrate here how a related correlation analysis can be applied to tensor network machine learning, and explore whether classical data possess correlation scaling patterns similar to those found in quantum states which might indicate the best network to use for a given dataset. We utilize mutual information as measure of correlations in classical data, and show that it can serve as a lower-bound on the entanglement needed for a probabilistic tensor network classifier. We then develop a logistic regression algorithm to estimate the mutual information between bipartitions of data features, and verify its accuracy on a set of Gaussian distributions designed to mimic different correlation patterns. Using this algorithm, we characterize the scaling patterns in the MNIST and Tiny Images datasets, and find clear evidence of boundary-law scaling in the latter. This quantum-inspired classical analysis offers insight into the design of tensor networks which are best suited for specific learning tasks.
受张量网络在量子多体物理中作为变分假设的广泛应用启发,它已成为机器学习中颇具前景的工具。众所周知,给定张量网络假设的成功部分取决于它能够多好地再现目标状态的潜在纠缠结构,不同的网络设计有利于不同的缩放模式。我们在此展示如何将相关的相关性分析应用于张量网络机器学习,并探索经典数据是否具有与量子态中发现的相似的相关性缩放模式,这可能表明针对给定数据集应使用的最佳网络。我们利用互信息作为经典数据中相关性的度量,并表明它可以作为概率张量网络分类器所需纠缠的下限。然后,我们开发了一种逻辑回归算法来估计数据特征二分之间的互信息,并在一组旨在模拟不同相关模式的高斯分布上验证其准确性。使用该算法,我们刻画了MNIST和Tiny Images数据集中的缩放模式,并在后一个数据集中发现了边界律缩放的明确证据。这种受量子启发的经典分析为最适合特定学习任务的张量网络设计提供了见解。