College of Systems Engineering, National University of Defense Technology, 410073, Changsha, China.
Department of Physics, Washington University in St. Louis, Campus Box 1105, 1 Brookings Drive, St. Louis, MO, 63130, USA.
Nat Commun. 2023 Feb 9;14(1):725. doi: 10.1038/s41467-023-36363-w.
Spin glasses are disordered magnets with random interactions that are, generally, in conflict with each other. Finding the ground states of spin glasses is not only essential for understanding the nature of disordered magnets and many other physical systems, but also useful to solve a broad array of hard combinatorial optimization problems across multiple disciplines. Despite decades-long efforts, an algorithm with both high accuracy and high efficiency is still lacking. Here we introduce DIRAC - a deep reinforcement learning framework, which can be trained purely on small-scale spin glass instances and then applied to arbitrarily large ones. DIRAC displays better scalability than other methods and can be leveraged to enhance any thermal annealing method. Extensive calculations on 2D, 3D and 4D Edwards-Anderson spin glass instances demonstrate the superior performance of DIRAC over existing methods. The presented framework will help us better understand the nature of the low-temperature spin-glass phase, which is a fundamental challenge in statistical physics. Moreover, the gauge transformation technique adopted in DIRAC builds a deep connection between physics and artificial intelligence. In particular, this opens up a promising avenue for reinforcement learning models to explore in the enormous configuration space, which would be extremely helpful to solve many other hard combinatorial optimization problems.
自旋玻璃是无序磁体,其随机相互作用通常相互冲突。找到自旋玻璃的基态不仅对于理解无序磁体和许多其他物理系统的性质至关重要,而且对于解决多个学科中广泛的硬组合优化问题也很有用。尽管经过了数十年的努力,但仍然缺乏一种具有高精度和高效率的算法。在这里,我们介绍了 DIRAC——一种深度强化学习框架,它可以仅在小规模自旋玻璃实例上进行训练,然后应用于任意大规模实例。DIRAC 显示出比其他方法更好的可扩展性,并可用于增强任何热退火方法。在 2D、3D 和 4D Edwards-Anderson 自旋玻璃实例上的大量计算表明,DIRAC 优于现有方法。所提出的框架将帮助我们更好地理解低温自旋玻璃相的性质,这是统计物理学中的一个基本挑战。此外,DIRAC 中采用的规范变换技术在物理和人工智能之间建立了深刻的联系。特别是,这为强化学习模型在巨大的配置空间中探索开辟了一条有前途的途径,这对于解决许多其他硬组合优化问题将非常有帮助。