Institute of Applied Computer Science, Jagiellonian University, ul. Łojasiewicza 11, 30-348 Kraków, Poland.
Institute of Theoretical Physics, Jagiellonian University, ul. Łojasiewicza 11, 30-348 Kraków, Poland.
Phys Rev E. 2023 Jan;107(1-2):015303. doi: 10.1103/PhysRevE.107.015303.
We provide a deepened study of autocorrelations in neural Markov chain Monte Carlo (NMCMC) simulations, a version of the traditional Metropolis algorithm which employs neural networks to provide independent proposals. We illustrate our ideas using the two-dimensional Ising model. We discuss several estimates of autocorrelation times in the context of NMCMC, some inspired by analytical results derived for the Metropolized independent sampler (MIS). We check their reliability by estimating them on a small system where analytical results can also be obtained. Based on the analytical results for MIS, we propose a loss function and study its impact on the autocorrelation times. Although, this function's performance is a bit inferior to the traditional Kullback-Leibler divergence, it offers two training algorithms which in some situations may be beneficial. By studying a small 4×4 system, we gain access to the dynamics of the training process, which we visualize using several observables. Furthermore, we quantitatively investigate the impact of imposing global discrete symmetries of the system in the neural network training process on the autocorrelation times. Eventually, we propose a scheme which incorporates partial heat-bath updates, which considerably improves the quality of the training. The impact of the above enhancements is discussed for a 16×16 spin system. The summary of our findings may serve as guidance to the implementation of NMCMC simulations for more complicated models.
我们对神经马尔可夫链蒙特卡罗(NMCMC)模拟中的自相关进行了深入研究,这是一种传统的 Metropolis 算法的变体,它使用神经网络来提供独立的提案。我们使用二维伊辛模型来说明我们的想法。我们在 NMCMC 的背景下讨论了几种自相关时间的估计方法,其中一些方法是受针对 Metropolized independent sampler(MIS)得出的分析结果启发的。我们在一个可以获得分析结果的小系统上对它们进行了估计,以检查它们的可靠性。基于 MIS 的分析结果,我们提出了一个损失函数,并研究了它对自相关时间的影响。虽然这个函数的性能略逊于传统的 Kullback-Leibler 散度,但它提供了两种训练算法,在某些情况下可能会有所帮助。通过研究一个小的 4×4 系统,我们可以深入了解训练过程的动态,并使用几个可观测量对其进行可视化。此外,我们还定量研究了在神经网络训练过程中对系统施加全局离散对称性对自相关时间的影响。最终,我们提出了一种方案,其中包含部分热浴更新,这大大提高了训练的质量。我们讨论了上述增强对 16×16 自旋系统的影响。我们的研究结果总结可为更复杂模型的 NMCMC 模拟的实施提供指导。