Sampling with flows, diffusion, and autoregressive neural networks from a spin-glass perspective.

Author information

Ghio Davide, Dandi Yatin, Krzakala Florent, Zdeborová Lenka

Affiliations

Information, Learning and Physics Laboratory, École Polytechnique Fédérale de Lausanne, Lausanne CH-1015, Switzerland.

Statistical Physics of Computation Laboratory, École Polytechnique Fédérale de Lausanne, Lausanne CH-1015, Switzerland.

Publication information

Proc Natl Acad Sci U S A. 2024 Jul 2;121(27):e2311810121. doi: 10.1073/pnas.2311810121. Epub 2024 Jun 24.

DOI:10.1073/pnas.2311810121

PMID:38913892

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC11228464/

Abstract

Recent years witnessed the development of powerful generative models based on flows, diffusion, or autoregressive neural networks, achieving remarkable success in generating data from examples with applications in a broad range of areas. A theoretical analysis of the performance and understanding of the limitations of these methods remain, however, challenging. In this paper, we undertake a step in this direction by analyzing the efficiency of sampling by these methods on a class of problems with a known probability distribution and comparing it with the sampling performance of more traditional methods such as the Monte Carlo Markov chain and Langevin dynamics. We focus on a class of probability distribution widely studied in the statistical physics of disordered systems that relate to spin glasses, statistical inference, and constraint satisfaction problems. We leverage the fact that sampling via flow-based, diffusion-based, or autoregressive networks methods can be equivalently mapped to the analysis of a Bayes optimal denoising of a modified probability measure. Our findings demonstrate that these methods encounter difficulties in sampling stemming from the presence of a first-order phase transition along the algorithm's denoising path. Our conclusions go both ways: We identify regions of parameters where these methods are unable to sample efficiently, while that is possible using standard Monte Carlo or Langevin approaches. We also identify regions where the opposite happens: standard approaches are inefficient while the discussed generative methods work well.
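
The mapping from sampling to Bayes-optimal denoising mentioned in the abstract can be illustrated on a toy case. The following is a minimal sketch, not the authors' code: it assumes a single spin x in {-1, +1} with P(x = +1) = p_plus and a stochastic-localization-type process dy = E[x | y] dt + dW, whose drift is the exact posterior mean tanh(y + h) with h = atanh(2 p_plus - 1). The function name, parameters, and the choice of this particular process are illustrative assumptions. The toy target has no phase transition, so it does not show the hardness the paper analyzes.

```python
import numpy as np

def sample_by_denoising(p_plus, n_samples=20000, t_max=20.0, dt=0.01, seed=0):
    """Draw spins in {-1, +1} with P(+1) = p_plus using a posterior-mean drift.

    Hypothetical helper for illustration: Euler-Maruyama integration of
    dy = E[x | y] dt + dW, started at y = 0.
    """
    rng = np.random.default_rng(seed)
    h = np.arctanh(2.0 * p_plus - 1.0)   # field encoding the prior bias
    y = np.zeros(n_samples)              # one independent path per sample
    n_steps = int(round(t_max / dt))
    for _ in range(n_steps):
        m = np.tanh(y + h)               # exact Bayes-optimal denoiser E[x | y]
        y += m * dt + np.sqrt(dt) * rng.standard_normal(n_samples)
    return np.sign(y)                    # for large t, y / t concentrates on x

samples = sample_by_denoising(0.7)
frac_plus = float(np.mean(samples > 0))
print(frac_plus)
```

In this sketch the exact denoiser plays the role that a trained network would play in practice. The fraction of +1 outcomes should be close to p_plus, up to Monte Carlo and time-discretization error.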
