Wang Fangyikang, Zhu Huminhao, Zhang Chao, Zhao Hanbin, Qian Hui
College of Computer Science and Technology, Zhejiang University, Hangzhou 310058, China.
Entropy (Basel). 2024 Aug 11;26(8):679. doi: 10.3390/e26080679.
Particle-based Variational Inference (ParVI) methods have been widely adopted in deep Bayesian inference tasks such as Bayesian neural networks or Gaussian Processes, owing to their efficiency in generating high-quality samples given the score of the target distribution. Typically, ParVI methods evolve a weighted-particle system by approximating the first-order Wasserstein gradient flow to reduce the dissimilarity between the particle system's empirical distribution and the target distribution. Recent advancements in ParVI have explored sophisticated gradient flows to obtain refined particle systems with either accelerated position updates or dynamic weight adjustments. In this paper, we introduce the semi-Hamiltonian gradient flow on a novel Information-Fisher-Rao space, known as the SHIFR flow, and propose the first ParVI framework that possesses both accelerated position update and dynamical weight adjustment simultaneously, named the General Accelerated Dynamic-Weight Particle-based Variational Inference (GAD-PVI) framework. GAD-PVI is compatible with different dissimilarities between the empirical distribution and the target distribution, as well as different approximation approaches to gradient flow. Moreover, when the appropriate dissimilarity is selected, GAD-PVI is also suitable for obtaining high-quality samples even when analytical scores cannot be obtained. Experiments conducted under both the score-based tasks and sample-based tasks demonstrate the faster convergence and reduced approximation error of GAD-PVI methods over the state-of-the-art.
基于粒子的变分推理(ParVI)方法已在深度贝叶斯推理任务(如贝叶斯神经网络或高斯过程)中被广泛采用,这是因为在给定目标分布得分的情况下,它们在生成高质量样本方面具有效率。通常,ParVI方法通过逼近一阶瓦瑟斯坦梯度流来演化加权粒子系统,以减少粒子系统的经验分布与目标分布之间的差异。ParVI的最新进展探索了复杂的梯度流,以获得具有加速位置更新或动态权重调整的精细粒子系统。在本文中,我们在一个名为SHIFR流的新型信息-费希尔-拉奥空间上引入半哈密顿梯度流,并提出了第一个同时具备加速位置更新和动态权重调整的ParVI框架,即广义加速动态权重基于粒子的变分推理(GAD-PVI)框架。GAD-PVI与经验分布和目标分布之间的不同差异以及梯度流的不同近似方法兼容。此外,当选择合适的差异时,即使无法获得解析得分,GAD-PVI也适用于获得高质量样本。在基于得分的任务和基于样本的任务下进行的实验表明,GAD-PVI方法比现有技术具有更快的收敛速度和更低的近似误差。