Shien-Ming Wu School of Intelligent Engineering, South China University of Technology, Guangzhou, 510640, China.
Shien-Ming Wu School of Intelligent Engineering, South China University of Technology, Guangzhou, 510640, China.
Neural Netw. 2020 Aug;128:107-125. doi: 10.1016/j.neunet.2020.04.029. Epub 2020 May 4.
As a major step forward in machine learning, generative adversarial networks (GANs) employ the Wasserstein distance as a metric between the generative distribution and target data distribution, and thus can be viewed as optimal transport (OT) problems to reflect the underlying geometry of the probability distribution. However, the unequal dimensions between the source random distribution and the target data, result in often instability in the training processes, and lack of diversity in the generative images. To resolve the challenges, we propose here a multiple-projection approach, to project the source and target probability measures into multiple different low-dimensional subspaces. Moreover, we show that the original problem can be transformed into a variant multi-marginal OT problem, and we provide the explicit properties of the solutions. In addition, we employ parameterized approximation for the objective, and study the corresponding differentiability and convergence properties, ensuring that the problem can indeed be computed.
作为机器学习的重大进展,生成对抗网络(GAN)使用 Wasserstein 距离作为生成分布和目标数据分布之间的度量,因此可以将其视为最优传输(OT)问题,以反映概率分布的潜在几何结构。然而,源随机分布和目标数据之间的维度不相等,导致训练过程中经常不稳定,生成图像缺乏多样性。为了解决这些挑战,我们提出了一种多投影方法,将源和目标概率测度投影到多个不同的低维子空间中。此外,我们表明原问题可以转化为一个变体多边际最优传输问题,并给出了解的显式性质。此外,我们还对目标函数进行了参数化逼近,并研究了相应的可微性和收敛性,确保了问题确实可以被计算。