An Dongsheng, Lei Na, Xu Xiaoyin, Gu Xianfeng
Stony Brook University, NY, USA.
Dalian University of Technology, Liaoning, China.
Proc AAAI Conf Artif Intell. 2022 Jun;36(9):10119-10128. doi: 10.1609/aaai.v36i9.21251.
Optimal transport (OT) plays an essential role in various areas like machine learning and deep learning. However, computing discrete OT for large scale problems with adequate accuracy and efficiency is highly challenging. Recently, methods based on the Sinkhorn algorithm add an entropy regularizer to the prime problem and obtain a trade off between efficiency and accuracy. In this paper, we propose a novel algorithm based on Nesterov's smoothing technique to further improve the efficiency and accuracy in computing OT. Basically, the non-smooth c-transform of the Kantorovich potential is approximated by the smooth Log-Sum-Exp function, which smooths the original non-smooth Kantorovich dual functional. The smooth Kantorovich functional can be efficiently optimized by a fast proximal gradient method, the fast iterative shrinkage thresholding algorithm (FISTA). Theoretically, the computational complexity of the proposed method is given by , which is lower than current estimation of the Sinkhorn algorithm. Experimentally, compared with the Sinkhorn algorithm, our results demonstrate that the proposed method achieves faster convergence and better accuracy with the same parameter.
最优传输(OT)在机器学习和深度学习等各个领域发挥着重要作用。然而,以足够的精度和效率计算大规模问题的离散OT极具挑战性。最近,基于Sinkhorn算法的方法在原始问题中添加了一个熵正则化项,并在效率和精度之间取得了平衡。在本文中,我们提出了一种基于Nesterov平滑技术的新算法,以进一步提高计算OT时的效率和精度。基本上,通过光滑的对数和指数函数逼近康德罗维奇势的非光滑c变换,从而使原始的非光滑康德罗维奇对偶泛函变得平滑。光滑的康德罗维奇泛函可以通过快速近端梯度法,即快速迭代收缩阈值算法(FISTA)进行有效优化。从理论上讲,所提方法的计算复杂度为 ,低于目前对Sinkhorn算法的估计。在实验上,与Sinkhorn算法相比,我们的结果表明所提方法在相同参数下收敛更快且精度更高。