Liu Liu, Ji Liu, Dacheng Tao
IEEE Trans Pattern Anal Mach Intell. 2022 Sep;44(9):5813-5825. doi: 10.1109/TPAMI.2021.3071594. Epub 2022 Aug 4.
This paper explores non-convex composition optimization in which both the inner and outer functions are finite sums over a large number of component functions. This problem arises in important applications such as nonlinear embedding and reinforcement learning. Although existing approaches such as stochastic gradient descent (SGD) and stochastic variance reduced gradient (SVRG) can be applied to this problem, their query complexities tend to be high, especially when the number of inner component functions is large. To significantly improve on the query complexity of these approaches, we devise stochastic composition via variance reduction (SCVR). Moreover, we analyze its query complexity under different numbers of inner and outer component functions. Based on a different estimator of the inner component functions, we also present the SCVRII algorithm, whose query complexity is of the same order as that of SCVR. Additionally, we propose an extension that handles the mini-batch case and improves the query complexity under the optimal mini-batch size. Experimental results validate the proposed algorithms and theoretical analyses.
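To make the setting concrete, the problem is of the form min_x f(g(x)) with g(x) = (1/m) sum_j g_j(x) and f(w) = (1/n) sum_i f_i(w). The sketch below illustrates one standard SVRG-style variance-reduced gradient estimator for this composition structure on a synthetic quadratic problem; the toy functions, variable names, step size, and epoch schedule are illustrative assumptions of mine, not the paper's SCVR specification.

import numpy as np

rng = np.random.default_rng(0)
m, n, d, p = 50, 50, 10, 5  # inner/outer component counts, input/inner dims

# Toy components: inner g_j(x) = A_j @ x + b_j, outer f_i(w) = 0.5*||w - c_i||^2.
A = rng.standard_normal((m, p, d)) / np.sqrt(d)
b = rng.standard_normal((m, p))
c = rng.standard_normal((n, p))

def g(x, j):       # inner component value
    return A[j] @ x + b[j]

def Jg(x, j):      # inner component Jacobian (constant for this toy problem)
    return A[j]

def grad_f(w, i):  # gradient of an outer component
    return w - c[i]

def full_grad(x):  # exact gradient of F(x) = f(g(x)); evaluated only at snapshots
    gbar = np.mean([g(x, j) for j in range(m)], axis=0)
    Jbar = np.mean([Jg(x, j) for j in range(m)], axis=0)
    return Jbar.T @ np.mean([grad_f(gbar, i) for i in range(n)], axis=0), gbar

x = rng.standard_normal(d)
eta, epochs, inner_iters = 0.1, 30, 2 * m

for s in range(epochs):
    # Snapshot: full inner function value and full composition gradient.
    x_tilde = x.copy()
    mu, g_tilde = full_grad(x_tilde)
    for t in range(inner_iters):
        j = rng.integers(m)
        i = rng.integers(n)
        # Variance-reduced estimate of the inner value g(x).
        g_hat = g(x, j) - g(x_tilde, j) + g_tilde
        # SVRG-style control variate for the composition gradient:
        # the subtracted term has expectation mu over (j, i).
        v = Jg(x, j).T @ grad_f(g_hat, i) \
            - Jg(x_tilde, j).T @ grad_f(g_tilde, i) + mu
        x -= eta * v
    print(f"epoch {s:2d}  ||grad F|| = {np.linalg.norm(full_grad(x)[0]):.3e}")

Caching g_j(x_tilde) and the snapshot gradient mu is what lets each inner iteration query only O(1) component functions, which is the source of the query-complexity savings the abstract describes.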