Huang Feihu, Gao Shangqian, Pei Jian, Huang Heng
IEEE Trans Pattern Anal Mach Intell. 2024 Aug 14;PP. doi: 10.1109/TPAMI.2023.3347082.
Zeroth-order (a.k.a, derivative-free) methods are a class of effective optimization methods for solving complex machine learning problems, where gradients of the objective functions are not available or computationally prohibitive. Recently, although many zeroth-order methods have been developed, these approaches still have two main drawbacks: 1) high function query complexity; 2) not being well suitable for solving the problems with complex penalties and constraints. To address these challenging drawbacks, in this paper, we propose a class of faster zeroth-order stochastic alternating direction method of multipliers (ADMM) methods (ZO-SPIDER-ADMM) to solve the nonconvex finite-sum problems with multiple nonsmooth penalties. Moreover, we prove that the ZO-SPIDER-ADMM methods can achieve a lower function query complexity of [Formula: see text] for finding an ϵ-stationary point, which improves the existing best nonconvex zeroth-order ADMM methods by a factor of [Formula: see text], where n and d denote the sample size and data dimension, respectively. At the same time, we propose a class of faster zeroth-order online ADMM methods (ZOO-ADMM+) to solve the nonconvex online problems with multiple nonsmooth penalties. We also prove that the proposed ZOO-ADMM+ methods achieve a lower function query complexity of [Formula: see text], which improves the existing best result by a factor of [Formula: see text]. Extensive experimental results on the structure adversarial attack on black-box deep neural networks demonstrate the efficiency of our new algorithms.
零阶(又名无导数)方法是一类用于解决复杂机器学习问题的有效优化方法,这类问题中目标函数的梯度不可得或计算成本过高。最近,尽管已经开发了许多零阶方法,但这些方法仍然存在两个主要缺点:1)函数查询复杂度高;2)不太适合解决具有复杂惩罚和约束的问题。为了解决这些具有挑战性的缺点,在本文中,我们提出了一类更快的零阶随机交替方向乘子法(ADMM)方法(ZO-SPIDER-ADMM)来解决具有多个非光滑惩罚项的非凸有限和问题。此外,我们证明了ZO-SPIDER-ADMM方法在寻找一个ϵ-驻点时可以实现更低的函数查询复杂度[公式:见原文],这比现有的最佳非凸零阶ADMM方法提高了[公式:见原文]倍,其中n和d分别表示样本大小和数据维度。同时,我们提出了一类更快的零阶在线ADMM方法(ZOO-ADMM+)来解决具有多个非光滑惩罚项的非凸在线问题。我们还证明了所提出的ZOO-ADMM+方法实现了更低的函数查询复杂度[公式:见原文],这比现有最佳结果提高了[公式:见原文]倍。在对黑箱深度神经网络的结构对抗攻击上的大量实验结果证明了我们新算法的有效性。