Gong Pinghua, Zhang Changshui, Lu Zhaosong, Huang Jianhua Z, Ye Jieping
State Key Laboratory on Intelligent Technology and Systems, Tsinghua National Laboratory for Information Science and Technology (TNList), Department of Automation, Tsinghua University, Beijing 100084, China.
Department of Mathematics, Simon Fraser University, Burnaby, BC, V5A 1S6, Canada.
JMLR Workshop Conf Proc. 2013;28(2):37-45.
Non-convex sparsity-inducing penalties have recently received considerable attentions in sparse learning. Recent theoretical investigations have demonstrated their superiority over the convex counterparts in several sparse learning settings. However, solving the non-convex optimization problems associated with non-convex penalties remains a big challenge. A commonly used approach is the Multi-Stage (MS) convex relaxation (or DC programming), which relaxes the original non-convex problem to a sequence of convex problems. This approach is usually not very practical for large-scale problems because its computational cost is a multiple of solving a single convex problem. In this paper, we propose a General Iterative Shrinkage and Thresholding (GIST) algorithm to solve the nonconvex optimization problem for a large class of non-convex penalties. The GIST algorithm iteratively solves a proximal operator problem, which in turn has a closed-form solution for many commonly used penalties. At each outer iteration of the algorithm, we use a line search initialized by the Barzilai-Borwein (BB) rule that allows finding an appropriate step size quickly. The paper also presents a detailed convergence analysis of the GIST algorithm. The efficiency of the proposed algorithm is demonstrated by extensive experiments on large-scale data sets.
非凸稀疏诱导惩罚最近在稀疏学习中受到了相当多的关注。最近的理论研究表明,在几种稀疏学习设置中,它们比凸惩罚具有优越性。然而,求解与非凸惩罚相关的非凸优化问题仍然是一个巨大的挑战。一种常用的方法是多阶段(MS)凸松弛(或DC规划),它将原始的非凸问题松弛为一系列凸问题。这种方法对于大规模问题通常不太实用,因为其计算成本是求解单个凸问题的倍数。在本文中,我们提出了一种通用迭代收缩阈值(GIST)算法,用于求解一大类非凸惩罚的非凸优化问题。GIST算法迭代地求解一个近端算子问题,而对于许多常用的惩罚,该近端算子问题又有一个闭式解。在算法的每次外层迭代中,我们使用由Barzilai-Borwein(BB)规则初始化的线搜索,该规则允许快速找到合适的步长。本文还给出了GIST算法的详细收敛性分析。通过在大规模数据集上的大量实验证明了所提算法的效率。