Jiang Lanfan, Huang Zilin, Chen Yu, Zhu Wenxing
IEEE Trans Neural Netw Learn Syst. 2024 Sep 12;PP. doi: 10.1109/TNNLS.2024.3454070.
Taking advantage of the natural grouping structure inside data, group sparse optimization can effectively improve the efficiency and stability of high-dimensional data analysis, and it has wide applications in a variety of fields such as machine learning, signal processing, and bioinformatics. Although there has been a lot of progress, it is still a challenge to construct a group sparse-inducing function with good properties and to identify significant groups. This article aims to address the group-sparsity-constrained minimization problem. We convert the problem to an equivalent weighted l -norm ( , ) constrained optimization model, instead of its relaxation or approximation problem. Then, by applying the proximal gradient method, a solution method with theoretical convergence analysis is developed. Moreover, based on the properties proved in the Lagrangian dual framework, the homotopy technique is employed to cope with the parameter tuning task and to ensure that the output of the proposed homotopy algorithm is an L -stationary point of the original problem. The proposed weighted framework, with the central idea of identifying important groups, is compatible with a wide range of support set identification strategies, which can better meet the needs of different applications and improve the robustness of the model in practice. Both simulated and real data experiments demonstrate the superiority of the proposed method in terms of group feature selection accuracy and computational efficiency. Extensive experimental results in application areas such as compressed sensing, image recognition, and classifier design show that our method has great potential in a wide range of applications. Our codes will be available at https://github.com/jianglanfan/HIWT-GSC.
利用数据内部的自然分组结构,组稀疏优化可以有效提高高维数据分析的效率和稳定性,并且在机器学习、信号处理和生物信息学等多个领域有着广泛应用。尽管已经取得了很多进展,但构造具有良好性质的组稀疏诱导函数以及识别显著组仍然是一个挑战。本文旨在解决组稀疏约束最小化问题。我们将该问题转化为一个等价的加权l -范数( , )约束优化模型,而不是其松弛或近似问题。然后,通过应用近端梯度法,开发了一种具有理论收敛性分析的求解方法。此外,基于在拉格朗日对偶框架中证明的性质,采用同伦技术来处理参数调整任务,并确保所提出的同伦算法的输出是原始问题的一个L -驻点。所提出的加权框架以识别重要组为核心思想,与广泛的支持集识别策略兼容,能够更好地满足不同应用的需求,并在实际中提高模型的鲁棒性。模拟和真实数据实验都证明了所提方法在组特征选择准确性和计算效率方面的优越性。在压缩感知、图像识别和分类器设计等应用领域的大量实验结果表明,我们的方法在广泛的应用中具有巨大潜力。我们的代码将在https://github.com/jianglanfan/HIWT-GSC上提供。