Global Model Selection for Semi-Supervised Support Vector Machine via Solution Paths.

Author information

Fan Yajing, Yu Shuyang, Gu Bin, Xiong Ziran, Zhai Zhou, Huang Heng, Chang Yi

Publication information

IEEE Trans Neural Netw Learn Syst. 2025 Feb;36(2):2154-2168. doi: 10.1109/TNNLS.2024.3354978. Epub 2025 Feb 6.

DOI:10.1109/TNNLS.2024.3354978

Abstract

Semi-supervised support vector machine (S3VM) is important because it can use plentiful unlabeled data to improve the generalization accuracy of traditional SVMs. To perform well, S3VM needs an effective way to select its hyperparameters. However, model selection for semi-supervised models is still a key open problem. Existing methods that search for the optimal hyperparameter values of semi-supervised models are usually computationally demanding, especially those based on grid search. To address this challenging problem, in this article, we first propose solution paths of S3VM (SPS3VM), which can track the solutions of the nonconvex S3VM with respect to the hyperparameters. Specifically, we apply incremental and decremental learning methods to update the solution so that it continues to satisfy the Karush-Kuhn-Tucker (KKT) conditions. Based on SPS3VM and the piecewise linearity of the model function, we can find the model with the minimum cross-validation (CV) error over the entire range of candidate hyperparameters by computing the error path of S3VM. Our SPS3VM is the first solution path algorithm for the nonconvex optimization problems of semi-supervised learning models. We also provide a finite convergence analysis and the computational complexity of SPS3VM. Experimental results on a variety of benchmark datasets not only verify that SPS3VM can globally search the hyperparameters (regularization and ramp loss parameters) but also show a large reduction in computational time while retaining similar or slightly better generalization performance compared with the grid search approach.
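
The abstract names a ramp loss parameter and a nonconvex objective without defining either. As a rough illustration only, the sketch below uses the textbook ramp loss, written as the difference of two hinge losses; whether the paper uses exactly this form is an assumption, and the function names `hinge` and `ramp` and the default `s = -1.0` are hypothetical choices made for this example.

```python
def hinge(z, a=1.0):
    """Hinge-type loss max(0, a - z) evaluated at margin z."""
    return max(0.0, a - z)


def ramp(z, s=-1.0):
    """Textbook ramp loss: hinge at 1 minus hinge at s (requires s < 1).

    Equivalent to clipping the ordinary hinge loss at the ceiling 1 - s,
    so badly misclassified points stop contributing extra loss.
    The parameter s is the assumed "ramp loss parameter".
    """
    return hinge(z, 1.0) - hinge(z, s)


if __name__ == "__main__":
    # Flat at 0 for z >= 1, linear on [s, 1), flat at 1 - s for z < s.
    for z in (2.0, 0.5, -1.0, -3.0):
        print(z, ramp(z))

    # Nonconvexity check: the value at a midpoint exceeds the average
    # of the endpoint values, which a convex function never allows.
    a, b = -3.0, 1.0
    mid = (a + b) / 2.0
    print(ramp(mid) > (ramp(a) + ramp(b)) / 2.0)
```

Because the loss is flat on both sides of the linear segment, an objective built from it is nonconvex, which is the setting in which the abstract says the solution path has to be tracked.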

Similar articles

Global Model Selection via Solution Paths for Robust Support Vector Machine.

IEEE Trans Pattern Anal Mach Intell. 2025 Mar;47(3):1331-1347. doi: 10.1109/TPAMI.2023.3346765. Epub 2025 Feb 5.

Kernel Path for Semisupervised Support Vector Machine.

IEEE Trans Neural Netw Learn Syst. 2024 Feb;35(2):1512-1522. doi: 10.1109/TNNLS.2022.3183825. Epub 2024 Feb 5.

Kernel Error Path Algorithm.

IEEE Trans Neural Netw Learn Syst. 2023 Nov;34(11):8866-8878. doi: 10.1109/TNNLS.2022.3153953. Epub 2023 Oct 27.

Sequential Learning on sEMGs in Short- and Long-term Situations via Self-training Semi-supervised Support Vector Machine.

Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul;2022:3183-3186. doi: 10.1109/EMBC48229.2022.9871311.

Kernel Path for ν-Support Vector Classification.

IEEE Trans Neural Netw Learn Syst. 2023 Jan;34(1):490-501. doi: 10.1109/TNNLS.2021.3097248. Epub 2023 Jan 5.

Incremental learning algorithm for large-scale semi-supervised ordinal regression.

Neural Netw. 2022 May;149:124-136. doi: 10.1016/j.neunet.2022.02.004. Epub 2022 Feb 11.

A Solution Path Algorithm for General Parametric Quadratic Programming Problem.

IEEE Trans Neural Netw Learn Syst. 2018 Sep;29(9):4462-4472. doi: 10.1109/TNNLS.2017.2771456. Epub 2017 Nov 29.

Cross Validation Through Two-Dimensional Solution Surface for Cost-Sensitive SVM.

IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1103-1121. doi: 10.1109/TPAMI.2016.2578326. Epub 2016 Jun 8.

OPTIMAL COMPUTATIONAL AND STATISTICAL RATES OF CONVERGENCE FOR SPARSE NONCONVEX LEARNING PROBLEMS.

Ann Stat. 2014;42(6):2164-2201. doi: 10.1214/14-AOS1238.

Cited by

Developing the new diagnostic model by integrating bioinformatics and machine learning for osteoarthritis.

J Orthop Surg Res. 2024 Dec 18;19(1):832. doi: 10.1186/s13018-024-05340-4.

Identification of the m6A/m5C/m1A methylation modification genes in Alzheimer's disease based on bioinformatic analysis.

Aging (Albany NY). 2024 Oct 31;16(21):13340-13355. doi: 10.18632/aging.206146.

PMID:38335085