Moosaei Hossein, Ganaie M A, Hladík Milan, Tanveer M
Department of Informatics, Faculty of Science, Jan Evangelista Purkyně University, Ústí nad Labem, Czech Republic; Department of Applied Mathematics, Faculty of Mathematics and Physics, Charles University, Prague, Czech Republic.
Department of Mathematics, Indian Institute of Technology Indore, Simrol, Indore, 453552, India; Department of Robotics, University of Michigan, Ann Arbor, MI, 48109, USA.
Neural Netw. 2023 Jan;157:125-135. doi: 10.1016/j.neunet.2022.10.003. Epub 2022 Oct 15.
Imbalanced datasets are prominent in real-world problems. In such problems, the number of data samples in one class is significantly higher than in the other classes, even though the other classes might be more important. Standard classification algorithms may assign all data to the majority class, which is a significant drawback of most standard learning algorithms, so imbalanced datasets need to be handled carefully. Twin support vector machines (TSVM), one of the traditional algorithms, perform well on balanced data but poorly on imbalanced datasets. To improve the classification ability of TSVM on imbalanced datasets, a reduced universum twin support vector machine for class imbalance learning (RUTSVM), motivated by the universum twin support vector machine (UTSVM), was recently proposed. One of RUTSVM's key drawbacks is that its dual problems and the computation of its classifiers involve matrix inversion. In this paper, we improve RUTSVM and propose an improved reduced universum twin support vector machine for class imbalance learning (IRUTSVM). In the proposed IRUTSVM approach, we derive alternative Lagrangian functions for the primal problems of RUTSVM by moving one of the terms of the objective function into the constraints. As a result, we obtain a new dual formulation for each optimization problem, so that no matrix inversion is needed either during training or when computing the classifiers. Moreover, smaller rectangular kernel matrices are used to reduce the computational time. Extensive experiments on a variety of synthetic and real-world imbalanced datasets show that the IRUTSVM algorithm outperforms the TSVM, UTSVM, and RUTSVM algorithms in terms of generalization performance.
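For context, a minimal sketch (not the authors' exact formulation) of why moving a term from the objective into the constraints avoids matrix inversion: in the standard TSVM primal for the first hyperplane, with class matrices A and B, the quadratic term ||A w_1 + e_1 b_1||^2 in the objective leads to a dual containing (H^T H)^{-1} with H = [A  e_1]; introducing an auxiliary variable shifts that term into the constraints and removes the inverse. The regularization parameter c_3 below is an assumed addition for illustration; the actual IRUTSVM problems also carry universum constraints and reduced rectangular kernel matrices.

% Standard TSVM primal for the first hyperplane (its dual contains (H^T H)^{-1}):
\begin{align*}
\min_{w_1, b_1, \xi}\quad & \tfrac{1}{2}\,\|A w_1 + e_1 b_1\|^2 + c_1\, e_2^\top \xi \\
\text{s.t.}\quad & -(B w_1 + e_2 b_1) + \xi \ge e_2, \qquad \xi \ge 0 .
\end{align*}

% Sketch of the inverse-free idea: move the quadratic term into the constraints
% through an auxiliary variable \eta (c_3 is an assumed regularization parameter):
\begin{align*}
\min_{w_1, b_1, \eta, \xi}\quad & \tfrac{1}{2}\,\|\eta\|^2 + \tfrac{c_3}{2}\,\big(\|w_1\|^2 + b_1^2\big) + c_1\, e_2^\top \xi \\
\text{s.t.}\quad & A w_1 + e_1 b_1 = \eta, \\
& -(B w_1 + e_2 b_1) + \xi \ge e_2, \qquad \xi \ge 0 .
\end{align*}
% The stationarity conditions give w_1 = (B^\top\alpha - A^\top\lambda)/c_3 (and analogously for b_1),
% so the resulting dual involves no inverse of data-dependent matrices.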