Peng Siyuan, Yang Zhijing, Nie Feiping, Chen Badong, Lin Zhiping
School of Information Engineering, Guangdong University of Technology, 510006, China.
School of Information Engineering, Guangdong University of Technology, 510006, China.
Neural Netw. 2022 Oct;154:203-217. doi: 10.1016/j.neunet.2022.07.021. Epub 2022 Jul 21.
Concept factorization (CF) has shown the effectiveness in the field of data clustering. In this paper, a novel and robust semi-supervised CF method, called correntropy based semi-supervised concept factorization with adaptive neighbors (CSCF), is proposed with improved performance in clustering applications. Specifically, on the one hand, the CSCF method adopts correntropy as the cost function to increase the robustness for non-Gaussian noise and outliers, and combines two different types of supervised information simultaneously for obtaining a compact low-dimensional representation of the original data. On the other hand, CSCF assigns the adaptive neighbors for each data point to construct a good data similarity matrix for reducing the sensitiveness of data. Moreover, a generalized version of CSCF is derived for enlarging the clustering application ranges. Analysis is also presented for the relationship of CSCF with several typical CF methods. Experimental results have shown that CSCF has better clustering performance than several state-of-the-art CF methods.
概念分解(CF)已在数据聚类领域展现出有效性。本文提出了一种新颖且稳健的半监督CF方法,称为基于核相关熵的带自适应邻域的半监督概念分解(CSCF),其在聚类应用中具有改进的性能。具体而言,一方面,CSCF方法采用核相关熵作为代价函数以增强对非高斯噪声和离群值的鲁棒性,并同时结合两种不同类型的监督信息来获得原始数据的紧凑低维表示。另一方面,CSCF为每个数据点分配自适应邻域以构建良好的数据相似性矩阵,从而降低数据的敏感性。此外,还推导了CSCF的广义版本以扩大聚类应用范围。同时也分析了CSCF与几种典型CF方法之间的关系。实验结果表明,CSCF比几种当前最先进的CF方法具有更好的聚类性能。