Li Miaomiao, Xia Jingyuan, Xu Huiying, Liao Qing, Zhu Xinzhong, Liu Xinwang
IEEE Trans Cybern. 2023 Jun;53(6):3479-3492. doi: 10.1109/TCYB.2021.3126727. Epub 2023 May 17.
Localized incomplete multiple kernel k -means (LI-MKKM) is recently put forward to boost the clustering accuracy via optimally utilizing a quantity of prespecified incomplete base kernel matrices. Despite achieving significant achievement in a variety of applications, we find out that LI-MKKM does not sufficiently consider the diversity and the complementary of the base kernels. This could make the imputation of incomplete kernels less effective, and vice versa degrades on the subsequent clustering. To tackle these problems, an improved LI-MKKM, called LI-MKKM with matrix-induced regularization (LI-MKKM-MR), is proposed by incorporating a matrix-induced regularization term to handle the correlation among base kernels. The incorporated regularization term is beneficial to decrease the probability of simultaneously selecting two similar kernels and increase the probability of selecting two kernels with moderate differences. After that, we establish a three-step iterative algorithm to solve the corresponding optimization objective and analyze its convergence. Moreover, we theoretically show that the local kernel alignment is a special case of its global one with normalizing each base kernel matrices. Based on the above observation, the generalization error bound of the proposed algorithm is derived to theoretically justify its effectiveness. Finally, extensive experiments on several public datasets have been conducted to evaluate the clustering performance of the LI-MKKM-MR. As indicated, the experimental results have demonstrated that our algorithm consistently outperforms the state-of-the-art ones, verifying the superior performance of the proposed algorithm.
局部不完全多核k均值算法(LI-MKKM)是最近提出的,旨在通过最优地利用大量预先指定的不完全基核矩阵来提高聚类精度。尽管在各种应用中取得了显著成果,但我们发现LI-MKKM没有充分考虑基核的多样性和互补性。这可能会使不完全核的插补效果不佳,反之则会降低后续聚类的性能。为了解决这些问题,我们提出了一种改进的LI-MKKM,即带矩阵诱导正则化的LI-MKKM(LI-MKKM-MR),通过引入一个矩阵诱导正则化项来处理基核之间的相关性。引入的正则化项有利于降低同时选择两个相似核的概率,并增加选择两个差异适中的核的概率。在此之后,我们建立了一个三步迭代算法来求解相应的优化目标,并分析其收敛性。此外,我们从理论上表明,局部核对齐是其全局核对齐的一种特殊情况,即对每个基核矩阵进行归一化。基于上述观察,我们推导了所提算法的泛化误差界,从理论上证明了其有效性。最后,我们在几个公共数据集上进行了广泛的实验,以评估LI-MKKM-MR的聚类性能。结果表明,实验结果证明我们的算法始终优于现有算法,验证了所提算法的卓越性能。