Yang Jing-Hua, Chen Chuan, Dai Hong-Ning, Ding Meng, Wu Zhe-Bin, Zheng Zibin
IEEE Trans Neural Netw Learn Syst. 2024 Jul;35(7):8839-8853. doi: 10.1109/TNNLS.2022.3215983. Epub 2024 Jul 8.
Tensor analysis has received widespread attention in high-dimensional data learning. Unfortunately, the tensor data are often accompanied by arbitrary signal corruptions, including missing entries and sparse noise. How to recover the characteristics of the corrupted tensor data and make it compatible with the downstream clustering task remains a challenging problem. In this article, we study a generalized transformed tensor low-rank representation (TTLRR) model for simultaneously recovering and clustering the corrupted tensor data. The core idea is to find the latent low-rank tensor structure from the corrupted measurements using the transformed tensor singular value decomposition (SVD). Theoretically, we prove that TTLRR can recover the clean tensor data with a high probability guarantee under mild conditions. Furthermore, by using the transform adaptively learning from the data itself, the proposed TTLRR model can approximately represent and exploit the intrinsic subspace and seek out the cluster structure of the tensor data precisely. An effective algorithm is designed to solve the proposed model under the alternating direction method of multipliers (ADMMs) algorithm framework. The effectiveness and superiority of the proposed method against the compared methods are showcased over different tasks, including video/face data recovery and face/object/scene data clustering.
张量分析在高维数据学习中受到了广泛关注。不幸的是,张量数据常常伴随着任意的信号损坏,包括缺失数据项和稀疏噪声。如何恢复损坏的张量数据的特征并使其与下游聚类任务兼容仍然是一个具有挑战性的问题。在本文中,我们研究了一种广义变换张量低秩表示(TTLRR)模型,用于同时恢复和聚类损坏的张量数据。核心思想是使用变换张量奇异值分解(SVD)从损坏的测量值中找到潜在的低秩张量结构。从理论上讲,我们证明了在温和条件下,TTLRR能够以高概率保证恢复干净的张量数据。此外,通过使用从数据本身自适应学习的变换,所提出的TTLRR模型可以近似表示并利用内在子空间,并精确地找出张量数据的聚类结构。我们设计了一种有效的算法,在交替方向乘子法(ADMMs)算法框架下求解所提出的模型。在所提出的方法与比较方法的对比中,其有效性和优越性在不同任务中得到了展示,包括视频/面部数据恢复以及面部/物体/场景数据聚类。