IEEE Trans Image Process. 2015 Dec;24(12):5343-55. doi: 10.1109/TIP.2015.2479560. Epub 2015 Sep 17.
In many image processing and pattern recognition problems, visual contents of images are currently described by high-dimensional features, which are often redundant and noisy. Toward this end, we propose a novel unsupervised feature selection scheme, namely, nonnegative spectral analysis with constrained redundancy, by jointly leveraging nonnegative spectral clustering and redundancy analysis. The proposed method can directly identify a discriminative subset of the most useful and redundancy-constrained features. Nonnegative spectral analysis is developed to learn more accurate cluster labels of the input images, during which the feature selection is performed simultaneously. The joint learning of the cluster labels and feature selection matrix enables to select the most discriminative features. Row-wise sparse models with a general ℓ(2, p)-norm (0 < p ≤ 1) are leveraged to make the proposed model suitable for feature selection and robust to noise. Besides, the redundancy between features is explicitly exploited to control the redundancy of the selected subset. The proposed problem is formulated as an optimization problem with a well-defined objective function solved by the developed simple yet efficient iterative algorithm. Finally, we conduct extensive experiments on nine diverse image benchmarks, including face data, handwritten digit data, and object image data. The proposed method achieves encouraging the experimental results in comparison with several representative algorithms, which demonstrates the effectiveness of the proposed algorithm for unsupervised feature selection.
在许多图像处理和模式识别问题中,图像的视觉内容目前由高维特征来描述,这些特征往往是冗余和有噪声的。针对这一问题,我们提出了一种新的无监督特征选择方案,即具有约束冗余的非负谱分析,通过联合利用非负谱聚类和冗余分析。所提出的方法可以直接识别出最有用和冗余约束特征的判别子集。非负谱分析用于学习输入图像更准确的聚类标签,在此过程中同时进行特征选择。聚类标签和特征选择矩阵的联合学习可以选择最具判别力的特征。利用具有一般 ℓ(2, p)-范数 (0 < p ≤ 1) 的行稀疏模型,使所提出的模型适合特征选择,并对噪声具有鲁棒性。此外,还显式地利用特征之间的冗余性来控制所选子集的冗余性。所提出的问题被表述为一个具有明确定义目标函数的优化问题,并通过开发的简单而有效的迭代算法来求解。最后,我们在九个不同的图像基准上进行了广泛的实验,包括人脸数据、手写数字数据和物体图像数据。与几个有代表性的算法相比,所提出的方法在实验结果中取得了令人鼓舞的结果,这表明了所提出的算法在无监督特征选择中的有效性。