IEEE Trans Neural Netw Learn Syst. 2017 Oct;28(10):2294-2305. doi: 10.1109/TNNLS.2016.2582746. Epub 2016 Jul 7.
In this paper, we propose a novel semisupervised feature selection framework by mining correlations among multiple tasks and apply it to different multimedia applications. Instead of independently computing the importance of features for each task, our algorithm leverages shared knowledge from multiple related tasks, thus improving the performance of feature selection. Note that the proposed algorithm is built upon an assumption that different tasks share some common structures. The proposed algorithm selects features in a batch mode, by which the correlations between various features are taken into consideration. Besides, considering the fact that labeling a large amount of training data in real world is both time-consuming and tedious, we adopt manifold learning, which exploits both labeled and unlabeled training data for a feature space analysis. Since the objective function is nonsmooth and difficult to solve, we propose an iteractive algorithm with fast convergence. Extensive experiments on different applications demonstrate that our algorithm outperforms the other state-of-the-art feature selection algorithms.
在本文中,我们提出了一种新颖的半监督特征选择框架,通过挖掘多个任务之间的相关性,并将其应用于不同的多媒体应用。我们的算法不是为每个任务独立计算特征的重要性,而是利用来自多个相关任务的共享知识,从而提高特征选择的性能。请注意,所提出的算法是基于不同任务共享某些共同结构的假设。所提出的算法以批量模式选择特征,从而考虑了各种特征之间的相关性。此外,考虑到在现实世界中对大量训练数据进行标记既耗时又乏味,我们采用了流形学习,该方法利用有标签和无标签的训练数据进行特征空间分析。由于目标函数是不平滑且难以求解的,我们提出了一种具有快速收敛性的迭代算法。在不同应用中的广泛实验表明,我们的算法优于其他最先进的特征选择算法。