IEEE Trans Cybern. 2017 Mar;47(3):759-771. doi: 10.1109/TCYB.2016.2526630. Epub 2016 Apr 5.
Dimension reduction is an important topic in pattern analysis and machine learning, and it has wide applications in feature representation and pattern classification. In the past two decades, sliced inverse regression (SIR) has attracted much research efforts due to its effectiveness and efficacy in dimension reduction. However, two drawbacks limit further applications of SIR. First, the computation complexity of SIR is usually high in the situation of high-dimensional data. Second, sparsity of projection subspace is not well mined for improving the feature selection and model interpretation abilities. This paper proposes to compute the SIR projection vectors in the spectral space, then an approximated regression solution can be obtained with a faster speed. Moreover, the adaptive lasso is used to attain a sparse and globally optimal solution, which is important in variable selection. To complete the robust pattern classification task with corruptions, a correntropy-based and class-wise regression model is designed in this paper. It takes a smooth penalty instead of sparsity constraint in the regression coefficients, and it can be conducted in class-wise, thus it is more flexible in practice. Extensive experiments are conducted by using some real and benchmark data sets, e.g., high-dimensional facial images and gene microarray data, to evaluate the new algorithms. The new proposals attain competitive results and are compared with other state-of-the-art methods.
降维是模式分析和机器学习中的一个重要课题,它在特征表示和模式分类中有广泛的应用。在过去的二十年中,切片逆回归(SIR)由于其在降维方面的有效性和功效而引起了广泛的研究兴趣。然而,有两个缺点限制了 SIR 的进一步应用。首先,在高维数据的情况下,SIR 的计算复杂度通常很高。其次,对于提高特征选择和模型解释能力,投影子空间的稀疏性没有得到很好的挖掘。本文提出在谱空间中计算 SIR 投影向量,然后可以更快的速度获得近似回归解。此外,自适应套索用于获得稀疏的全局最优解,这在变量选择中非常重要。为了完成具有损坏的鲁棒模式分类任务,本文设计了基于相关熵的分类回归模型。它在回归系数中采用了平滑惩罚而不是稀疏约束,并且可以在分类中进行,因此在实践中更加灵活。通过使用一些真实和基准数据集,例如高维面部图像和基因微阵列数据,进行了广泛的实验来评估新算法。新的提案获得了有竞争力的结果,并与其他最先进的方法进行了比较。