Koch Inge, Naito Kanta
Department of Statistics, School of Mathematics, University of New South Wales, Sydney, NSW 2052 Australia.
Neural Comput. 2007 Feb;19(2):513-45. doi: 10.1162/neco.2007.19.2.513.
This letter is concerned with the problem of selecting the best or most informative dimension for dimension reduction and feature extraction in high-dimensional data. The dimension of the data is reduced by principal component analysis; subsequent application of independent component analysis to the principal component scores determines the most nongaussian directions in the lower-dimensional space. A criterion for choosing the optimal dimension based on bias-adjusted skewness and kurtosis is proposed. This new dimension selector is applied to real data sets and compared to existing methods. Simulation studies for a range of densities show that the proposed method performs well and is more appropriate for nongaussian data than existing methods.
本文关注的是在高维数据中为降维和特征提取选择最佳或最具信息量维度的问题。通过主成分分析降低数据维度;随后对主成分得分应用独立成分分析,以确定低维空间中最非高斯的方向。提出了一种基于偏差调整后的偏度和峰度来选择最优维度的准则。将这种新的维度选择器应用于实际数据集,并与现有方法进行比较。针对一系列密度的模拟研究表明,所提出的方法表现良好,并且比现有方法更适用于非高斯数据。