Matsuda Yoshitatsu, Yamaguchi Kazunori
IEEE Trans Neural Netw Learn Syst. 2018 Nov;29(11):5630-5642. doi: 10.1109/TNNLS.2018.2806959. Epub 2018 Mar 13.
The independent component analysis (ICA) is a widely used method for solving blind separation problems. The ICA assumes that the sources are independent of each other and extracts them by maximizing their non-Gaussianity as the objective function. There are the two types of non-Gaussianity of the sources (the super-Gaussian type with the positive kurtosis and the sub-Gaussian one with the negative kurtosis). In this paper, we propose a new objective function unifying the two types of non-Gaussianity naturally, which is derived by applying the Gaussian approximation to the distribution of sources in the second-order polynomial feature space. The proposed objective function [called the adaptive ICA function (AIF)] is a simple form given as a summation of weighted fourth-order statistics, where the weights are adaptively estimated by the current kurtoses. The first practical advantage of the AIF is that it can extract the sources one by one in the descending order of the criterion of non-Gaussianity. It can solve the permutation ambiguity problem. The second and more important advantage is that it can estimate the number of non-Gaussian sources by the Akaike information criterion irrespective of the specific form of their distributions. In order to utilize the above-mentioned advantages of the AIF, we construct a new algorithm named the ordering ICA by extending the fast ICA. Experimental results verify that the ordering ICA can estimate the number of non-Gaussian sources correctly in both artificial and real data sets.
独立成分分析(ICA)是一种广泛用于解决盲分离问题的方法。ICA假设源彼此独立,并通过最大化其非高斯性作为目标函数来提取它们。源存在两种类型的非高斯性(具有正峰度的超高斯类型和具有负峰度的亚高斯类型)。在本文中,我们提出了一种自然统一这两种非高斯性的新目标函数,它是通过将高斯近似应用于二阶多项式特征空间中源的分布而推导出来的。所提出的目标函数[称为自适应ICA函数(AIF)]是一种简单形式,以加权四阶统计量的总和给出,其中权重由当前峰度自适应估计。AIF的第一个实际优点是它可以按照非高斯性标准的降序逐个提取源。它可以解决排列模糊问题。第二个更重要的优点是它可以通过赤池信息准则估计非高斯源的数量,而不管其分布的具体形式如何。为了利用AIF的上述优点,我们通过扩展快速ICA构建了一种名为排序ICA的新算法。实验结果验证了排序ICA可以在人工数据集和真实数据集中正确估计非高斯源的数量。