独立成分分析的非线性与噪声扩展：理论及其在音高感知模型中的应用

Maeda Shin-ichi, Song Wen-Jie, Ishii Shin

Nara Institute of Science and Technology, Ikoma, Nara 630-0192, Japan.

Neural Comput. 2005 Jan;17(1):115-44. doi: 10.1162/0899766052530866.

In this letter, we propose a noisy nonlinear version of independent component analysis (ICA). Assuming that the probability density function (p. d. f.) of sources is known, a learning rule is derived based on maximum likelihood estimation (MLE). Our model involves some algorithms of noisy linear ICA (e. g., Bermond & Cardoso, 1999) or noise-free nonlinear ICA (e. g., Lee, Koehler, & Orglmeister, 1997) as special cases. Especially when the nonlinear function is linear, the learning rule derived as a generalized expectation-maximization algorithm has a similar form to the noisy ICA algorithm previously presented by Douglas, Cichocki, and Amari (1998). Moreover, our learning rule becomes identical to the standard noise-free linear ICA algorithm in the noiseless limit, while existing MLE-based noisy ICA algorithms do not rigorously include the noise-free ICA. We trained our noisy nonlinear ICA by using acoustic signals such as speech and music. The model after learning successfully simulates virtual pitch phenomena, and the existence region of virtual pitch is qualitatively similar to that observed in a psychoacoustic experiment. Although a linear transformation hypothesized in the central auditory system can account for the pitch sensation, our model suggests that the linear transformation can be acquired through learning from actual acoustic signals. Since our model includes a cepstrum analysis in a special case, it is expected to provide a useful feature extraction method that has often been given by the cepstrum analysis.

在这封信中，我们提出了一种独立成分分析（ICA）的含噪非线性版本。假设源的概率密度函数（p.d.f.）已知，基于最大似然估计（MLE）推导了一种学习规则。我们的模型包含一些含噪线性ICA算法（例如，Bermond & Cardoso，1999）或无噪非线性ICA算法（例如，Lee、Koehler和Orglmeister，1997）作为特殊情况。特别是当非线性函数为线性时，作为广义期望最大化算法推导的学习规则与Douglas、Cichocki和Amari（1998）先前提出的含噪ICA算法具有相似的形式。此外，在无噪声极限情况下，我们的学习规则与标准的无噪线性ICA算法相同，而现有的基于MLE的含噪ICA算法并未严格包含无噪ICA。我们使用语音和音乐等声学信号训练了我们的含噪非线性ICA。学习后的模型成功模拟了虚拟音高现象，并且虚拟音高的存在区域在定性上与心理声学实验中观察到的相似。尽管在中枢听觉系统中假设的线性变换可以解释音高感知，但我们的模型表明，线性变换可以通过从实际声学信号中学习获得。由于我们的模型在特殊情况下包含了倒谱分析，因此有望提供一种通常由倒谱分析给出的有用特征提取方法。