Department of Communicative Disorders, University of Wisconsin, Madison, WI 53706, USA.
Adv Exp Med Biol. 2013;787:203-11. doi: 10.1007/978-1-4614-1590-9_23.
We rely critically on our ability to 'hear out' (segregate) individual sound sources in a mixture. Yet, despite its importance, little is known regarding this -ability. Perturbation analysis is a psychophysical method that has been successfully applied to related problems in vision [Murray, R.F. 2011. J. of Vision 11, 1-25]. Here the approach is adapted to audition. The application proceeds in three stages: First, simple speech and environmental sounds are synthesized according to a generative model of the sound--producing source. Second, listener decision strategy in segregating target from non--target (noise) sources is determined from decision weights (regression coefficients) relating listener judgments regarding the target to lawful perturbations in acoustic parameters, as dictated by the generative model. Third, factors limiting segregation are identified by comparing the obtained weights and residuals to those of a maximum-likelihood (ML) observer that optimizes segregation based on the equations of motion of the generating source. Here, the approach is applied to test between the two major models of sound source segregation; target enhancement versus noise cancellation. The results indicate a tendency of noise segregation to preempt target enhancement when the noise source is unchanging. However, the results also show individual differences in segregation strategy that are not evident in the measures of performance accuracy alone.
我们严重依赖于“听出”(分离)混合物中各个声源的能力。然而,尽管这一能力很重要,却知之甚少。微扰分析是一种心理物理学方法,已成功应用于视觉相关问题[Murray, R.F. 2011. J. of Vision 11, 1-25]。在这里,这种方法被应用于听觉。应用程序分为三个阶段:首先,根据声源产生模型合成简单的语音和环境声音。其次,根据生成模型规定的、与听众对目标的判断相关的听觉参数的合法微扰,从将听众对目标的判断与非目标(噪声)源分离的决策权重(回归系数)确定听众的决策策略。第三,通过将获得的权重和残差与基于生成源运动方程的最大似然(ML)观察者的权重和残差进行比较,确定限制分离的因素。在这里,该方法应用于测试声源分离的两个主要模型;目标增强与噪声消除。结果表明,当噪声源不变时,噪声分离倾向于先于目标增强。然而,结果还显示了分离策略的个体差异,而这些差异在单独的性能精度测量中并不明显。