Szabó Beáta T, Denham Susan L, Winkler István
Faculty of Information Technology and Bionics, Pázmány Péter Catholic UniversityBudapest, Hungary; Institute of Cognitive Neuroscience and Psychology, Research Centre for Natural Sciences, Hungarian Academy of SciencesBudapest, Hungary.
School of Psychology, University of Plymouth Plymouth, UK.
Front Neurosci. 2016 Nov 15;10:524. doi: 10.3389/fnins.2016.00524. eCollection 2016.
Auditory scene analysis (ASA) refers to the process (es) of parsing the complex acoustic input into auditory perceptual objects representing either physical sources or temporal sound patterns, such as melodies, which contributed to the sound waves reaching the ears. A number of new computational models accounting for some of the perceptual phenomena of ASA have been published recently. Here we provide a theoretically motivated review of these computational models, aiming to relate their guiding principles to the central issues of the theoretical framework of ASA. Specifically, we ask how they achieve the grouping and separation of sound elements and whether they implement some form of competition between alternative interpretations of the sound input. We consider the extent to which they include predictive processes, as important current theories suggest that perception is inherently predictive, and also how they have been evaluated. We conclude that current computational models of ASA are fragmentary in the sense that rather than providing general competing interpretations of ASA, they focus on assessing the utility of specific processes (or algorithms) for finding the causes of the complex acoustic signal. This leaves open the possibility for integrating complementary aspects of the models into a more comprehensive theory of ASA.
听觉场景分析(ASA)是指将复杂的声学输入解析为代表物理声源或时间声音模式(如旋律)的听觉感知对象的过程,这些对象构成了到达耳朵的声波。最近已经发表了一些新的计算模型,它们解释了ASA的一些感知现象。在这里,我们对这些计算模型进行了理论驱动的综述,旨在将它们的指导原则与ASA理论框架的核心问题联系起来。具体来说,我们探讨它们如何实现声音元素的分组和分离,以及它们是否在声音输入的不同解释之间实施某种形式的竞争。我们考虑它们在多大程度上包含预测过程,因为当前重要的理论表明感知本质上是预测性的,同时也考虑它们是如何被评估的。我们得出结论,当前的ASA计算模型是不完整的,因为它们不是提供对ASA的一般竞争性解释,而是专注于评估特定过程(或算法)在寻找复杂声学信号成因方面的效用。这为将模型的互补方面整合到更全面的ASA理论中留下了可能性。