Computational Models of Auditory Scene Analysis: A Review

Authors

Szabó Beáta T, Denham Susan L, Winkler István

Affiliations

Faculty of Information Technology and Bionics, Pázmány Péter Catholic University, Budapest, Hungary; Institute of Cognitive Neuroscience and Psychology, Research Centre for Natural Sciences, Hungarian Academy of Sciences, Budapest, Hungary.

School of Psychology, University of Plymouth, Plymouth, UK.

Publication information

Front Neurosci. 2016 Nov 15;10:524. doi: 10.3389/fnins.2016.00524. eCollection 2016.

DOI:10.3389/fnins.2016.00524
PMID:27895552
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5108797/
Abstract

Auditory scene analysis (ASA) refers to the process(es) of parsing the complex acoustic input into auditory perceptual objects representing either physical sources or temporal sound patterns, such as melodies, which contributed to the sound waves reaching the ears. A number of new computational models accounting for some of the perceptual phenomena of ASA have been published recently. Here we provide a theoretically motivated review of these computational models, aiming to relate their guiding principles to the central issues of the theoretical framework of ASA. Specifically, we ask how they achieve the grouping and separation of sound elements and whether they implement some form of competition between alternative interpretations of the sound input. We consider the extent to which they include predictive processes, as important current theories suggest that perception is inherently predictive, and also how they have been evaluated. We conclude that current computational models of ASA are fragmentary in the sense that rather than providing general competing interpretations of ASA, they focus on assessing the utility of specific processes (or algorithms) for finding the causes of the complex acoustic signal. This leaves open the possibility for integrating complementary aspects of the models into a more comprehensive theory of ASA.


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67f5/5108797/2565a8919635/fnins-10-00524-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67f5/5108797/045e033010ef/fnins-10-00524-g0001.jpg

