Suppr超能文献

一场带有皮层转折的鸡尾酒会:皮层机制如何促进声音分离。

A cocktail party with a cortical twist: how cortical mechanisms contribute to sound segregation.

作者信息

Elhilali Mounya, Shamma Shihab A

机构信息

Department of Electrical and Computer Engineering, Johns Hopkins University, Barton, Baltimore, Maryland 21218, USA.

出版信息

J Acoust Soc Am. 2008 Dec;124(6):3751-71. doi: 10.1121/1.3001672.

Abstract

Sound systems and speech technologies can benefit greatly from a deeper understanding of how the auditory system, and particularly the auditory cortex, is able to parse complex acoustic scenes into meaningful auditory objects and streams under adverse conditions. In the current work, a biologically plausible model of this process is presented, where the role of cortical mechanisms in organizing complex auditory scenes is explored. The model consists of two stages: (i) a feature analysis stage that maps the acoustic input into a multidimensional cortical representation and (ii) an integrative stage that recursively builds up expectations of how streams evolve over time and reconciles its predictions with the incoming sensory input by sorting it into different clusters. This approach yields a robust computational scheme for speaker separation under conditions of speech or music interference. The model can also emulate the archetypal streaming percepts of tonal stimuli that have long been tested in human subjects. The implications of this model are discussed with respect to the physiological correlates of streaming in the cortex as well as the role of attention and other top-down influences in guiding sound organization.

摘要

声音系统和语音技术可以从更深入地理解听觉系统,特别是听觉皮层如何在不利条件下将复杂的声学场景解析为有意义的听觉对象和流中受益匪浅。在当前的工作中,提出了一个关于这个过程的生物学上合理的模型,其中探索了皮层机制在组织复杂听觉场景中的作用。该模型由两个阶段组成:(i)一个特征分析阶段,将声学输入映射到多维皮层表示中;(ii)一个整合阶段,递归地建立关于流如何随时间演变的期望,并通过将传入的感官输入分类到不同的簇中来使其预测与传入的感官输入相协调。这种方法产生了一种在语音或音乐干扰条件下进行说话者分离的强大计算方案。该模型还可以模拟长期以来在人类受试者中测试过的音调刺激的典型流感知。讨论了该模型在皮层中流的生理相关性以及注意力和其他自上而下的影响在引导声音组织中的作用方面的意义。

相似文献

2
Cortical Representations of Speech in a Multitalker Auditory Scene.多说话者听觉场景中语音的皮质表征
J Neurosci. 2017 Sep 20;37(38):9189-9196. doi: 10.1523/JNEUROSCI.0938-17.2017. Epub 2017 Aug 18.
4
Temporal coherence and the streaming of complex sounds.时间相干性与复杂声音的流动。
Adv Exp Med Biol. 2013;787:535-43. doi: 10.1007/978-1-4614-1590-9_59.
9
Functional imaging of auditory scene analysis.听觉场景分析的功能成像。
Hear Res. 2014 Jan;307:98-110. doi: 10.1016/j.heares.2013.08.003. Epub 2013 Aug 19.

引用本文的文献

1
Perceptual clustering in auditory streaming.听觉流中的感知聚类
PLoS Comput Biol. 2025 Jul 11;21(7):e1013189. doi: 10.1371/journal.pcbi.1013189. eCollection 2025 Jul.
10
A Gestalt inference model for auditory scene segregation.听觉场景分离的格式塔推理模型。
PLoS Comput Biol. 2019 Jan 22;15(1):e1006711. doi: 10.1371/journal.pcbi.1006711. eCollection 2019 Jan.

本文引用的文献

1
Learning Invariance from Transformation Sequences.从变换序列中学习不变性。
Neural Comput. 1991 Summer;3(2):194-200. doi: 10.1162/neco.1991.3.2.194.
4
The role of attention in the formation of auditory streams.注意在听觉流形成中的作用。
Percept Psychophys. 2007 Jan;69(1):136-52. doi: 10.3758/bf03194460.
7
The perceptual consequences of binaural hearing.双耳听觉的感知结果。
Int J Audiol. 2006;45 Suppl 1:S34-44. doi: 10.1080/14992020600782642.
9
Neural encoding and retrieval of sound sequences.声音序列的神经编码与检索
Ann N Y Acad Sci. 2005 Dec;1060:125-35. doi: 10.1196/annals.1360.009.
10
Spectral processing in the auditory cortex.听觉皮层中的频谱处理。
Int Rev Neurobiol. 2005;70:253-98. doi: 10.1016/S0074-7742(05)70008-8.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验