
Schema learning for the cocktail party problem.

Affiliations

Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139.

Program in Speech and Hearing Bioscience and Technology, Division of Medical Sciences, Harvard University, Boston, MA 02115.

Publication information

Proc Natl Acad Sci U S A. 2018 Apr 3;115(14):E3313-E3322. doi: 10.1073/pnas.1801614115. Epub 2018 Mar 21.

DOI: 10.1073/pnas.1801614115
PMID: 29563229
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5889675/

Abstract

The cocktail party problem requires listeners to infer individual sound sources from mixtures of sound. The problem can be solved only by leveraging regularities in natural sound sources, but little is known about how such regularities are internalized. We explored whether listeners learn source "schemas" (the abstract structure shared by different occurrences of the same type of sound source) and use them to infer sources from mixtures. We measured the ability of listeners to segregate mixtures of time-varying sources. In each experiment a subset of trials contained schema-based sources generated from a common template by transformations (transposition and time dilation) that introduced acoustic variation but preserved abstract structure. Across several tasks and classes of sound sources, schema-based sources consistently aided source separation, in some cases producing rapid improvements in performance over the first few exposures to a schema. Learning persisted across blocks that did not contain the learned schema, and listeners were able to learn and use multiple schemas simultaneously. No learning was evident when schemas were presented in the task-irrelevant (i.e., distractor) source. However, learning from task-relevant stimuli showed signs of being implicit, in that listeners were no more likely to report that sources recurred in experiments containing schema-based sources than in control experiments containing no schema-based sources. The results implicate a mechanism for rapidly internalizing abstract sound structure, facilitating accurate perceptual organization of sound sources that recur in the environment.
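The idea of generating schema-based sources from a common template can be illustrated with a minimal sketch. This is not the authors' stimulus code: the trajectory representation (time in seconds, pitch in semitones), the function names, and the example values are all assumptions made for illustration. The sketch only shows why transposition and time dilation change the acoustics of a source while leaving its abstract shape intact.

```python
from typing import List, Tuple

# Hypothetical representation (an assumption, not from the paper):
# a source is a list of (time in seconds, pitch in semitones) points.
Trajectory = List[Tuple[float, float]]


def transform(template: Trajectory, semitone_shift: float,
              time_scale: float) -> Trajectory:
    """Return a variant of the template, transposed and time-dilated."""
    if time_scale <= 0:
        raise ValueError("time_scale must be positive")
    # Transposition adds a constant in semitones; dilation rescales time.
    return [(t * time_scale, p + semitone_shift) for t, p in template]


def contour(traj: Trajectory) -> List[float]:
    """Pitch intervals between successive points (unchanged by transposition)."""
    return [b[1] - a[1] for a, b in zip(traj, traj[1:])]


def relative_timing(traj: Trajectory) -> List[float]:
    """Each time point as a fraction of total duration (unchanged by dilation)."""
    start, end = traj[0][0], traj[-1][0]
    return [(t - start) / (end - start) for t, _ in traj]


# Illustrative template and one variant derived from it.
template = [(0.0, 0.0), (0.25, 2.0), (0.5, -1.0), (1.0, 3.0)]
variant = transform(template, semitone_shift=5.0, time_scale=1.5)

if __name__ == "__main__":
    print("variant points:", variant)
    print("same contour:", contour(variant) == contour(template))
    print("same relative timing:",
          relative_timing(variant) == relative_timing(template))
```

The variant differs from the template at every point in absolute pitch and absolute time, yet its interval pattern and relative timing match. That shared pattern is one way to think of the "abstract structure" a listener could internalize.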

