• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

不和谐的语音揭示了和谐性在鸡尾酒会问题中的作用。

Inharmonic speech reveals the role of harmonicity in the cocktail party problem.

机构信息

Department of Brain and Cognitive Sciences, MIT, Cambridge, MA, 02139, USA.

Helen Wills Neuroscience Institute, UC Berkeley, Berkeley, CA, 94720, USA.

出版信息

Nat Commun. 2018 May 29;9(1):2122. doi: 10.1038/s41467-018-04551-8.

DOI:10.1038/s41467-018-04551-8
PMID:29844313
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5974276/
Abstract

The "cocktail party problem" requires us to discern individual sound sources from mixtures of sources. The brain must use knowledge of natural sound regularities for this purpose. One much-discussed regularity is the tendency for frequencies to be harmonically related (integer multiples of a fundamental frequency). To test the role of harmonicity in real-world sound segregation, we developed speech analysis/synthesis tools to perturb the carrier frequencies of speech, disrupting harmonic frequency relations while maintaining the spectrotemporal envelope that determines phonemic content. We find that violations of harmonicity cause individual frequencies of speech to segregate from each other, impair the intelligibility of concurrent utterances despite leaving intelligibility of single utterances intact, and cause listeners to lose track of target talkers. However, additional segregation deficits result from replacing harmonic frequencies with noise (simulating whispering), suggesting additional grouping cues enabled by voiced speech excitation. Our results demonstrate acoustic grouping cues in real-world sound segregation.

摘要

“鸡尾酒会问题”要求我们从混合的声源中辨别出各个声源。大脑必须利用对自然声音规律的了解来实现这一目标。一个备受讨论的规律是,频率往往具有谐和关系(基频的整数倍)。为了检验谐和性在现实世界声音分离中的作用,我们开发了语音分析/合成工具来干扰语音的载波频率,破坏谐波频率关系,同时保持决定音位内容的频谱时间包络。我们发现,谐和性的违反会导致语音的各个频率彼此分离,尽管单个语音的可理解性保持不变,但会损害同时发出的语音的可理解性,并导致听众无法跟踪目标说话者。然而,用噪声(模拟低语)替代谐波频率会导致额外的分离缺陷,这表明由浊音语音激励提供了其他分组线索。我们的结果证明了现实世界声音分离中的声学分组线索。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/922e/5974276/f53b9b8145e2/41467_2018_4551_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/922e/5974276/1d2f7a6f2e4b/41467_2018_4551_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/922e/5974276/98f4d01d5b5f/41467_2018_4551_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/922e/5974276/32dc594bab9d/41467_2018_4551_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/922e/5974276/1f87fe056289/41467_2018_4551_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/922e/5974276/edacdeab2118/41467_2018_4551_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/922e/5974276/f53b9b8145e2/41467_2018_4551_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/922e/5974276/1d2f7a6f2e4b/41467_2018_4551_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/922e/5974276/98f4d01d5b5f/41467_2018_4551_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/922e/5974276/32dc594bab9d/41467_2018_4551_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/922e/5974276/1f87fe056289/41467_2018_4551_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/922e/5974276/edacdeab2118/41467_2018_4551_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/922e/5974276/f53b9b8145e2/41467_2018_4551_Fig6_HTML.jpg

相似文献

1
Inharmonic speech reveals the role of harmonicity in the cocktail party problem.不和谐的语音揭示了和谐性在鸡尾酒会问题中的作用。
Nat Commun. 2018 May 29;9(1):2122. doi: 10.1038/s41467-018-04551-8.
2
Role of Binaural Temporal Fine Structure and Envelope Cues in Cocktail-Party Listening.双耳时间精细结构和包络线索在鸡尾酒会式聆听中的作用。
J Neurosci. 2016 Aug 3;36(31):8250-7. doi: 10.1523/JNEUROSCI.4421-15.2016.
3
The role of continuous low-frequency harmonicity cues for interrupted speech perception in bimodal hearing.连续低频谐波线索在双模式听力中对间断言语感知的作用。
J Acoust Soc Am. 2016 Apr;139(4):1747. doi: 10.1121/1.4945747.
4
The cocktail-party problem revisited: early processing and selection of multi-talker speech.再探鸡尾酒会问题:多说话者语音的早期处理与选择
Atten Percept Psychophys. 2015 Jul;77(5):1465-87. doi: 10.3758/s13414-015-0882-9.
5
Spatial cues alone produce inaccurate sound segregation: the effect of interaural time differences.仅空间线索会产生不准确的声音分离:耳间时间差的影响。
J Acoust Soc Am. 2012 Jul;132(1):357-68. doi: 10.1121/1.4718637.
6
Signal envelope and speech intelligibility differentially impact auditory motion perception.信号包络和语音可懂度对听觉运动感知有不同的影响。
Sci Rep. 2021 Jul 23;11(1):15117. doi: 10.1038/s41598-021-94662-y.
7
Monaural speech segregation using synthetic speech signals.使用合成语音信号的单耳语音分离
J Acoust Soc Am. 2006 Apr;119(4):2327-33. doi: 10.1121/1.2170030.
8
The intelligibility of speech in a harmonic masker varying in fundamental frequency contour, broadband temporal envelope, and spatial location.在基频轮廓、宽带时间包络和空间位置变化的谐波掩蔽器中语音的可懂度。
Hear Res. 2017 Jul;350:1-10. doi: 10.1016/j.heares.2017.03.012. Epub 2017 Mar 29.
9
ARTSTREAM: a neural network model of auditory scene analysis and source segregation.ARTSTREAM:一种用于听觉场景分析和声源分离的神经网络模型。
Neural Netw. 2004 May;17(4):511-36. doi: 10.1016/j.neunet.2003.10.002.
10
Low- and high-frequency cortical brain oscillations reflect dissociable mechanisms of concurrent speech segregation in noise.低频和高频皮层脑振荡反映了噪声中并行语音分离的不同机制。
Hear Res. 2018 Apr;361:92-102. doi: 10.1016/j.heares.2018.01.006. Epub 2018 Feb 2.

引用本文的文献

1
The role of harmonicity on listeners' ability to hear out voices in polyphonic music.和声在听众从复调音乐中分辨出各个声部能力方面所起的作用。
Sci Rep. 2025 Aug 28;15(1):31686. doi: 10.1038/s41598-025-16404-8.
2
Optimized feature gains explain and predict successes and failures of human selective listening.优化后的特征增益能够解释并预测人类选择性听力的成败。
bioRxiv. 2025 May 28:2025.05.28.656682. doi: 10.1101/2025.05.28.656682.
3
Reduced Neural Responses to Natural Foreground versus Background Sounds in the Auditory Cortex.听觉皮层对自然前景声音与背景声音的神经反应减弱。

本文引用的文献

1
Diversity in pitch perception revealed by task dependence.任务依赖性揭示的音高感知多样性。
Nat Hum Behav. 2018 Jan;2(1):52-66. doi: 10.1038/s41562-017-0261-8. Epub 2017 Dec 11.
2
Schema learning for the cocktail party problem.鸡尾酒会问题的模式学习。
Proc Natl Acad Sci U S A. 2018 Apr 3;115(14):E3313-E3322. doi: 10.1073/pnas.1801614115. Epub 2018 Mar 21.
3
Harmonic template neurons in primate auditory cortex underlying complex sound processing.灵长类动物听觉皮层中参与复杂声音处理的谐波模板神经元。
J Neurosci. 2025 Mar 5;45(10):e0121242024. doi: 10.1523/JNEUROSCI.0121-24.2024.
4
Models optimized for real-world tasks reveal the task-dependent necessity of precise temporal coding in hearing.针对现实世界任务进行优化的模型揭示了听觉中精确时间编码的任务依赖性必要性。
Nat Commun. 2024 Dec 4;15(1):10590. doi: 10.1038/s41467-024-54700-5.
5
Testing the role of temporal coherence on speech intelligibility with noise and single-talker maskers.测试时间相干性在噪声和单说话人掩蔽下语音可懂度中的作用。
J Acoust Soc Am. 2024 Nov 1;156(5):3285-3297. doi: 10.1121/10.0034420.
6
Models optimized for real-world tasks reveal the task-dependent necessity of precise temporal coding in hearing.针对现实世界任务进行优化的模型揭示了听力中精确时间编码在任务依赖方面的必要性。
bioRxiv. 2024 Sep 16:2024.04.21.590435. doi: 10.1101/2024.04.21.590435.
7
Effects of Memantine on the Auditory Steady-State and Harmonic Responses to 40 Hz Stimulation Across Species.美金刚对跨物种听觉稳态及对40赫兹刺激的谐波反应的影响。
Biol Psychiatry Cogn Neurosci Neuroimaging. 2024 Mar;9(3):346-355. doi: 10.1016/j.bpsc.2023.08.009. Epub 2023 Sep 6.
8
Sensitivity to Frequency Modulation is Limited Centrally.频率调制的敏感性受到中枢限制。
J Neurosci. 2023 May 17;43(20):3687-3695. doi: 10.1523/JNEUROSCI.0995-22.2023. Epub 2023 Apr 7.
9
A unitary model of auditory frequency change perception.听觉频率变化感知的单一模型。
PLoS Comput Biol. 2023 Jan 12;19(1):e1010307. doi: 10.1371/journal.pcbi.1010307. eCollection 2023 Jan.
10
Relative pitch representations and invariance to timbre.相对音高表示法与音色不变性。
Cognition. 2023 Mar;232:105327. doi: 10.1016/j.cognition.2022.105327. Epub 2022 Dec 7.
Proc Natl Acad Sci U S A. 2017 Jan 31;114(5):E840-E848. doi: 10.1073/pnas.1607519114. Epub 2017 Jan 17.
4
Indifference to dissonance in native Amazonians reveals cultural variation in music perception.对本土亚马逊人的不和谐音的漠不关心揭示了音乐感知中的文化差异。
Nature. 2016 Jul 28;535(7613):547-50. doi: 10.1038/nature18635. Epub 2016 Jul 13.
5
The role of periodicity in perceiving speech in quiet and in background noise.周期性在安静环境和背景噪声中语音感知中的作用。
J Acoust Soc Am. 2015 Dec;138(6):3586-99. doi: 10.1121/1.4936945.
6
Attentive Tracking of Sound Sources.声源的精确跟踪
Curr Biol. 2015 Aug 31;25(17):2238-46. doi: 10.1016/j.cub.2015.07.043. Epub 2015 Aug 13.
7
Segregating complex sound sources through temporal coherence.通过时间相干性分离复杂声源。
PLoS Comput Biol. 2014 Dec 18;10(12):e1003985. doi: 10.1371/journal.pcbi.1003985. eCollection 2014 Dec.
8
Speech recognition against harmonic and inharmonic complexes: spectral dips and periodicity.针对谐波和非谐波复合体的语音识别:频谱凹陷与周期性
J Acoust Soc Am. 2014 May;135(5):2873-84. doi: 10.1121/1.4870056.
9
Swinging at a cocktail party: voice familiarity aids speech perception in the presence of a competing voice.在鸡尾酒会上摇摆:在存在竞争声音的情况下,语音熟悉度有助于语音感知。
Psychol Sci. 2013 Oct;24(10):1995-2004. doi: 10.1177/0956797613482467. Epub 2013 Aug 28.
10
Lexical influences on auditory streaming.听觉流的词汇影响。
Curr Biol. 2013 Aug 19;23(16):1585-9. doi: 10.1016/j.cub.2013.06.042. Epub 2013 Jul 25.