• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

竞争条件下的声源特性、跨共振峰整合与言语可懂度

Acoustic source characteristics, across-formant integration, and speech intelligibility under competitive conditions.

作者信息

Roberts Brian, Summers Robert J, Bailey Peter J

机构信息

Psychology, School of Life and Health Sciences, Aston University.

Department of Psychology, University of York.

出版信息

J Exp Psychol Hum Percept Perform. 2015 Jun;41(3):680-91. doi: 10.1037/xhp0000038. Epub 2015 Mar 9.

DOI:10.1037/xhp0000038
PMID:25751040
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4445382/
Abstract

An important aspect of speech perception is the ability to group or select formants using cues in the acoustic source characteristics--for example, fundamental frequency (F0) differences between formants promote their segregation. This study explored the role of more radical differences in source characteristics. Three-formant (F1+F2+F3) synthetic speech analogues were derived from natural sentences. In Experiment 1, F1+F3 were generated by passing a harmonic glottal source (F0 = 140 Hz) through second-order resonators (H1+H3); in Experiment 2, F1+F3 were tonal (sine-wave) analogues (T1+T3). F2 could take either form (H2 or T2). In some conditions, the target formants were presented alone, either monaurally or dichotically (left ear = F1+F3; right ear = F2). In others, they were accompanied by a competitor for F2 (F1+F2C+F3; F2), which listeners must reject to optimize recognition. Competitors (H2C or T2C) were created using the time-reversed frequency and amplitude contours of F2. Dichotic presentation of F2 and F2C ensured that the impact of the competitor arose primarily through informational masking. In the absence of F2C, the effect of a source mismatch between F1+F3 and F2 was relatively modest. When F2C was present, intelligibility was lowest when F2 was tonal and F2C was harmonic, irrespective of which type matched F1+F3. This finding suggests that source type and context, rather than similarity, govern the phonetic contribution of a formant. It is proposed that wideband harmonic analogues are more effective informational maskers than narrowband tonal analogues, and so become dominant in across-frequency integration of phonetic information when placed in competition.

摘要

言语感知的一个重要方面是利用声源特征中的线索对共振峰进行分组或选择的能力——例如,共振峰之间的基频(F0)差异促进了它们的分离。本研究探讨了声源特征中更显著差异的作用。从自然句子中导出了三共振峰(F1+F2+F3)合成语音类似物。在实验1中,F1+F3是通过使谐波声门源(F0 = 140 Hz)通过二阶谐振器(H1+H3)生成的;在实验2中,F1+F3是音调(正弦波)类似物(T1+T3)。F2可以采用任何一种形式(H2或T2)。在某些条件下,目标共振峰单独呈现,单耳或双耳呈现(左耳 = F1+F3;右耳 = F2)。在其他条件下,它们伴随着F2的竞争音(F1+F2C+F3;F2),听众必须排除该竞争音以优化识别。竞争音(H2C或T2C)是使用F2的时间反转频率和幅度轮廓创建的。F2和F2C的双耳呈现确保了竞争音的影响主要通过信息掩蔽产生。在没有F2C的情况下,F1+F3和F2之间的声源不匹配效应相对较小。当存在F2C时,无论哪种类型与F1+F3匹配,当F2是音调且F2C是谐波时,可懂度最低。这一发现表明,声源类型和语境而非相似性决定了共振峰的语音贡献。有人提出,宽带谐波类似物比窄带音调类似物更有效的信息掩蔽器,因此在竞争中进行语音信息的跨频率整合时占主导地位。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dcb3/4445382/a120d18e5e21/xhp_41_3_680_fig4a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dcb3/4445382/81ada86ac489/xhp_41_3_680_fig1a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dcb3/4445382/fe1d2c128e00/xhp_41_3_680_fig2a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dcb3/4445382/edce830f84d8/xhp_41_3_680_fig3a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dcb3/4445382/a120d18e5e21/xhp_41_3_680_fig4a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dcb3/4445382/81ada86ac489/xhp_41_3_680_fig1a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dcb3/4445382/fe1d2c128e00/xhp_41_3_680_fig2a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dcb3/4445382/edce830f84d8/xhp_41_3_680_fig3a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dcb3/4445382/a120d18e5e21/xhp_41_3_680_fig4a.jpg

相似文献

1
Acoustic source characteristics, across-formant integration, and speech intelligibility under competitive conditions.竞争条件下的声源特性、跨共振峰整合与言语可懂度
J Exp Psychol Hum Percept Perform. 2015 Jun;41(3):680-91. doi: 10.1037/xhp0000038. Epub 2015 Mar 9.
2
Across-formant integration and speech intelligibility: Effects of acoustic source properties in the presence and absence of a contralateral interferer.跨共振峰整合与言语可懂度:声源特性在有和没有对侧干扰音情况下的影响。
J Acoust Soc Am. 2016 Aug;140(2):1227. doi: 10.1121/1.4960595.
3
Informational masking and the effects of differences in fundamental frequency and fundamental-frequency contour on phonetic integration in a formant ensemble.信息掩蔽以及共振峰组合中基频和基频轮廓差异对语音整合的影响。
Hear Res. 2017 Feb;344:295-303. doi: 10.1016/j.heares.2016.10.026. Epub 2016 Nov 1.
4
Effects of differences in fundamental frequency on across-formant grouping in speech perception.基频差异对言语感知中跨频带组合的影响。
J Acoust Soc Am. 2010 Dec;128(6):3667-77. doi: 10.1121/1.3505119.
5
Dichotic integration of acoustic-phonetic information: Competition from extraneous formants increases the effect of second-formant attenuation on intelligibility.两耳分听对语音信息的整合:多余共振峰的竞争增强了第二共振峰衰减对可懂度的影响。
J Acoust Soc Am. 2019 Mar;145(3):1230. doi: 10.1121/1.5091443.
6
Formant-frequency variation and its effects on across-formant grouping in speech perception.共振峰频率变化及其对言语感知中跨共振峰组合的影响。
Adv Exp Med Biol. 2013;787:323-31. doi: 10.1007/978-1-4614-1590-9_36.
7
Informational masking of monaural target speech by a single contralateral formant.单对侧共振峰对单耳目标语音的信息掩蔽
J Acoust Soc Am. 2015 May;137(5):2726-36. doi: 10.1121/1.4919344.
8
The perceptual organization of sine-wave speech under competitive conditions.正弦波语音在竞争条件下的知觉组织。
J Acoust Soc Am. 2010 Aug;128(2):804-17. doi: 10.1121/1.3445786.
9
Formant-frequency variation and informational masking of speech by extraneous formants: evidence against dynamic and speech-specific acoustical constraints.共振峰频率变化与无关共振峰对语音的信息掩蔽:反对动态和特定语音声学限制的证据。
J Exp Psychol Hum Percept Perform. 2014 Aug;40(4):1507-25. doi: 10.1037/a0036629. Epub 2014 May 19.
10
Effects of the rate of formant-frequency variation on the grouping of formants in speech perception.共振峰频率变化率对言语感知中共振峰分组的影响。
J Assoc Res Otolaryngol. 2012 Apr;13(2):269-280. doi: 10.1007/s10162-011-0307-y. Epub 2011 Dec 13.

引用本文的文献

1
Understanding the Process of Integration in Binaural Cochlear Implant Configurations.了解双耳人工耳蜗配置中的整合过程。
Ear Hear. 2025;46(4):880-898. doi: 10.1097/AUD.0000000000001629. Epub 2025 Feb 28.
2
Overtone focusing in biphonic tuvan throat singing.双音图瓦喉音的泛音聚焦。
Elife. 2020 Feb 17;9:e50476. doi: 10.7554/eLife.50476.
3
Arrays of rectangular subcritical speech bands: Intelligibility improved by noise-vocoding and expanding to critical bandwidths.矩形亚临界语音频段阵列:通过噪声编码和扩展到临界带宽来提高可懂度。

本文引用的文献

1
Formant-frequency variation and informational masking of speech by extraneous formants: evidence against dynamic and speech-specific acoustical constraints.共振峰频率变化与无关共振峰对语音的信息掩蔽:反对动态和特定语音声学限制的证据。
J Exp Psychol Hum Percept Perform. 2014 Aug;40(4):1507-25. doi: 10.1037/a0036629. Epub 2014 May 19.
2
The role of first formant information in simulated electro-acoustic hearing.第一共振峰信息在模拟电声听觉中的作用。
J Acoust Soc Am. 2013 Jun;133(6):4279-89. doi: 10.1121/1.4803910.
3
Effects of the rate of formant-frequency variation on the grouping of formants in speech perception.
J Acoust Soc Am. 2018 Apr;143(4):EL305. doi: 10.1121/1.5034170.
共振峰频率变化率对言语感知中共振峰分组的影响。
J Assoc Res Otolaryngol. 2012 Apr;13(2):269-280. doi: 10.1007/s10162-011-0307-y. Epub 2011 Dec 13.
4
Estimating speech spectra for copy synthesis by linear prediction and by hand.通过线性预测和手工方法估计复制合成的语音频谱。
J Acoust Soc Am. 2011 Oct;130(4):2173-8. doi: 10.1121/1.3631667.
5
Fundamental frequency is critical to speech perception in noise in combined acoustic and electric hearing.基频对于在声电联合听觉中噪声环境下的言语感知至关重要。
J Acoust Soc Am. 2011 Oct;130(4):2054-62. doi: 10.1121/1.3631563.
6
Coherence masking protection for speech in children and adults.儿童和成人语音的相干掩蔽保护
Atten Percept Psychophys. 2011 Nov;73(8):2606-23. doi: 10.3758/s13414-011-0210-y.
7
Evaluation of similarity effects in informational masking.信息掩蔽中相似性效应的评估。
J Acoust Soc Am. 2011 Jun;129(6):EL280-5. doi: 10.1121/1.3590168.
8
Effects of differences in fundamental frequency on across-formant grouping in speech perception.基频差异对言语感知中跨频带组合的影响。
J Acoust Soc Am. 2010 Dec;128(6):3667-77. doi: 10.1121/1.3505119.
9
The intelligibility of noise-vocoded speech: spectral information available from across-channel comparison of amplitude envelopes.噪声编码语音的可懂度:跨通道幅度包络比较可获得的频谱信息。
Proc Biol Sci. 2011 May 22;278(1711):1595-600. doi: 10.1098/rspb.2010.1554. Epub 2010 Nov 10.
10
The perceptual organization of sine-wave speech under competitive conditions.正弦波语音在竞争条件下的知觉组织。
J Acoust Soc Am. 2010 Aug;128(2):804-17. doi: 10.1121/1.3445786.