• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

黑白之间的感知:语调变量和过滤条件对社会语言学判断的影响及其对自动语音识别的启示

Perception in Black and White: Effects of Intonational Variables and Filtering Conditions on Sociolinguistic Judgments With Implications for ASR.

作者信息

Holliday Nicole R

机构信息

University of Pennsylvania, Philadelphia, PA, United States.

出版信息

Front Artif Intell. 2021 Jul 15;4:642783. doi: 10.3389/frai.2021.642783. eCollection 2021.

DOI:10.3389/frai.2021.642783
PMID:34337391
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8319665/
Abstract

This study tests the effects of intonational contours and filtering conditions on listener judgments of ethnicity to arrive at a more comprehensive understanding on how prosody influences these judgments, with implications for austomatic speech recognition systems as well as speech synthesis. In a perceptual experiment, 40 American English listeners heard phrase-long clips which were controlled for pitch accent type and focus marking. Each clip contained either two H* (high) or two L+H* (low high) pitch accents and a L-L% (falling) boundary tone, and had also previously been labelled for broad or narrow focus. Listeners rated clips in two tasks, one with unmodified stimuli and one with stimuli lowpass filtered at 400 Hz, and were asked to judge whether the speaker was "Black" or "White". In the filtered condition, tokens with the L+H* pitch accent were more likely to be rated as "Black", with an interaction such that broad focus enhanced this pattern, supporting earlier findings that listeners may perceive African American Language as having more variation in possible pitch accent meanings. In the unfiltered condition, tokens with the L+H* pitch accent were less likely to be rated as Black, with no effect of focus, likely due to the fact that listeners relied more heavily on available segmental information in this condition. These results enhance our understanding of cues listeners rely on in making social judgments about speakers, especially in ethnic identification and linguistic profiling, by highlighting perceptual differences due to listening environment as well as predicted meaning of specific intonational contours. They also contribute to our understanding of the role of how human listeners interpret meaning within a holistic context, which has implications for the construction of computational systems designed to replicate the properties of natural language. In particular, they have important applicability to speech synthesis and speech recognition programs, which are often limited in their capacities due to the fact that they do not make such holistic sociolinguistic considerations of the meanings of input or output speech.

摘要

本研究测试了语调轮廓和滤波条件对听众种族判断的影响,以更全面地了解韵律如何影响这些判断,这对自动语音识别系统以及语音合成具有启示意义。在一项感知实验中,40名美国英语听众收听了时长为短语的音频片段,这些片段在音高重音类型和焦点标记方面受到控制。每个片段包含两个H*(高)或两个L+H*(低高)音高重音以及一个L-L%(下降)边界调,并且之前也已被标记为宽泛焦点或狭窄焦点。听众在两项任务中对音频片段进行评分,一项任务使用未修改的刺激,另一项任务使用在400赫兹进行低通滤波的刺激,并被要求判断说话者是“黑人”还是“白人”。在滤波条件下,带有L+H音高重音的片段更有可能被评为“黑人”,存在一种交互作用,即宽泛焦点增强了这种模式,这支持了早期的研究结果,即听众可能认为非裔美国语言在可能的音高重音含义方面有更多变化。在未滤波条件下,带有L+H音高重音的片段被评为黑人的可能性较小,且没有焦点效应,这可能是因为在这种情况下听众更依赖可用的音段信息。这些结果通过突出由于聆听环境以及特定语调轮廓的预测含义而产生的感知差异,增强了我们对听众在对说话者进行社会判断时所依赖线索的理解,尤其是在种族识别和语言特征分析方面。它们还有助于我们理解人类听众在整体语境中如何解释意义的作用,这对旨在复制自然语言属性的计算系统的构建具有启示意义。特别是,它们对语音合成和语音识别程序具有重要的适用性,这些程序由于没有对输入或输出语音的意义进行这种整体的社会语言学考虑而往往能力有限。

相似文献

1
Perception in Black and White: Effects of Intonational Variables and Filtering Conditions on Sociolinguistic Judgments With Implications for ASR.黑白之间的感知:语调变量和过滤条件对社会语言学判断的影响及其对自动语音识别的启示
Front Artif Intell. 2021 Jul 15;4:642783. doi: 10.3389/frai.2021.642783. eCollection 2021.
2
Intonation as an encoder of speaker certainty: information and confirmation yes-no questions in Catalan.语调作为说话者确定性的编码方式:加泰罗尼亚语中的信息性和确认性是非疑问句
Lang Speech. 2013 Jun;56(Pt 2):163-90. doi: 10.1177/0023830912443942.
3
Listeners' adaptation to unreliable intonation is speaker-sensitive.听话者对不可靠语调的适应是受说话者影响的。
Cognition. 2020 Nov;204:104372. doi: 10.1016/j.cognition.2020.104372. Epub 2020 Jun 29.
4
Listening Effort by Native and Nonnative Listeners Due to Noise, Reverberation, and Talker Foreign Accent During English Speech Perception.母语和非母语听者在英语语音感知中因噪声、混响和说话者外国口音而产生的听力努力。
J Speech Lang Hear Res. 2019 Apr 15;62(4):1068-1081. doi: 10.1044/2018_JSLHR-H-17-0423.
5
Focus, accent, and argument structure: effects on language comprehension.焦点、重音和论证结构:对语言理解的影响。
Lang Speech. 1995 Oct-Dec;38 ( Pt 4):365-91. doi: 10.1177/002383099503800403.
6
How experience with tone in the native language affects the L2 acquisition of pitch accents.母语中的语调体验如何影响第二语言中声调重音的习得。
Front Psychol. 2022 Aug 19;13:903879. doi: 10.3389/fpsyg.2022.903879. eCollection 2022.
7
Phonological and phonetic marking of information status in Foreign Accent Syndrome.语音标记和语音学标记在外国口音综合征中的信息状态。
Int J Lang Commun Disord. 2012 Nov-Dec;47(6):738-49. doi: 10.1111/j.1460-6984.2012.00184.x. Epub 2012 Sep 27.
8
Encoding and decoding of meaning through structured variability in intonational speech prosody.通过语调韵律的结构化可变性来对意义进行编码和解码。
Cognition. 2021 Jun;211:104619. doi: 10.1016/j.cognition.2021.104619. Epub 2021 Feb 15.
9
Talker-listener accent interactions in speech-in-noise recognition: effects of prosodic manipulation as a function of language experience.言语噪声识别中的说话者-听话者口音交互作用:韵律操控的影响与语言经验有关。
J Acoust Soc Am. 2010 Sep;128(3):1357-65. doi: 10.1121/1.3466857.
10
How listeners weight acoustic cues to intonational phrase boundaries.听众如何权衡用于语调短语边界的声学线索。
PLoS One. 2014 Jul 14;9(7):e102166. doi: 10.1371/journal.pone.0102166. eCollection 2014.

引用本文的文献

1
An exploratory study on dialect density estimation for children and adult's African American Englisha).对儿童和成人非裔美国英语的方言密度估计的探索性研究。
J Acoust Soc Am. 2024 Apr 1;155(4):2836-2848. doi: 10.1121/10.0025771.

本文引用的文献

1
Interpreting Pitch Accents in Online Comprehension: H* vs. L+H*.在线理解中重音的解释:H* 与 L+H*。
Cogn Sci. 2008 Oct;32(7):1232-44. doi: 10.1080/03640210802138755.
2
Evaluational reactions to spoken languages.对口语的评估反应。
J Abnorm Soc Psychol. 1960 Jan;60:44-51. doi: 10.1037/h0044430.