通过语调韵律的结构化可变性来对意义进行编码和解码。

Encoding and decoding of meaning through structured variability in intonational speech prosody.

机构信息

Department of Brain and Cognitive Sciences, University of Rochester, Rochester, NY 14627, USA.

Department of Psychology, University of Maryland, College Park, MD 20742, USA.

出版信息

Cognition. 2021 Jun;211:104619. doi: 10.1016/j.cognition.2021.104619. Epub 2021 Feb 15.

DOI:10.1016/j.cognition.2021.104619

PMID:33601018

Abstract

Speech prosody plays an important role in communication of meaning. The cognitive and computational mechanisms supporting this communication remain to be understood, however. Prosodic cues vary across talkers and speaking conditions, creating ambiguity in the sound-to-meaning mapping. We hypothesize that listeners ameliorate this ambiguity in part by learning talker-specific statistics of prosodic cues. To test this hypothesis, we investigate the production and recognition of question vs. statement prosody in American English. Experiment 1 elicits productions of questions and statements from 65 talkers to examine the distributional statistics characterizing within- and cross-talker variability in these productions. We use Bayesian ideal observer models to assess the predicted consequences of cross-talker variability on listeners' recognition of prosody. We find that learning of talker-specific distributional statistics is predicted to facilitate recognition, above and beyond what can be achieved via commonly assumed normalizations of prosodic cues. Experiment 2 tests this prediction in a comprehension experiment. We expose different groups of listeners to different prosodic input statistics and assess listeners' recognition of questions and statements both prior to, and following, exposure. Prior to exposure, ideal observer-derived predictions based on Experiment 1 provide a good qualitative fit against listeners' recognition of prosodic contours in Experiment 2. Following exposure, listeners shift the categorization boundary between questions and statements in ways consistent with learning of talker-specific statistics.

摘要

语音韵律在意义交流中起着重要作用。然而，支持这种交流的认知和计算机制仍有待理解。韵律线索因说话者和说话条件而异，从而在声音与意义的映射中产生歧义。我们假设，听众通过学习说话者特定的韵律线索统计数据在一定程度上减轻了这种歧义。为了验证这一假设，我们研究了美国英语中疑问句和陈述句的产生和识别。实验 1 从 65 位说话者那里引出疑问句和陈述句的产生，以考察这些产生中特征在说话者内和说话者间变异性的分布统计数据。我们使用贝叶斯理想观察者模型来评估说话者间变异性对听众识别韵律的预测后果。我们发现，学习说话者特定的分布统计数据有望促进识别，这超出了通过通常假设的韵律线索归一化可以实现的效果。实验 2 在理解实验中检验了这一预测。我们让不同的听众群体接触不同的韵律输入统计数据，并在接触之前和之后评估他们对疑问句和陈述句的识别。在接触之前，基于实验 1 的理想观察者推导的预测与实验 2 中听众对韵律轮廓的识别有很好的定性匹配。在接触之后，听众在分类边界上对疑问句和陈述句的分类方式与说话者特定统计数据的学习一致。

相似文献

Encoding and decoding of meaning through structured variability in intonational speech prosody.通过语调韵律的结构化可变性来对意义进行编码和解码。

Cognition. 2021 Jun;211:104619. doi: 10.1016/j.cognition.2021.104619. Epub 2021 Feb 15.

Thinking probabilistically in the study of intonational speech prosody.从概率角度研究语调语音韵律。

Wiley Interdiscip Rev Cogn Sci. 2022 Jan;13(1):e1579. doi: 10.1002/wcs.1579. Epub 2021 Oct 2.

Distributional learning for speech reflects cumulative exposure to a talker's phonetic distributions.语音的分布学习反映了对说话者语音分布的累积接触。

Psychon Bull Rev. 2019 Jun;26(3):985-992. doi: 10.3758/s13423-018-1551-5.

In Search of Salience: Focus Detection in the Speech of Different Talkers.在不同说话者的语音中寻找焦点：焦点检测

Lang Speech. 2022 Sep;65(3):650-680. doi: 10.1177/00238309211046029. Epub 2021 Nov 28.

Influence of prosodic boundaries on comprehension of spoken English sentences.韵律边界对英语口语句子理解的影响。

Percept Mot Skills. 1998 Aug;87(1):3-18. doi: 10.2466/pms.1998.87.1.3.

Talker-listener accent interactions in speech-in-noise recognition: effects of prosodic manipulation as a function of language experience.言语噪声识别中的说话者-听话者口音交互作用：韵律操控的影响与语言经验有关。

J Acoust Soc Am. 2010 Sep;128(3):1357-65. doi: 10.1121/1.3466857.

Attention modulates perceptual learning of non-native-accented speech.注意力会调节对非母语口音语音的感知学习。

Atten Percept Psychophys. 2024 Jan;86(1):339-353. doi: 10.3758/s13414-023-02790-6. Epub 2023 Oct 23.

Intonational Structure Influences Perception of Contrastive Vowel Length: The Case of Phrase-Final Lengthening in Tokyo Japanese.音高结构对对比性元音长度感知的影响：以东京日语中短语结尾的拉长为例。

Lang Speech. 2021 Dec;64(4):839-858. doi: 10.1177/0023830920971842. Epub 2020 Nov 28.

Hierarchical contributions of linguistic knowledge to talker identification: Phonological versus lexical familiarity.语言知识对说话者识别的分层贡献：语音与词汇熟悉度

Atten Percept Psychophys. 2019 May;81(4):1088-1107. doi: 10.3758/s13414-019-01778-5.

Non-native listeners' recognition of high-variability speech using PRESTO.非母语听众使用PRESTO对高变异性语音的识别。

J Am Acad Audiol. 2014 Oct;25(9):869-92. doi: 10.3766/jaaa.25.9.9.

引用本文的文献

Beating stress: Evidence for recalibration of word stress perception.战胜压力：单词重音感知重新校准的证据。

Atten Percept Psychophys. 2025 May 20. doi: 10.3758/s13414-025-03088-5.

SingleMALD: Investigating practice effects in auditory lexical decision.单通道听觉词汇判定任务中的练习效应研究

Behav Res Methods. 2025 Apr 2;57(5):136. doi: 10.3758/s13428-025-02628-z.

Learning to recognize unfamiliar faces from fine-phonetic detail in visual speech.从视觉语音中的精细语音细节学习识别不熟悉的面孔。

Atten Percept Psychophys. 2025 Apr;87(3):936-951. doi: 10.3758/s13414-025-03049-y. Epub 2025 Mar 20.

Perception and adaptation of receptive prosody in autistic adolescents.自闭症青少年对接受性韵律的感知和适应。

Sci Rep. 2024 Jul 16;14(1):16409. doi: 10.1038/s41598-024-66569-x.

Evaluating normalization accounts against the dense vowel space of Central Swedish.根据瑞典中部密集元音空间评估归一化账户。

Front Psychol. 2023 Jun 21;14:1165742. doi: 10.3389/fpsyg.2023.1165742. eCollection 2023.

Perceptual learning of multiple talkers: Determinants, characteristics, and limitations.多位说话者的感知学习：决定因素、特征和局限性。

Atten Percept Psychophys. 2022 Oct;84(7):2335-2359. doi: 10.3758/s13414-022-02556-6. Epub 2022 Sep 8.

Computational Modeling of an Auditory Lexical Decision Experiment Using DIANA.使用 DIANA 对听觉词汇决策实验进行计算建模。

Lang Speech. 2023 Sep;66(3):564-605. doi: 10.1177/00238309221111752. Epub 2022 Aug 24.

Using Rational Models to Interpret the Results of Experiments on Accent Adaptation.运用理性模型解读口音适应实验结果。

Front Psychol. 2021 Nov 5;12:676271. doi: 10.3389/fpsyg.2021.676271. eCollection 2021.

Cross-talker generalization in the perception of nonnative speech: A large-scale replication.非母语语音感知中的交叉谈话语者泛化：大规模复制。

J Exp Psychol Gen. 2021 Nov;150(11):e22-e56. doi: 10.1037/xge0001039. Epub 2021 Aug 9.

Perception and Production of Statement-Question Intonation in Autism Spectrum Disorder: A Developmental Investigation.自闭症谱系障碍中陈述句-疑问句语调的感知和产生：一项发展研究。

J Autism Dev Disord. 2022 Aug;52(8):3456-3472. doi: 10.1007/s10803-021-05220-4. Epub 2021 Aug 5.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

通过语调韵律的结构化可变性来对意义进行编码和解码。

Encoding and decoding of meaning through structured variability in intonational speech prosody.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献