Department of Brain and Cognitive Sciences, University of Rochester, Rochester, NY 14627, USA.
Department of Psychology, University of Maryland, College Park, MD 20742, USA.
Cognition. 2021 Jun;211:104619. doi: 10.1016/j.cognition.2021.104619. Epub 2021 Feb 15.
Speech prosody plays an important role in communication of meaning. The cognitive and computational mechanisms supporting this communication remain to be understood, however. Prosodic cues vary across talkers and speaking conditions, creating ambiguity in the sound-to-meaning mapping. We hypothesize that listeners ameliorate this ambiguity in part by learning talker-specific statistics of prosodic cues. To test this hypothesis, we investigate the production and recognition of question vs. statement prosody in American English. Experiment 1 elicits productions of questions and statements from 65 talkers to examine the distributional statistics characterizing within- and cross-talker variability in these productions. We use Bayesian ideal observer models to assess the predicted consequences of cross-talker variability on listeners' recognition of prosody. We find that learning of talker-specific distributional statistics is predicted to facilitate recognition, above and beyond what can be achieved via commonly assumed normalizations of prosodic cues. Experiment 2 tests this prediction in a comprehension experiment. We expose different groups of listeners to different prosodic input statistics and assess listeners' recognition of questions and statements both prior to, and following, exposure. Prior to exposure, ideal observer-derived predictions based on Experiment 1 provide a good qualitative fit against listeners' recognition of prosodic contours in Experiment 2. Following exposure, listeners shift the categorization boundary between questions and statements in ways consistent with learning of talker-specific statistics.
语音韵律在意义交流中起着重要作用。然而,支持这种交流的认知和计算机制仍有待理解。韵律线索因说话者和说话条件而异,从而在声音与意义的映射中产生歧义。我们假设,听众通过学习说话者特定的韵律线索统计数据在一定程度上减轻了这种歧义。为了验证这一假设,我们研究了美国英语中疑问句和陈述句的产生和识别。实验 1 从 65 位说话者那里引出疑问句和陈述句的产生,以考察这些产生中特征在说话者内和说话者间变异性的分布统计数据。我们使用贝叶斯理想观察者模型来评估说话者间变异性对听众识别韵律的预测后果。我们发现,学习说话者特定的分布统计数据有望促进识别,这超出了通过通常假设的韵律线索归一化可以实现的效果。实验 2 在理解实验中检验了这一预测。我们让不同的听众群体接触不同的韵律输入统计数据,并在接触之前和之后评估他们对疑问句和陈述句的识别。在接触之前,基于实验 1 的理想观察者推导的预测与实验 2 中听众对韵律轮廓的识别有很好的定性匹配。在接触之后,听众在分类边界上对疑问句和陈述句的分类方式与说话者特定统计数据的学习一致。