Suppr超能文献

多位说话者的感知学习:决定因素、特征和局限性。

Perceptual learning of multiple talkers: Determinants, characteristics, and limitations.

机构信息

Department of Speech, Language, and Hearing Sciences, University of Connecticut, 2 Alethia Drive, Unit 1085, Storrs, CT, 06269-1085, USA.

Connecticut Institute for the Brain and Cognitive Sciences, University of Connecticut, 337 Mansfield Road, Unit 1272, Storrs, CT, 06269-1272, USA.

出版信息

Atten Percept Psychophys. 2022 Oct;84(7):2335-2359. doi: 10.3758/s13414-022-02556-6. Epub 2022 Sep 8.

Abstract

Research suggests that listeners simultaneously update talker-specific generative models to reflect structured phonetic variation. Because past investigations exposed listeners to talkers of different genders, it is unknown whether adaptation is talker specific or rather linked to a broader sociophonetic class. Here, we test determinants of listeners' ability to update and apply talker-specific models for speech perception. In six experiments (n = 480), listeners were first exposed to the speech of two talkers who produced ambiguous fricative energy. The talkers' speech was interleaved during exposure, and lexical context differentially biased interpretation of the ambiguity as either /s/ or /ʃ/ for each talker. At test, listeners categorized tokens from ashi-asi continua, one for each talker. Across conditions and experiments, we manipulated exposure quantity, talker gender, blocked versus interleaved talker structure at test, and the degree to which fricative acoustics differed between talkers. When test was blocked by talker, learning was observed for different but not same gender talkers. When talkers were interleaved at test, learning was observed for both different and same gender talkers, which was attenuated when fricative acoustics were constant across talkers. There was no strong evidence to suggest that adaptation to multiple talkers required increased quantity of exposure beyond that required to adapt to a single talker. These results suggest that perceptual learning for speech is achieved via a mechanism that represents a context-dependent, cumulative integration of experience with speech input and identity critical constraints on listeners' ability to dynamically apply multiple generative models in mixed talker listening environments.

摘要

研究表明,听众会同时更新特定说话者的生成模型,以反映出结构化的语音变化。由于过去的研究让听众接触到了不同性别的说话者,因此尚不清楚适应是特定于说话者的,还是与更广泛的社会语音类别有关。在这里,我们测试了听众更新和应用特定于说话者的语音感知模型的能力的决定因素。在六个实验中(n=480),听众首先接触到两个说话者的语音,这两个说话者的语音产生了模糊的摩擦能量。在暴露期间,说话者的语音交错出现,词汇语境分别偏向于对每个说话者的模糊音解释为/s/或/ʃ/。在测试中,听众对来自 ashi-asi 连续体的音素来进行分类,每个说话者一个。在不同的条件和实验中,我们操纵了暴露量、说话者性别、测试时的交错或分组说话者结构,以及说话者之间的摩擦音差异程度。当测试由说话者分组时,不同但不同性别说话者的学习都被观察到。当说话者在测试时交错时,不同性别和相同性别的说话者都观察到了学习,而当说话者之间的摩擦音不变时,学习则减弱。没有强有力的证据表明,适应多个说话者需要比适应单个说话者更多的暴露量。这些结果表明,语音感知学习是通过一种机制实现的,该机制代表了对语音输入的经验的上下文相关、累积整合,以及对听众在混合说话者聆听环境中动态应用多个生成模型的能力的关键限制。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验