Kennedy-Higgins Dan, Devlin Joseph T, Adank Patti
Department of Speech, Hearing and Phonetic Sciences, University College London, Chandler House, 2 Wakefield Street, London, WC1N 1PF, United Kingdom.
Department of Experimental Psychology, University College London, 26 Bedford Way, London, WC1H 0AP, United Kingdom.
J Acoust Soc Am. 2020 Apr;147(4):2728. doi: 10.1121/10.0001160.
Few studies thus far have investigated whether perception of distorted speech is consistent across different types of distortion. This study investigated whether participants show a consistent perceptual profile across three speech distortions: time-compressed speech, noise-vocoded speech, and speech in noise. Additionally, this study investigated whether and how individual differences in performance on a battery of audiological and cognitive tasks relate to perception. Eighty-eight participants completed a speeded sentence-verification task, with higher accuracy and shorter response times taken to indicate better performance. Audiological and cognitive measures included pure tone audiometry, speech recognition threshold, working memory, vocabulary knowledge, attention switching, and pattern analysis. Despite previous studies suggesting that temporal and spectral/environmental perception require different lexical or phonological mechanisms, this study shows significant positive correlations in accuracy and response time performance across all distortions. Results of a principal component analysis and multiple linear regressions suggest that a component based on vocabulary knowledge and working memory predicted performance in the speech-in-quiet, time-compressed, and speech-in-noise conditions. These results suggest that listeners employ a similar cognitive strategy to perceive different temporal and spectral/environmental speech distortions and that this strategy is supported by vocabulary knowledge and working memory.