van de Velde Daan J, Schiller Niels O, van Heuven Vincent J, Levelt Claartje C, van Ginkel Joost, Beers Mieke, Briaire Jeroen J, Frijns Johan H M
Leiden University Centre for Linguistics, Leiden University, Van Wijkplaats 3, 2311 BX, Leiden, the Netherlands.
Department of Applied Linguistics, Pannon Egyetem, 10 Egyetem Utca, 8200 Veszprém, Hungary.
J Acoust Soc Am. 2017 May;141(5):3349. doi: 10.1121/1.4982198.
This study aimed to find the optimal filter slope for cochlear implant (CI) simulations (vocoding) by testing the effect of a wide range of slopes on the discrimination of emotional and linguistic (focus) prosody, with varying availability of F0 and duration cues. Forty normally hearing participants judged whether (non-)vocoded sentences were pronounced with happy or sad emotion, or with adjectival or nominal focus. Sentences were recorded as natural stimuli and manipulated to contain only emotion- or focus-relevant segmental duration information, only F0 information, or both, and then noise-vocoded with filter slopes of 5, 20, 80, 120, and 160 dB/octave. Performance increased with steeper slopes, but only up to 120 dB/octave, with bigger effects for emotion than for focus perception. For emotion, results with both cues most closely resembled results with F0 alone, while for focus, results with both cues most closely resembled those with duration alone, indicating that emotion perception relies primarily on F0 and focus perception on duration. This suggests that filter slopes affect focus perception less than emotion perception because, for emotion, F0 is both more informative and more affected. That performance kept improving up to extreme filter slope values suggests that CI users still have much to gain in prosody perception.
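As a rough illustration of what the slope values mean, the sketch below computes how strongly a frequency component lying outside a vocoder channel's band edge would be attenuated at each of the slopes named in the abstract. It assumes an idealized filter skirt that falls off linearly in dB per octave; the band edge (1000 Hz), the probe frequency (2000 Hz), and the function names are hypothetical choices for illustration, not the filters or parameters used in the study.

```python
import math

# Filter slopes (dB/octave) named in the abstract.
SLOPES_DB_PER_OCTAVE = [5, 20, 80, 120, 160]


def attenuation_db(slope_db_per_octave, freq_hz, edge_hz):
    """Attenuation (dB) of a component at freq_hz relative to a band edge.

    Assumption: an idealized skirt that is linear in log-frequency,
    so attenuation = slope * distance from the edge in octaves.
    """
    octaves_from_edge = abs(math.log2(freq_hz / edge_hz))
    return slope_db_per_octave * octaves_from_edge


def amplitude_gain(att_db):
    """Convert an attenuation in dB to a linear amplitude factor."""
    return 10 ** (-att_db / 20)


if __name__ == "__main__":
    edge_hz = 1000.0  # hypothetical channel band edge
    freq_hz = 2000.0  # hypothetical component one octave above the edge
    for slope in SLOPES_DB_PER_OCTAVE:
        att = attenuation_db(slope, freq_hz, edge_hz)
        gain = amplitude_gain(att)
        print(f"{slope:>3} dB/octave: {att:6.1f} dB down, amplitude x {gain:.2e}")
```

Under this simplification, a component one octave outside the band keeps more than half of its amplitude at 5 dB/octave but is a million times smaller at 120 dB/octave, which is one way to picture why shallow slopes let neighbouring channels overlap and smear spectral detail such as F0 cues.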