Camarena Andres, Ardis Matthew, Fujioka Takako, Fitzgerald Matthew B, Goldsworthy Raymond L
Auditory Research Center, Caruso Department of Otolaryngology - Head and Neck Surgery, Keck School of Medicine, University of Southern California, Los Angeles, California, USA.
Center for Computer Research in Music and Acoustics, Stanford University, Stanford, California, USA.
Trends Hear. 2024 Jan-Dec;28:23312165241305049. doi: 10.1177/23312165241305049.
Cochlear implant (CI) users often complain about music appreciation and speech recognition in background noise, which depend on segregating sound sources into perceptual streams. The present study examined relationships between frequency and fundamental frequency (F0) discrimination with stream segregation of tonal and speech streams for CI users and peers with no known hearing loss. Frequency and F0 discrimination were measured for 1,000 Hz pure tones and 110 Hz complex tones, respectively. Stream segregation was measured for pure and complex tones using a lead/lag delay detection task. Spondee word identification was measured in competing speech with high levels of informational masking that required listeners to use F0 to segregate speech. The hypotheses were that frequency and F0 discrimination would explain a significant portion of the variance in outcomes for tonal segregation and speech reception. On average, CI users received a large benefit for stream segregation of tonal streams when either the frequency or F0 of the competing stream was shifted relative to the target stream. A linear relationship accounted for 42% of the covariance between measures of stream segregation and complex tone discrimination for CI users. In contrast, such benefits were absent when the F0 of the competing speech was shifted relative to the target speech. The large benefit observed for tonal streams is promising for music listening if it transfers to separating instruments within a song; however, the lack of benefit for speech suggests separate mechanisms, or special requirements, for speech processing.
人工耳蜗(CI)使用者经常抱怨在欣赏音乐和在背景噪声中进行语音识别方面存在问题,这取决于将声源分离为感知流。本研究考察了CI使用者和无已知听力损失的同龄人在音调流和语音流的流分离方面,频率辨别和基频(F0)辨别之间的关系。分别对1000Hz纯音和110Hz复合音进行频率辨别和F0辨别测量。使用超前/滞后延迟检测任务对纯音和复合音进行流分离测量。在具有高水平信息掩蔽的竞争性语音中测量重读单词识别,这要求听众使用F0来分离语音。假设是频率辨别和F0辨别将解释音调分离和语音接收结果中很大一部分的方差。平均而言,当竞争流的频率或F0相对于目标流发生偏移时,CI使用者在音调流的流分离方面获得了很大的益处。线性关系解释了CI使用者流分离测量和复合音辨别之间协方差的42%。相比之下,当竞争语音的F0相对于目标语音发生偏移时,这种益处并不存在。如果将观察到的音调流的大益处应用于分离歌曲中的乐器,那么对于音乐聆听来说是很有前景的;然而,语音方面缺乏益处表明语音处理存在不同的机制或特殊要求。