Department of Speech, Language and Hearing Sciences, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA.
J Acoust Soc Am. 2019 Nov;146(5):3215. doi: 10.1121/1.5131651.
When a target talker speaks in the presence of competing talkers, the listener must not only segregate the voices but also understand the target message based on a limited set of spectrotemporal regions ("glimpses") in which the target voice dominates the acoustic mixture. Here, the hypothesis that a broad audible bandwidth is more critical for these sparse representations of speech than it is for intact speech is tested. Listeners with normal hearing were presented with sentences that were either intact, or progressively "glimpsed" according to a competing two-talker masker presented at various levels. This was achieved by using an ideal binary mask to exclude time-frequency units in the target that would be dominated by the masker in the natural mixture. In each glimpsed condition, speech intelligibility was measured for a range of low-pass conditions (cutoff frequencies from 500 to 8000 Hz). Intelligibility was poorer for sparser speech, and the bandwidth required for optimal intelligibility increased with the sparseness of the speech. The combined effects of glimpsing and bandwidth reduction were well captured by a simple metric based on the proportion of audible target glimpses retained. The findings may be relevant for understanding the impact of high-frequency hearing loss on everyday speech communication.
当目标说话人在竞争说话人的存在下讲话时,听众不仅必须将声音分开,还必须根据目标声音主导声混合的有限的时频区域(“瞥见”)来理解目标信息。在这里,测试了一个假设,即对于语音的这些稀疏表示形式,宽可听带宽比完整语音更为关键。正常听力的听众被呈现出完整的句子,或者根据以各种水平呈现的两个说话者掩蔽器进行逐渐“瞥见”。这是通过使用理想的二进制掩蔽来实现的,该掩蔽会排除在自然混合中会被掩蔽器主导的目标的时频单元。在每个瞥见条件下,对一系列低通条件(截止频率从 500 到 8000 Hz)进行了语音可懂度测量。语音越稀疏,可懂度越差,最佳可懂度所需的带宽随语音的稀疏度而增加。瞥见和带宽减少的综合影响可以通过基于保留的可听见目标瞥见的比例的简单度量来很好地捕获。这些发现可能与理解高频听力损失对日常言语交际的影响有关。