Otolaryngology-Head and Neck Surgery, Ear & Hearing, Amsterdam Public Health, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands.
Biomedical Engineering Department, Boston University, Boston, Massachusetts, United States.
Trends Hear. 2020 Jan-Dec;24:2331216520972858. doi: 10.1177/2331216520972858.
Binaural acoustic recordings were made in multiple natural environments, which were chosen to be similar to those reported to be difficult for listeners with impaired hearing. These environments include natural conversations that take place in the presence of other sound sources as found in restaurants, walking or biking in the city, and so on. Sounds from these environments were recorded binaurally with in-the-ear microphones and were analyzed with respect to speech-likeness measures and interaural difference measures. The speech-likeness measures were based on amplitude-modulation patterns within frequency bands and were estimated for 1-s time-slices. The interaural difference measures included interaural coherence, interaural time difference, and interaural level difference, which were estimated for time-slices of 20-ms duration. These binaural measures were documented for one-fourth-octave frequency bands centered at 500 Hz and for the envelopes of one-fourth-octave bands centered at 2000 Hz. For comparison purposes, the same speech-likeness and interaural difference measures were computed for a set of virtual recordings that mimic typical clinical test configurations. These virtual recordings were created by filtering anechoic waveforms with available head-related transfer functions and combining them to create multiple source combinations. Overall, the speech-likeness results show large variability within and between environments, and they demonstrate the importance of having information from both ears available. Furthermore, the interaural parameter results show that the natural recordings contain a relatively small proportion of time-slices with high coherence compared with the virtual recordings; however, when present, binaural cues might be used for selecting intervals with good speech intelligibility for individual sources.
双耳声学录音是在多种自然环境中进行的,这些环境的选择类似于那些据报道对听力受损的听众来说困难的环境。这些环境包括在餐馆等存在其他声源的自然对话、在城市中步行或骑自行车等。来自这些环境的声音通过入耳式麦克风进行双耳录音,并针对语音相似性测量和耳间差异测量进行分析。语音相似性测量基于频带内的幅度调制模式,并针对 1 秒的时间片进行估计。耳间差异测量包括耳间相干性、耳间时间差和耳间水平差,它们针对 20 毫秒持续时间的时间片进行估计。这些双耳测量值记录在以 500 Hz 为中心的四分之一倍频带和以 2000 Hz 为中心的四分之一倍频带的包络中。为了比较目的,对一组模拟典型临床测试配置的虚拟录音进行了相同的语音相似性和耳间差异测量。这些虚拟录音是通过用可用的头部相关传递函数对无回声波形进行滤波,并将它们组合创建多个源组合来创建的。总体而言,语音相似性结果显示环境内和环境之间的可变性很大,并且证明了双耳信息的可用性很重要。此外,耳间参数结果表明,与虚拟录音相比,自然录音中具有高相干性的时间片比例相对较小;然而,当存在时,双耳线索可能用于为单个源选择具有良好语音可懂度的间隔。