Evaluating the Performance of a Visually Guided Hearing Aid Using a Dynamic Auditory-Visual Word Congruence Task.

Authors

Roverud Elin, Best Virginia, Mason Christine R, Streeter Timothy, Kidd Gerald

Affiliations

From the Department of Speech, Language & Hearing Sciences, Boston University, Boston, MA.

EarLens Corporation, Menlo Park, CA.

Publication information

Ear Hear. 2018 Jul/Aug;39(4):756-769. doi: 10.1097/AUD.0000000000000532.

Abstract

OBJECTIVES

The "visually guided hearing aid" (VGHA), consisting of a beamforming microphone array steered by eye gaze, is an experimental device being tested for effectiveness in laboratory settings. Previous studies have found that beamforming without visual steering can provide significant benefits (relative to natural binaural listening) for speech identification in spatialized speech or noise maskers when sound sources are fixed in location. The aim of the present study was to evaluate the performance of the VGHA in listening conditions in which target speech could switch locations unpredictably, requiring visual steering of the beamforming. To address this aim, the present study tested an experimental simulation of the VGHA in a newly designed dynamic auditory-visual word congruence task.

DESIGN

Ten young normal-hearing (NH) and 11 young hearing-impaired (HI) adults participated. On each trial, three simultaneous spoken words were presented from three source positions (-30°, 0°, and 30° azimuth). An auditory-visual word congruence task was used in which participants indicated whether there was a match between the word printed on a screen at a location corresponding to the target source and the spoken target word presented acoustically from that location. Performance was compared for a natural binaural condition (stimuli presented using impulse responses measured on KEMAR), a simulated VGHA condition (BEAM), and a hybrid condition that combined lowpass-filtered KEMAR and highpass-filtered BEAM information (BEAMAR). In some blocks, the target remained fixed at one location across trials, and in other blocks, the target could transition in location between one trial and the next with a fixed but low probability.
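The fixed and dynamic block structure described above can be illustrated with a minimal sketch. This is not the authors' stimulus code: the function name `target_sequence`, the block length, and the switch probability of 0.2 are hypothetical, since the abstract says only that the probability was "fixed but low". Only the three azimuths come from the text.

```python
import random

POSITIONS = (-30, 0, 30)  # source azimuths in degrees, as given in the abstract


def target_sequence(n_trials, p_switch, rng):
    """Return one target azimuth per trial.

    p_switch is the probability that the target moves to a different
    location between one trial and the next (hypothetical parameter).
    """
    current = rng.choice(POSITIONS)
    sequence = []
    for _ in range(n_trials):
        sequence.append(current)
        # Between trials, move the target with probability p_switch.
        if rng.random() < p_switch:
            current = rng.choice([p for p in POSITIONS if p != current])
    return sequence


# Fixed block: the target never moves. Dynamic block: occasional transitions.
fixed_block = target_sequence(20, 0.0, random.Random(1))
dynamic_block = target_sequence(20, 0.2, random.Random(1))
```

Setting `p_switch` to 0 reproduces a fixed-location block, so the same routine covers both block types.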

RESULTS

Large individual variability in performance was observed. There were significant benefits for the hybrid BEAMAR condition relative to the KEMAR condition on average for both NH and HI groups when the targets were fixed. Although not apparent in the averaged data, some individuals showed BEAM benefits relative to KEMAR. Under dynamic conditions, BEAM and BEAMAR performance dropped significantly immediately following a target location transition. However, performance recovered by the second word in the sequence and was sustained until the next transition.
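One way to examine the drop and recovery described above is to group trial accuracy by the number of trials since the last target transition. The sketch below is an assumption about how such a tally could be made, not the analysis used in the study. The function name `accuracy_by_position` and the example data are invented for illustration and are not the study's results.

```python
def accuracy_by_position(targets, correct):
    """Proportion correct grouped by trials since the last location change.

    Position 1 is the first trial at a new location. Trials before the
    first transition are skipped.
    """
    totals = {}
    position = None
    for i in range(len(targets)):
        if i > 0 and targets[i] != targets[i - 1]:
            position = 1  # first trial after a transition
        elif position is not None:
            position += 1
        if position is None:
            continue
        hits, n = totals.get(position, (0, 0))
        totals[position] = (hits + int(correct[i]), n + 1)
    return {pos: hits / n for pos, (hits, n) in sorted(totals.items())}


# Invented example data: target azimuth and response correctness per trial.
example_targets = [0, 0, 30, 30, 30, -30, -30]
example_correct = [True, True, False, True, True, False, True]
by_position = accuracy_by_position(example_targets, example_correct)
```

In this made-up example, accuracy at position 1 is lower than at later positions, which is the kind of pattern the abstract reports for the BEAM and BEAMAR conditions.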

CONCLUSIONS

When performance was assessed using an auditory-visual word congruence task, the benefits of beamforming reported previously were generally preserved under dynamic conditions in which the target source could move unpredictably from one location to another (i.e., performance recovered rapidly following source transitions) while the observer steered the beamforming via eye gaze, for both young NH and young HI groups.

