• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

视觉引导波束形成器在动态语音任务中的益处。

The Benefit of a Visually Guided Beamformer in a Dynamic Speech Task.

机构信息

1 Department of Speech, Language and Hearing Sciences, Boston University, MA, USA.

出版信息

Trends Hear. 2017 Jan-Dec;21:2331216517722304. doi: 10.1177/2331216517722304.

DOI:10.1177/2331216517722304
PMID:28758567
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5542081/
Abstract

The aim of this study was to evaluate the performance of a visually guided hearing aid (VGHA) under conditions designed to capture some aspects of "real-world" communication settings. The VGHA uses eye gaze to steer the acoustic look direction of a highly directional beamforming microphone array. Although the VGHA has been shown to enhance speech intelligibility for fixed-location, frontal targets, it is currently not known whether these benefits persist in the face of frequent changes in location of the target talker that are typical of conversational turn-taking. Participants were 14 young adults, 7 with normal hearing and 7 with bilateral sensorineural hearing impairment. Target stimuli were sequences of 12 question-answer pairs that were embedded in a mixture of competing conversations. The participant's task was to respond via a key press after each answer indicating whether it was correct or not. Spatialization of the stimuli and microphone array processing were done offline using recorded impulse responses, before presentation over headphones. The look direction of the array was steered according to the eye movements of the participant as they followed a visual cue presented on a widescreen monitor. Performance was compared for a "dynamic" condition in which the target stimulus moved between three locations, and a "fixed" condition with a single target location. The benefits of the VGHA over natural binaural listening observed in the fixed condition were reduced in the dynamic condition, largely because visual fixation was less accurate.

摘要

本研究旨在评估视觉引导助听(VGHA)在设计用于捕捉“真实世界”通信环境某些方面的条件下的性能。VGHA 使用眼动来引导高度指向性波束形成麦克风阵列的声学注视方向。尽管已经证明 VGHA 可以提高固定位置、正面目标的语音可懂度,但目前尚不清楚在目标说话者位置频繁变化的情况下,这些益处是否仍然存在,这种位置变化是会话交替的典型特征。参与者为 14 名年轻成年人,其中 7 名听力正常,7 名双侧感音神经性听力障碍。目标刺激是由 12 个问答对组成的序列,嵌入在竞争对话的混合物中。参与者的任务是在每个答案后通过按键回答是或否,以表明答案是否正确。刺激的空间化和麦克风阵列的处理是使用录制的脉冲响应在线下完成的,然后通过耳机呈现。根据参与者在宽屏显示器上呈现的视觉提示跟踪的眼动来引导阵列的注视方向。比较了目标刺激在三个位置之间移动的“动态”条件和具有单个目标位置的“固定”条件下的性能。在固定条件下观察到的 VGHA 相对于自然双耳聆听的优势在动态条件下降低了,主要是因为视觉固定不太准确。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf6/5542081/440413cccc83/10.1177_2331216517722304-fig10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf6/5542081/a73df80dd1e1/10.1177_2331216517722304-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf6/5542081/1eb986626d58/10.1177_2331216517722304-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf6/5542081/9e8509d751d1/10.1177_2331216517722304-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf6/5542081/c84027b56799/10.1177_2331216517722304-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf6/5542081/515f18a2e8c8/10.1177_2331216517722304-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf6/5542081/a2d8ceb09bfb/10.1177_2331216517722304-fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf6/5542081/4a912420eb98/10.1177_2331216517722304-fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf6/5542081/3199c0b33ffe/10.1177_2331216517722304-fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf6/5542081/3d35688472c9/10.1177_2331216517722304-fig9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf6/5542081/440413cccc83/10.1177_2331216517722304-fig10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf6/5542081/a73df80dd1e1/10.1177_2331216517722304-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf6/5542081/1eb986626d58/10.1177_2331216517722304-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf6/5542081/9e8509d751d1/10.1177_2331216517722304-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf6/5542081/c84027b56799/10.1177_2331216517722304-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf6/5542081/515f18a2e8c8/10.1177_2331216517722304-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf6/5542081/a2d8ceb09bfb/10.1177_2331216517722304-fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf6/5542081/4a912420eb98/10.1177_2331216517722304-fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf6/5542081/3199c0b33ffe/10.1177_2331216517722304-fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf6/5542081/3d35688472c9/10.1177_2331216517722304-fig9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daf6/5542081/440413cccc83/10.1177_2331216517722304-fig10.jpg

相似文献

1
The Benefit of a Visually Guided Beamformer in a Dynamic Speech Task.视觉引导波束形成器在动态语音任务中的益处。
Trends Hear. 2017 Jan-Dec;21:2331216517722304. doi: 10.1177/2331216517722304.
2
Evaluating the Performance of a Visually Guided Hearing Aid Using a Dynamic Auditory-Visual Word Congruence Task.使用动态视听单词一致性任务评估视觉引导助听器的性能。
Ear Hear. 2018 Jul/Aug;39(4):756-769. doi: 10.1097/AUD.0000000000000532.
3
Examination of a hybrid beamformer that preserves auditory spatial cues.对一种保留听觉空间线索的混合波束形成器的检查。
J Acoust Soc Am. 2017 Oct;142(4):EL369. doi: 10.1121/1.5007279.
4
Benefits of Acoustic Beamforming for Solving the Cocktail Party Problem.声束形成在解决鸡尾酒会问题中的优势。
Trends Hear. 2015 Jun 30;19:2331216515593385. doi: 10.1177/2331216515593385.
5
How directional microphones affect speech recognition, listening effort and localisation for listeners with moderate-to-severe hearing loss.定向麦克风如何影响中重度听力损失患者的言语识别、聆听努力和定位。
Int J Audiol. 2017 Dec;56(12):909-918. doi: 10.1080/14992027.2017.1355074. Epub 2017 Jul 25.
6
Enhancing Auditory Selective Attention Using a Visually Guided Hearing Aid.使用视觉引导助听器增强听觉选择性注意
J Speech Lang Hear Res. 2017 Oct 17;60(10):3027-3038. doi: 10.1044/2017_JSLHR-H-17-0071.
7
On the Interaction of Head and Gaze Control With Acoustic Beam Width of a Simulated Beamformer in a Two-Talker Scenario.在双说话人场景中模拟波束形成器的声束宽度与头部和注视控制的相互作用。
Trends Hear. 2019 Jan-Dec;23:2331216519876795. doi: 10.1177/2331216519876795.
8
Acoustic and perceptual effects of magnifying interaural difference cues in a simulated "binaural" hearing aid.模拟“双耳”助听器中放大双耳差异线索的声学和感知效果。
Int J Audiol. 2018 Jun;57(sup3):S81-S91. doi: 10.1080/14992027.2017.1308564. Epub 2017 Apr 10.
9
The effect of hearing aid technologies on listening in an automobile.助听器技术对在汽车中聆听的影响。
J Am Acad Audiol. 2013 Jun;24(6):474-85. doi: 10.3766/jaaa.24.6.4.
10
Benefits of Beamforming With Local Spatial-Cue Preservation for Speech Localization and Segregation.具有局部空间线索保留的波束形成在语音定位和分离中的优势。
Trends Hear. 2020 Jan-Dec;24:2331216519896908. doi: 10.1177/2331216519896908.

引用本文的文献

1
An Effect of Gaze Direction in Cocktail Party Listening.鸡尾酒会效应中的注视方向影响
Trends Hear. 2023 Jan-Dec;27:23312165231152356. doi: 10.1177/23312165231152356.
2
Comparing In-ear EOG for Eye-Movement Estimation With Eye-Tracking: Accuracy, Calibration, and Speech Comprehension.比较用于眼动估计的入耳式眼电图与眼动追踪:准确性、校准和言语理解
Front Neurosci. 2022 Jun 30;16:873201. doi: 10.3389/fnins.2022.873201. eCollection 2022.
3
EEG alpha and pupil diameter reflect endogenous auditory attention switching and listening effort.

本文引用的文献

1
Enhancing Auditory Selective Attention Using a Visually Guided Hearing Aid.使用视觉引导助听器增强听觉选择性注意
J Speech Lang Hear Res. 2017 Oct 17;60(10):3027-3038. doi: 10.1044/2017_JSLHR-H-17-0071.
2
A Flexible Question-and-Answer Task for Measuring Speech Understanding.用于测量语音理解的灵活问答任务。
Trends Hear. 2016 Nov 24;20:2331216516678706. doi: 10.1177/2331216516678706.
3
A Dynamic Speech Comprehension Test for Assessing Real-World Listening Ability.一种用于评估现实世界听力能力的动态言语理解测试。
脑电图阿尔法和瞳孔直径反映了内源性听觉注意的转换和聆听努力。
Eur J Neurosci. 2022 Mar;55(5):1262-1277. doi: 10.1111/ejn.15616. Epub 2022 Feb 16.
4
Benefits of triple acoustic beamforming during speech-on-speech masking and sound localization for bilateral cochlear-implant users.双侧人工耳蜗植入者在语音掩蔽和声源定位中使用三重声束成形的益处。
J Acoust Soc Am. 2021 May;149(5):3052. doi: 10.1121/10.0003933.
5
Enhancing the perceptual segregation and localization of sound sources with a triple beamformer.使用三波束形成器增强声源的感知分离和定位
J Acoust Soc Am. 2020 Dec;148(6):3598. doi: 10.1121/10.0002779.
6
Movement and Gaze Behavior in Virtual Audiovisual Listening Environments Resembling Everyday Life.在类似日常生活的虚拟视听聆听环境中移动和注视行为。
Trends Hear. 2019 Jan-Dec;23:2331216519872362. doi: 10.1177/2331216519872362.
7
Evaluation of the Influence of Head Movement on Hearing Aid Algorithm Performance Using Acoustic Simulations.使用声学模拟评估头部运动对助听器算法性能的影响。
Trends Hear. 2020 Jan-Dec;24:2331216520916682. doi: 10.1177/2331216520916682.
8
On the Interaction of Head and Gaze Control With Acoustic Beam Width of a Simulated Beamformer in a Two-Talker Scenario.在双说话人场景中模拟波束形成器的声束宽度与头部和注视控制的相互作用。
Trends Hear. 2019 Jan-Dec;23:2331216519876795. doi: 10.1177/2331216519876795.
9
Temporal dynamics and uncertainty in binaural hearing revealed by anticipatory eye movements.双耳听觉的时间动态和不确定性通过预期的眼球运动揭示。
J Acoust Soc Am. 2019 Feb;145(2):676. doi: 10.1121/1.5088591.
10
Coherent Coding of Enhanced Interaural Cues Improves Sound Localization in Noise With Bilateral Cochlear Implants.增强的双侧人工耳蜗耳间线索相干编码可改善噪声下的声音定位。
Trends Hear. 2018 Jan-Dec;22:2331216518781746. doi: 10.1177/2331216518781746.
J Am Acad Audiol. 2016 Jul;27(7):515-26. doi: 10.3766/jaaa.15089.
4
A Binaural Steering Beamformer System for Enhancing a Moving Speech Source.一种用于增强移动语音源的双耳导向波束形成器系统。
Trends Hear. 2015 Dec 30;19:2331216515618903. doi: 10.1177/2331216515618903.
5
Comparing Binaural Pre-processing Strategies III: Speech Intelligibility of Normal-Hearing and Hearing-Impaired Listeners.双耳预处理策略比较III:正常听力和听力受损听众的言语可懂度
Trends Hear. 2015 Dec 30;19:2331216515618609. doi: 10.1177/2331216515618609.
6
Comparing Binaural Pre-processing Strategies I: Instrumental Evaluation.比较双耳预处理策略I:仪器评估。
Trends Hear. 2015 Dec 30;19:2331216515617916. doi: 10.1177/2331216515617916.
7
An evaluation of the performance of two binaural beamformers in complex and dynamic multitalker environments.在复杂动态多说话者环境中对两种双耳波束形成器性能的评估。
Int J Audiol. 2015;54(10):727-35. doi: 10.3109/14992027.2015.1059502. Epub 2015 Jul 3.
8
Benefits of Acoustic Beamforming for Solving the Cocktail Party Problem.声束形成在解决鸡尾酒会问题中的优势。
Trends Hear. 2015 Jun 30;19:2331216515593385. doi: 10.1177/2331216515593385.
9
Potential benefits and limitations of three types of directional processing in hearing aids.助听器中三种类型的定向处理的潜在益处与局限性。
Ear Hear. 2014 May-Jun;35(3):339-52. doi: 10.1097/AUD.0000000000000004.
10
Design and preliminary testing of a visually guided hearing aid.视觉引导助听器的设计与初步测试。
J Acoust Soc Am. 2013 Mar;133(3):EL202-7. doi: 10.1121/1.4791710.