• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

自旋转时混响中的语音可懂度降低。

Speech Intelligibility in Reverberation is Reduced During Self-Rotation.

机构信息

Audio Information Processing, Technical University of Munich, Munich, Germany.

出版信息

Trends Hear. 2023 Jan-Dec;27:23312165231188619. doi: 10.1177/23312165231188619.

DOI:10.1177/23312165231188619
PMID:37475460
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10363862/
Abstract

Speech intelligibility in cocktail party situations has been traditionally studied for stationary sound sources and stationary participants. Here, speech intelligibility and behavior were investigated during active self-rotation of standing participants in a spatialized speech test. We investigated if people would rotate to improve speech intelligibility, and we asked if knowing the target location would be further beneficial. Target sentences randomly appeared at one of four possible locations: 0°, ± 90°, 180° relative to the participant's initial orientation on each trial, while speech-shaped noise was presented from the front (0°). Participants responded naturally with self-rotating motion. Target sentences were presented either without (Audio-only) or with a picture of an avatar (Audio-Visual). In a baseline (Static) condition, people were standing still without visual location cues. Participants' self-orientation undershot the target location and orientations were close to acoustically optimal. Participants oriented more often in an acoustically optimal way, and speech intelligibility was higher in the Audio-Visual than in the Audio-only condition for the lateral targets. The intelligibility of the individual words in Audio-Visual and Audio-only increased during self-rotation towards the rear target, but it was reduced for the lateral targets when compared to Static, which could be mostly, but not fully, attributed to changes in spatial unmasking. Speech intelligibility prediction based on a model of static spatial unmasking considering self-rotations overestimated the participant performance by 1.4 dB. The results suggest that speech intelligibility is reduced during self-rotation, and that visual cues of location help to achieve more optimal self-rotations and better speech intelligibility.

摘要

在传统的鸡尾酒会场景下,语音可懂度通常针对静止声源和静止参与者进行研究。在这里,我们在参与者在空间化语音测试中主动自转的情况下,研究了语音可懂度和行为。我们调查了人们是否会为了提高语音可懂度而进行自转,以及如果知道目标位置是否会有进一步的帮助。在每次试验中,目标句子会随机出现在四个可能位置之一:相对于参与者初始方向的 0°、±90°、180°,而语音噪声则从前(0°)方发出。参与者自然地通过自转运动做出响应。目标句子以两种方式呈现:一种是只有音频(Audio-only),另一种是带有头像图片的音频-视觉(Audio-Visual)。在基线(Static)条件下,人们站着不动,没有视觉位置提示。参与者的自我方向与目标位置相差不远,并且接近听觉最佳位置。参与者更经常以听觉最佳的方式进行自我定向,并且在听觉-视觉条件下,侧向目标的语音可懂度高于仅音频条件。在向后方目标进行自我旋转时,音频-视觉和仅音频条件下的个别单词的可懂度增加,但与静态条件相比,侧向目标的可懂度降低,这主要归因于空间掩蔽的变化,但并非完全归因于此。基于考虑自我旋转的静态空间掩蔽模型的语音可懂度预测,将参与者的表现高估了 1.4dB。结果表明,在自我旋转过程中语音可懂度会降低,而位置的视觉提示有助于实现更理想的自我旋转和更好的语音可懂度。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74f7/10363862/48d2ba058585/10.1177_23312165231188619-fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74f7/10363862/233bde0ea1b6/10.1177_23312165231188619-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74f7/10363862/9c32d735b34d/10.1177_23312165231188619-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74f7/10363862/10026a3cc0e2/10.1177_23312165231188619-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74f7/10363862/a757e785452f/10.1177_23312165231188619-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74f7/10363862/36524e64f3fe/10.1177_23312165231188619-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74f7/10363862/48d2ba058585/10.1177_23312165231188619-fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74f7/10363862/233bde0ea1b6/10.1177_23312165231188619-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74f7/10363862/9c32d735b34d/10.1177_23312165231188619-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74f7/10363862/10026a3cc0e2/10.1177_23312165231188619-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74f7/10363862/a757e785452f/10.1177_23312165231188619-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74f7/10363862/36524e64f3fe/10.1177_23312165231188619-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/74f7/10363862/48d2ba058585/10.1177_23312165231188619-fig6.jpg

相似文献

1
Speech Intelligibility in Reverberation is Reduced During Self-Rotation.自旋转时混响中的语音可懂度降低。
Trends Hear. 2023 Jan-Dec;27:23312165231188619. doi: 10.1177/23312165231188619.
2
Audio-visual speech intelligibility benefits with bilateral cochlear implants when talker location varies.当说话者位置变化时,双侧人工耳蜗植入对视听言语可懂度的益处。
J Assoc Res Otolaryngol. 2015 Apr;16(2):309-15. doi: 10.1007/s10162-014-0503-7. Epub 2015 Jan 13.
3
The intelligibility of speech in a harmonic masker varying in fundamental frequency contour, broadband temporal envelope, and spatial location.在基频轮廓、宽带时间包络和空间位置变化的谐波掩蔽器中语音的可懂度。
Hear Res. 2017 Jul;350:1-10. doi: 10.1016/j.heares.2017.03.012. Epub 2017 Mar 29.
4
Binaural speech intelligibility in rooms with variations in spatial location of sources and modulation depth of noise interferers.双耳语音清晰度在声源空间位置变化和噪声干扰调制深度的房间中。
J Acoust Soc Am. 2013 Aug;134(2):1146-59. doi: 10.1121/1.4812248.
5
An Evaluation of Output Signal to Noise Ratio as a Predictor of Cochlear Implant Speech Intelligibility.输出信噪比评估作为人工耳蜗言语可懂度预测指标的研究。
Ear Hear. 2018 Sep/Oct;39(5):958-968. doi: 10.1097/AUD.0000000000000556.
6
Spatial hearing and speech intelligibility in bilateral cochlear implant users.双侧人工耳蜗植入使用者的空间听觉与言语可懂度
Ear Hear. 2009 Aug;30(4):419-31. doi: 10.1097/AUD.0b013e3181a165be.
7
Development of the Listening in Spatialized Noise-Sentences Test (LISN-S).空间噪声句子听力测试(LISN-S)的开发。
Ear Hear. 2007 Apr;28(2):196-211. doi: 10.1097/AUD.0b013e318031267f.
8
Binaural pre-processing for contralateral sound field attenuation can improve speech-in-noise intelligibility for bilateral hearing-aid users.用于对侧声场衰减的双耳预处理可以提高双侧助听器使用者在噪声环境下的言语清晰度。
Hear Res. 2023 May;432:108743. doi: 10.1016/j.heares.2023.108743. Epub 2023 Mar 25.
9
Measuring and modeling speech intelligibility in real and loudspeaker-based virtual sound environments.测量和建模真实和基于扬声器的虚拟声环境中的语音可懂度。
Hear Res. 2019 Jun;377:307-317. doi: 10.1016/j.heares.2019.02.003. Epub 2019 Feb 14.
10
Effects of type of early reflection, clarity of speech, reverberation and diffuse noise on the spatial perception of a speech source and its intelligibility.早期反射类型、语音清晰度、混响和扩散噪声对言语源空间感知及其可懂度的影响。
J Acoust Soc Am. 2022 May;151(5):3522. doi: 10.1121/10.0011403.

本文引用的文献

1
Signal envelope and speech intelligibility differentially impact auditory motion perception.信号包络和语音可懂度对听觉运动感知有不同的影响。
Sci Rep. 2021 Jul 23;11(1):15117. doi: 10.1038/s41598-021-94662-y.
2
Localization of tones in a room by moving listeners.通过移动听众来定位房间内的音调。
J Acoust Soc Am. 2021 Jun;149(6):4159. doi: 10.1121/10.0005045.
3
Gaze aversion in conversational settings: An investigation based on mock job interview.对话场景中的目光回避:基于模拟求职面试的调查
J Eye Mov Res. 2021 May 19;14(1). doi: 10.16910/jemr.14.1.1.
4
Using Virtual Reality to Assess Auditory Performance.使用虚拟现实技术评估听觉表现。
Hear J. 2019 Jun;72(6):20-23. doi: 10.1097/01.hj.0000558464.75151.52.
5
Objective Assessment of Speech Intelligibility in Crowded Public Spaces.客观评估嘈杂公共空间中的言语清晰度。
Ear Hear. 2020 Nov/Dec;41 Suppl 1(Suppl 1):68S-78S. doi: 10.1097/AUD.0000000000000943.
6
Review of Self-Motion in the Context of Hearing and Hearing Device Research.听觉与听觉装置研究背景下的自运动综述。
Ear Hear. 2020 Nov/Dec;41 Suppl 1:48S-55S. doi: 10.1097/AUD.0000000000000940.
7
A multimedia speech corpus for audio visual research in virtual reality (L).用于虚拟现实视听研究的多媒体语音语料库(L)
J Acoust Soc Am. 2020 Aug;148(2):492. doi: 10.1121/10.0001670.
8
Movement and Gaze Behavior in Virtual Audiovisual Listening Environments Resembling Everyday Life.在类似日常生活的虚拟视听聆听环境中移动和注视行为。
Trends Hear. 2019 Jan-Dec;23:2331216519872362. doi: 10.1177/2331216519872362.
9
On the Interaction of Head and Gaze Control With Acoustic Beam Width of a Simulated Beamformer in a Two-Talker Scenario.在双说话人场景中模拟波束形成器的声束宽度与头部和注视控制的相互作用。
Trends Hear. 2019 Jan-Dec;23:2331216519876795. doi: 10.1177/2331216519876795.
10
Spatial Release From Masking Under Different Reverberant Conditions in Young and Elderly Subjects: Effect of Moving or Stationary Maskers at Circular and Radial Conditions.在不同混响条件下年轻人和老年人的掩蔽释放空间:圆形和放射状条件下移动或固定掩蔽的影响。
J Speech Lang Hear Res. 2019 Sep 20;62(9):3582-3595. doi: 10.1044/2019_JSLHR-H-19-0092. Epub 2019 Sep 16.