• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于双耳语音可懂度和感知聆听努力度的盲预测的联合框架。

A joint framework for blind prediction of binaural speech intelligibility and perceived listening effort.

机构信息

Fraunhofer IDMT, Hearing, Speech and Audio Technology and Cluster of Excellence Hearing4all, Marie-Curie-Str. 2, 26129 Oldenburg, Germany.

Department für Medizinische Physik und Akustik, Carl von Ossietzky Universität Oldenburg and Cluster of Excellence Hearing4all, Oldenburg, Germany.

出版信息

Hear Res. 2022 Dec;426:108598. doi: 10.1016/j.heares.2022.108598. Epub 2022 Aug 8.

DOI:10.1016/j.heares.2022.108598
PMID:35995688
Abstract

Speech perception is strongly affected by noise and reverberation in the listening room, and binaural processing can substantially facilitate speech perception in conditions when target speech and maskers originate from different directions. Most studies and proposed models for predicting spatial unmasking have focused on speech intelligibility. The present study introduces a model framework that predicts both speech intelligibility and perceived listening effort from the same output measure. The framework is based on a combination of a blind binaural processing stage employing a blind equalization cancelation (EC) mechanism, and a blind backend based on phoneme probability classification. Neither frontend nor backend require any additional information, such as the source directions, the signal-to-noise ratio (SNR), or the number of sources, allowing for a fully blind perceptual assessment of binaural input signals consisting of target speech mixed with noise. The model is validated against a recent data set in which speech intelligibility and perceived listening effort were measured for a range of acoustic conditions differing in reverberation and binaural cues [Rennies and Kidd (2018), J. Acoust. Soc. Am. 144, 2147-2159]. Predictions of the proposed model are compared with a non-blind binaural model consisting of a non-blind EC stage and a backend based on the speech intelligibility index. The analyses indicated that all main trends observed in the experiments were correctly predicted by the blind model. The overall proportion of variance explained by the model (R² = 0.94) for speech intelligibility was slightly worse than for the non-blind model (R² = 0.98). For listening effort predictions, both models showed lower prediction accuracy, but still explained significant proportions of the observed variance (R² = 0.88 and R² = 0.71 for the non-blind and blind model, respectively). Closer inspection showed that the differences between data and predictions were largest for binaural conditions at high SNRs, where the perceived listening effort of human listeners tended to be underestimated by the models, specifically by the blind version.

摘要

言语感知强烈受到聆听室内噪声和混响的影响,双耳处理在目标语音和掩蔽声来自不同方向的情况下,可以显著促进言语感知。大多数用于预测空间掩蔽的研究和提出的模型都集中在言语可懂度上。本研究介绍了一种模型框架,该框架可以从相同的输出度量中预测言语可懂度和感知聆听努力度。该框架基于结合使用盲双耳处理阶段和基于盲语音概率分类的后端,该盲双耳处理阶段采用盲均衡抵消(EC)机制。前端和后端都不需要任何其他信息,例如源方向、信号噪声比(SNR)或源数量,从而可以对由与噪声混合的目标语音组成的双耳输入信号进行完全盲感知评估。该模型针对最近的一个数据集进行了验证,该数据集在不同混响和双耳线索的声环境下测量了言语可懂度和感知聆听努力度[Rennies 和 Kidd(2018),J. Acoust. Soc. Am. 144,2147-2159]。与包含盲 EC 阶段和基于言语可懂度指数的后端的非盲双耳模型相比,对所提出模型的预测进行了比较。分析表明,盲模型正确预测了实验中观察到的所有主要趋势。模型对言语可懂度的总体方差解释比例(R²=0.94)略低于非盲模型(R²=0.98)。对于聆听努力度的预测,两个模型的预测精度都较低,但仍解释了观察到的方差的很大比例(非盲模型和盲模型分别为 R²=0.88 和 R²=0.71)。更仔细的检查表明,数据和预测之间的差异在高 SNR 的双耳条件下最大,在这些条件下,模型对人类聆听者的感知聆听努力度的估计往往偏低,特别是盲模型。

相似文献

1
A joint framework for blind prediction of binaural speech intelligibility and perceived listening effort.用于双耳语音可懂度和感知聆听努力度的盲预测的联合框架。
Hear Res. 2022 Dec;426:108598. doi: 10.1016/j.heares.2022.108598. Epub 2022 Aug 8.
2
Energetic and Informational Components of Speech-on-Speech Masking in Binaural Speech Intelligibility and Perceived Listening Effort.双耳语音可懂度和感知聆听努力中语音掩蔽的能量和信息成分。
Trends Hear. 2019 Jan-Dec;23:2331216519854597. doi: 10.1177/2331216519854597.
3
Modeling Binaural Unmasking of Speech Using a Blind Binaural Processing Stage.使用盲听处理阶段对语音的双耳掩蔽进行建模。
Trends Hear. 2020 Jan-Dec;24:2331216520975630. doi: 10.1177/2331216520975630.
4
Prediction of the influence of reverberation on binaural speech intelligibility in noise and in quiet.预测混响对噪声和安静环境下双耳语音可懂度的影响。
J Acoust Soc Am. 2011 Nov;130(5):2999-3012. doi: 10.1121/1.3641368.
5
Benefit of binaural listening as revealed by speech intelligibility and listening effort.双耳聆听在言语可懂度和聆听努力方面的益处。
J Acoust Soc Am. 2018 Oct;144(4):2147. doi: 10.1121/1.5057114.
6
The effect of audiovisual and binaural listening on the acceptable noise level (ANL): establishing an ANL conceptual model.视听和双耳聆听对可接受噪声水平(ANL)的影响:建立ANL概念模型。
J Am Acad Audiol. 2014 Feb;25(2):141-53. doi: 10.3766/jaaa.25.2.3.
7
Speech intelligibility prediction in reverberation: Towards an integrated model of speech transmission, spatial unmasking, and binaural de-reverberation.混响环境下的语音可懂度预测:迈向语音传输、空间掩蔽和双耳去混响的集成模型
J Acoust Soc Am. 2015 Jun;137(6):3335-45. doi: 10.1121/1.4921028.
8
Binaural prediction of speech intelligibility in reverberant rooms with multiple noise sources.双耳预测混响房间多噪声源下的语音可懂度。
J Acoust Soc Am. 2012 Jan;131(1):218-31. doi: 10.1121/1.3662075.
9
Perceived listening effort and speech intelligibility in reverberation and noise for hearing-impaired listeners.听力受损听众在混响和噪声环境中的感知聆听努力与言语可懂度
Int J Audiol. 2016 Dec;55(12):738-747. doi: 10.1080/14992027.2016.1219774. Epub 2016 Sep 14.
10
Modelling binaural unmasking and the intelligibility of speech in noise and reverberation for normal-hearing and hearing-impaired listeners.为正常听力和听力障碍的听众建立双耳掩蔽模型和噪声与混响环境下言语可懂度模型。
J Acoust Soc Am. 2021 Nov;150(5):3275. doi: 10.1121/10.0006736.