Predicting the intelligibility of Mandarin Chinese with manipulated and intact tonal information for normal-hearing listeners.

Affiliations

Key Laboratory of Noise and Vibration Research, Institute of Acoustics, Chinese Academy of Sciences, 100190, Beijing, China.

Cambridge Hearing Group, Department of Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, United Kingdom.

Publication information

J Acoust Soc Am. 2024 Nov 1;156(5):3088-3101. doi: 10.1121/10.0034233.

DOI: 10.1121/10.0034233
PMID: 39509085

Abstract

Objective indices for predicting speech intelligibility offer a quick and convenient alternative to behavioral measures of speech intelligibility. However, most such indices are designed for a specific language, such as English, and they do not take adequate account of tonal information in speech when applied to languages like Mandarin Chinese (hereafter called Mandarin) for which the patterns of fundamental frequency (F0) variation play an important role in distinguishing speech sounds with similar phonetic content. To address this, two experiments with normal-hearing listeners were conducted examining: (1) The impact of manipulations of tonal information on the intelligibility of Mandarin sentences presented in speech-shaped noise (SSN) at several signal-to-noise ratios (SNRs); (2) The intelligibility of Mandarin sentences with intact tonal information presented in SSN, pink noise, and babble at several SNRs. The outcomes were not correctly predicted by the Hearing Aid Speech Perception Index (HASPI-V1). A new intelligibility metric was developed that used one acoustic feature from HASPI-V1 plus Hilbert time envelope and temporal fine structure information from multiple frequency bands. For the new metric, the Pearson correlation between obtained and predicted intelligibility was 0.923 and the root mean square error was 0.119. The new metric provides a potential tool for evaluating Mandarin intelligibility.
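The abstract names two signal features (Hilbert time envelope and temporal fine structure) and two evaluation statistics (Pearson correlation and root mean square error) without giving formulas. The sketch below illustrates the standard textbook definitions of each, assuming NumPy is available. It is not the authors' metric: the filterbank, the HASPI-V1 feature, and the way the features are combined are not described in the abstract and are omitted here. All function names and the test signal are hypothetical.

```python
import numpy as np

def analytic_signal(x):
    """Analytic signal of a real 1-D array, computed with the FFT method."""
    x = np.asarray(x, dtype=float)
    n = x.size
    spectrum = np.fft.fft(x)
    # Keep DC (and Nyquist for even n), double positive frequencies,
    # and zero out negative frequencies.
    weights = np.zeros(n)
    weights[0] = 1.0
    if n % 2 == 0:
        weights[n // 2] = 1.0
        weights[1:n // 2] = 2.0
    else:
        weights[1:(n + 1) // 2] = 2.0
    return np.fft.ifft(spectrum * weights)

def envelope_and_tfs(band_signal):
    """Split one band-limited signal into Hilbert envelope and fine structure."""
    z = analytic_signal(band_signal)
    envelope = np.abs(z)        # slowly varying amplitude
    tfs = np.cos(np.angle(z))   # unit-amplitude carrier (fine structure)
    return envelope, tfs

def pearson_r(obtained, predicted):
    """Pearson correlation between obtained and predicted scores."""
    o = np.asarray(obtained, dtype=float)
    p = np.asarray(predicted, dtype=float)
    o = o - o.mean()
    p = p - p.mean()
    return float(np.sum(o * p) / np.sqrt(np.sum(o ** 2) * np.sum(p ** 2)))

def rmse(obtained, predicted):
    """Root mean square error between obtained and predicted scores."""
    o = np.asarray(obtained, dtype=float)
    p = np.asarray(predicted, dtype=float)
    return float(np.sqrt(np.mean((o - p) ** 2)))

# Hypothetical test signal: a 500 Hz carrier with 10 Hz amplitude modulation,
# standing in for the output of a single analysis band.
fs = 16000
t = np.arange(1600) / fs
modulator = 1.0 + 0.5 * np.cos(2 * np.pi * 10 * t)
carrier = np.cos(2 * np.pi * 500 * t)
env, tfs = envelope_and_tfs(modulator * carrier)
```

For this test signal the recovered envelope matches the 10 Hz modulator and the fine structure matches the 500 Hz carrier. In a multi-band analysis of the kind the abstract describes, the same decomposition would be applied to each band after filtering.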
