• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

模拟混响条件下声学语音质量测量的敏感性

Sensitivity of Acoustic Voice Quality Measures in Simulated Reverberation Conditions.

作者信息

Yousef Ahmed M, Hunter Eric J

机构信息

Department of Communication Sciences and Disorders, University of Iowa, Iowa City, IA 52242, USA.

出版信息

Bioengineering (Basel). 2024 Dec 11;11(12):1253. doi: 10.3390/bioengineering11121253.

DOI:10.3390/bioengineering11121253
PMID:39768071
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11673399/
Abstract

Room reverberation can affect oral/aural communication and is especially critical in computer analysis of voice. High levels of reverberation can distort voice recordings, impacting the accuracy of quantifying voice production quality and vocal health evaluations. This study quantifies the impact of additive simulated reverberation on otherwise clean voice recordings as reflected in voice metrics commonly used for voice quality evaluation. From a larger database of voice recordings collected in a low-noise, low-reverberation environment, voice samples of a sustained [a:] vowel produced at two different speaker intents (comfortable and clear) by five healthy voice college-age female native English speakers were used. Using the reverb effect in Audacity, eight reverberation situations indicating a range of reverberation times (T20 between 0.004 and 1.82 s) were simulated and convolved with the original recordings. All voice samples, both original and reverberation-affected, were analyzed using freely available PRAAT software (version 6.0.13) to calculate five common voice parameters: jitter, shimmer, harmonic-to-noise ratio (HNR), alpha ratio, and smoothed cepstral peak prominence (CPPs). Statistical analyses assessed the sensitivity and variations in voice metrics to a range of simulated room reverberation conditions. Results showed that jitter, HNR, and alpha ratio were stable at simulated reverberation times below T20 of 1 s, with HNR and jitter more stable in the clear vocal style. Shimmer was highly sensitive even at T20 of 0.53 s, which would reflect a common room, while CPPs remained stable across all simulated reverberation conditions. Understanding the sensitivity and stability of these voice metrics to a range of room acoustics effects allows for targeted use of certain metrics even in less controlled environments, enabling selective application of stable measures like CPPs and cautious interpretation of shimmer, ensuring more reliable and accurate voice assessments.

摘要

房间混响会影响口头/听觉交流,在语音的计算机分析中尤为关键。高水平的混响会使语音录音失真,影响量化语音产生质量和嗓音健康评估的准确性。本研究量化了加性模拟混响对原本清晰的语音录音的影响,这在常用于语音质量评估的语音指标中有所体现。从在低噪声、低混响环境中收集的更大语音录音数据库中,选取了五名以英语为母语、处于大学年龄的健康女性说话者在两种不同说话意图(舒适和清晰)下发出的持续[a:]元音的语音样本。利用Audacity中的混响效果,模拟了八种表示不同混响时间范围(T20在0.004秒至1.82秒之间)的混响情况,并将其与原始录音进行卷积。使用免费的PRAAT软件(版本6.0.13)对所有语音样本(原始样本和受混响影响的样本)进行分析,以计算五个常见的语音参数:抖动、闪烁、谐波噪声比(HNR)、阿尔法比率和平滑的谐波峰值突出度(CPPs)。统计分析评估了语音指标对一系列模拟房间混响条件的敏感性和变化情况。结果表明,在模拟混响时间低于T20为1秒时,抖动、HNR和阿尔法比率较为稳定,在清晰的发声风格中HNR和抖动更稳定。即使在T20为0.53秒(这反映的是常见房间)时,闪烁也高度敏感,而CPPs在所有模拟混响条件下都保持稳定。了解这些语音指标对一系列房间声学效果的敏感性和稳定性,即使在控制较差的环境中也能有针对性地使用某些指标,从而能够选择性地应用像CPPs这样稳定的测量方法,并谨慎解读闪烁情况,确保进行更可靠、准确的语音评估。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3145/11673399/0c922ed045d0/bioengineering-11-01253-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3145/11673399/5e109b22feec/bioengineering-11-01253-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3145/11673399/d816be6e370f/bioengineering-11-01253-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3145/11673399/3806e4877fc1/bioengineering-11-01253-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3145/11673399/c2ca94018f2c/bioengineering-11-01253-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3145/11673399/3d0a5944ee14/bioengineering-11-01253-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3145/11673399/0c922ed045d0/bioengineering-11-01253-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3145/11673399/5e109b22feec/bioengineering-11-01253-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3145/11673399/d816be6e370f/bioengineering-11-01253-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3145/11673399/3806e4877fc1/bioengineering-11-01253-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3145/11673399/c2ca94018f2c/bioengineering-11-01253-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3145/11673399/3d0a5944ee14/bioengineering-11-01253-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3145/11673399/0c922ed045d0/bioengineering-11-01253-g006.jpg

相似文献

1
Sensitivity of Acoustic Voice Quality Measures in Simulated Reverberation Conditions.模拟混响条件下声学语音质量测量的敏感性
Bioengineering (Basel). 2024 Dec 11;11(12):1253. doi: 10.3390/bioengineering11121253.
2
An Assessment of Different Praat Versions for Acoustic Measures Analyzed Automatically by VoiceEvalU8 and Manually by Two Raters.不同 Praat 版本的评估,这些版本的声学测量由 VoiceEvalU8 自动分析,由两位评分者手动分析。
J Voice. 2023 Jan;37(1):17-25. doi: 10.1016/j.jvoice.2020.12.003. Epub 2020 Dec 29.
3
Evaluation of Acoustic Analyses of Voice in Nonoptimized Conditions.非优化条件下嗓音声学分析评估。
J Speech Lang Hear Res. 2020 Dec 14;63(12):3991-3999. doi: 10.1044/2020_JSLHR-20-00212. Epub 2020 Nov 13.
4
Acoustic Effects of Vocal Warm-Up: A 7-Week Longitudinal Case Study.发声热身的声学效果:一项为期 7 周的纵向案例研究。
J Voice. 2024 Mar;38(2):458-465. doi: 10.1016/j.jvoice.2021.09.030. Epub 2021 Nov 26.
5
Voice disorder discrimination using vowel acoustic measures in female speakers.基于元音声学特征的女性嗓音障碍判别。
Int J Lang Commun Disord. 2024 Sep-Oct;59(5):2087-2102. doi: 10.1111/1460-6984.13081. Epub 2024 Jun 17.
6
Acoustic Perturbation Measures Improve with Increasing Vocal Intensity in Individuals With and Without Voice Disorders.无论是否患有嗓音障碍,随着嗓音强度增加,声学微扰指标均有所改善。
J Voice. 2018 Mar;32(2):162-168. doi: 10.1016/j.jvoice.2017.04.008. Epub 2017 May 18.
7
Predicting Voice Disorder Status From Smoothed Measures of Cepstral Peak Prominence Using Praat and Analysis of Dysphonia in Speech and Voice (ADSV).使用Praat通过平滑的谐波峰值突出度测量以及言语和嗓音中的发声障碍分析(ADSV)预测嗓音障碍状态。
J Voice. 2017 Sep;31(5):557-566. doi: 10.1016/j.jvoice.2017.01.006. Epub 2017 Feb 4.
8
Effects of Fundamental Frequency, Vocal Intensity, Sample Duration, and Vowel Context in Cepstral and Spectral Measures of Dysphonic Voices.基频、发声强度、样本时长及元音语境对嗓音障碍语音的倒谱和频谱测量的影响
J Speech Lang Hear Res. 2020 May 22;63(5):1326-1339. doi: 10.1044/2020_JSLHR-19-00049. Epub 2020 Apr 29.
9
Cepstral analysis of hypokinetic and ataxic voices: correlations with perceptual and other acoustic measures.运动减退性和共济失调性嗓音的谐波倒谱分析:与感知及其他声学指标的相关性
J Voice. 2014 Nov;28(6):673-80. doi: 10.1016/j.jvoice.2014.01.013. Epub 2014 May 16.
10
Can acoustic measurements predict gender perception in the voice?声学测量能否预测语音中的性别感知?
PLoS One. 2024 Nov 14;19(11):e0310794. doi: 10.1371/journal.pone.0310794. eCollection 2024.

本文引用的文献

1
Within-talker and within-session stability of acoustic characteristics of conversational and clear speaking stylesa).会话者内和会话内稳定的会话和清晰说话风格的声学特征 a)。
J Acoust Soc Am. 2024 Jan 1;155(1):44-55. doi: 10.1121/10.0024241.
2
Deep Learning-Based Analysis of Glottal Attack and Offset Times in Adductor Laryngeal Dystonia.基于深度学习的内收型喉肌张力障碍中声门起音和终止时间分析
J Voice. 2023 Nov 15. doi: 10.1016/j.jvoice.2023.10.011.
3
Advances in Clinical Voice Quality Analysis with VOXplot.使用VOXplot进行临床嗓音质量分析的进展
J Clin Med. 2023 Jul 12;12(14):4644. doi: 10.3390/jcm12144644.
4
Variation of the acoustic parameters: f, jitter, shimmer and alpha ratio in relation with different background noise levels.声学参数(f、抖动、闪烁和阿尔法比率)随不同背景噪声水平的变化情况。
Acta Otorrinolaringol Esp (Engl Ed). 2023 Jul-Aug;74(4):219-225. doi: 10.1016/j.otoeng.2022.10.004. Epub 2022 Oct 17.
5
Deep-Learning-Based Representation of Vocal Fold Dynamics in Adductor Spasmodic Dysphonia during Connected Speech in High-Speed Videoendoscopy.高速视频内镜检查中内收型痉挛性发声障碍患者连贯言语时基于深度学习的声带动力学表现
J Voice. 2025 Mar;39(2):570.e1-570.e15. doi: 10.1016/j.jvoice.2022.08.022. Epub 2022 Sep 23.
6
Meta-Analysis on the Validity of the Acoustic Voice Quality Index.基于声学嗓音质量指数的有效性的元分析
J Voice. 2024 Nov;38(6):1527.e1-1527.e19. doi: 10.1016/j.jvoice.2022.04.022. Epub 2022 Jun 23.
7
Acoustic Voice Quality Index (AVQI) in the Measurement of Voice Quality: A Systematic Review and Meta-Analysis.嗓音声学质量指数(AVQI)在嗓音质量测量中的应用:系统评价和荟萃分析。
J Voice. 2024 Sep;38(5):1055-1069. doi: 10.1016/j.jvoice.2022.03.018. Epub 2022 Apr 20.
8
The Acoustic Voice Quality Index (AVQI) in People with Parkinson's Disease Before and After Intensive Voice and Articulation Therapies: Secondary Outcome of a Randomized Controlled Trial.帕金森病患者在接受强化嗓音和发音治疗前后的声学语音质量指数(AVQI):一项随机对照试验的次要结局。
J Voice. 2024 Nov;38(6):1529.e7-1529.e16. doi: 10.1016/j.jvoice.2022.03.014. Epub 2022 Apr 18.
9
Detection of Vocal Fold Image Obstructions in High-Speed Videoendoscopy During Connected Speech in Adductor Spasmodic Dysphonia: A Convolutional Neural Networks Approach.基于卷积神经网络的痉挛性发声障碍患者连接性言语时高速视频内镜下声带图像遮挡的检测。
J Voice. 2024 Jul;38(4):951-962. doi: 10.1016/j.jvoice.2022.01.028. Epub 2022 Mar 16.
10
A Semiautomated Protocol Towards Quantifying Vocal Effort in Relation to Vocal Performance During a Vocal Loading Task.一种半自动协议,用于在声乐加载任务期间量化与声乐表现相关的发声努力。
J Voice. 2024 Jul;38(4):876-888. doi: 10.1016/j.jvoice.2022.01.003. Epub 2022 Feb 12.