使用基于听觉时间包络处理的生物启发计算模型预测感知声音粗糙度。

Predicting Perceived Vocal Roughness Using a Bio-Inspired Computational Model of Auditory Temporal Envelope Processing.

机构信息

Department of Communication Sciences and Disorders, University of South Florida, Tampa.

Office of the Provost & Executive Vice President, Indiana University Bloomington.

出版信息

J Speech Lang Hear Res. 2022 Aug 17;65(8):2748-2758. doi: 10.1044/2022_JSLHR-22-00101. Epub 2022 Jul 22.

DOI:10.1044/2022_JSLHR-22-00101

PMID:35867607

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9911094/

Abstract

PURPOSE

Vocal roughness is often present in many voice disorders but the assessment of roughness mainly depends on the subjective auditory-perceptual evaluation and lacks acoustic correlates. This study aimed to apply the concept of roughness in general sound quality perception to vocal roughness assessment and to characterize the relationship between vocal roughness and temporal envelop fluctuation measures obtained from an auditory model.

METHOD

Ten /ɑ/ recordings with a wide range of roughness were selected from an existing database. Ten listeners rated the roughness of the recordings in a single-variable matching task. Temporal envelope fluctuations of the recordings were analyzed with an auditory processing model of amplitude modulation that utilizes a modulation filterbank of different modulation frequencies. Pitch strength and the smoothed cepstral peak prominence were also obtained for comparison.

RESULTS

Individual simple regression models yielded envelope standard deviation from a modulation filter with a low center frequency (64.3 Hz) as a statistically significant predictor of vocal roughness with a strong coefficient of determination ( = .80). Pitch strength and CPPS were not significant predictors of roughness.

CONCLUSION

This result supports the possible utility of envelope fluctuation measures from an auditory model as objective correlates of vocal roughness.

摘要

目的

嗓音粗糙常常存在于许多嗓音障碍中，但粗糙的评估主要依赖于主观的听觉感知评估，缺乏声学相关性。本研究旨在将一般性音质感知中的粗糙概念应用于嗓音粗糙评估，并描述嗓音粗糙与从听觉模型获得的时域包络波动测量值之间的关系。

方法

从现有的数据库中选择了 10 个具有不同粗糙程度的/a/音记录。10 位听众在单变量匹配任务中对这些记录的粗糙程度进行了评分。利用调制滤波器组的不同调制频率的幅度调制听觉处理模型对记录的时域包络波动进行了分析。还获得了基音强度和频谱峰值凸显度用于比较。

结果

个体简单回归模型表明，来自调制滤波器的中心频率较低（64.3 Hz）的包络标准差是嗓音粗糙的一个统计学上显著的预测因子，具有很强的决定系数（r²=.80）。基音强度和 CPPS 不是粗糙的显著预测因子。

结论

该结果支持了听觉模型的包络波动测量值作为嗓音粗糙的客观相关物的可能用途。

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

使用基于听觉时间包络处理的生物启发计算模型预测感知声音粗糙度。

Predicting Perceived Vocal Roughness Using a Bio-Inspired Computational Model of Auditory Temporal Envelope Processing.

机构信息

出版信息

PURPOSE

METHOD

RESULTS

CONCLUSION

目的

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

使用基于听觉时间包络处理的生物启发计算模型预测感知声音粗糙度。

Predicting Perceived Vocal Roughness Using a Bio-Inspired Computational Model of Auditory Temporal Envelope Processing.

机构信息

出版信息

PURPOSE

METHOD

RESULTS

CONCLUSION

目的

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献