Acoustic estimation of voice roughness.

Author information

Anikin Andrey

Affiliation

Division of Cognitive Science, Department of Philosophy, Lund University, Box 192, SE-221 00, Lund, Sweden.

Publication information

Atten Percept Psychophys. 2025 Apr 28. doi: 10.3758/s13414-025-03060-3.

DOI:10.3758/s13414-025-03060-3

PMID:40295423

Abstract

Roughness is a perceptual characteristic of sound that was first applied to musical consonance and dissonance, but it is increasingly recognized as a central aspect of voice quality in human and animal communication. It may be particularly important for asserting social dominance or attracting attention in urgent signals such as screams. To ensure that the results of roughness research are valid and consistent across studies, we need a standard methodology for measuring it. I review the literature on roughness estimation, from classic psychoacoustics to more recent approaches, and present two collections of 602 human vocal samples whose roughness was rated by 162 listeners in perceptual experiments. Two algorithms for estimating roughness acoustically from modulation spectra are then presented and optimized to match the human ratings. One uses a bank of gammatone or Butterworth filters to obtain an auditory spectrogram, and a faster algorithm begins with a conventional spectrogram obtained with the short-time Fourier transform; both explain ~50% of variance in average human ratings per stimulus. The range of modulation frequencies most relevant to roughness perception is [50, 200] Hz; this range can be selected with simple cutoff points or with a lognormal weighting function. Modulation and roughness spectrograms are proposed as visual aids for studying the dynamics of roughness in longer recordings. The described algorithms are implemented in the function modulationSpectrum() from the open-source R library soundgen. The audio recordings and their ratings are freely available from https://osf.io/gvcpx/ and can be used for benchmarking other algorithms.
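
The two ways of selecting the [50, 200] Hz modulation range mentioned in the abstract (hard cutoffs versus a lognormal weighting) can be illustrated with a minimal Python sketch. This is not the soundgen implementation and does not reproduce modulationSpectrum(): it skips the spectrogram stage entirely, takes a crude envelope by full-wave rectification, and measures how much of the envelope's modulation energy falls in the target range. All function names, the 100 Hz peak, the width `sigma = 0.5`, and the 400 Hz upper limit are assumptions chosen for illustration, and the weighting is a Gaussian on a log-frequency axis rather than a fitted lognormal function.

```python
import cmath
import math


def am_tone(mod_hz, fs=8000, dur=0.25, carrier_hz=1000.0, depth=1.0):
    """Synthetic test signal: a sine carrier with sinusoidal amplitude modulation."""
    n = int(fs * dur)
    return [
        (1 + depth * math.cos(2 * math.pi * mod_hz * i / fs))
        * math.cos(2 * math.pi * carrier_hz * i / fs)
        for i in range(n)
    ]


def modulation_energy(signal, fs, max_mod_hz=400.0):
    """Map modulation frequency (Hz) -> energy of the rectified signal's DFT.

    Assumption: full-wave rectification stands in for a proper envelope or
    spectrogram-based modulation spectrum. The DC bin is skipped.
    """
    n = len(signal)
    envelope = [abs(x) for x in signal]
    step = fs / n  # frequency resolution in Hz
    n_bins = int(max_mod_hz / step)
    spectrum = {}
    for k in range(1, n_bins + 1):
        acc = sum(
            e * cmath.exp(-2j * math.pi * k * i / n)
            for i, e in enumerate(envelope)
        )
        spectrum[k * step] = abs(acc) ** 2
    return spectrum


def roughness_cutoff(spectrum, low=50.0, high=200.0):
    """Share of modulation energy between two hard cutoff points."""
    total = sum(spectrum.values())
    inside = sum(e for f, e in spectrum.items() if low <= f <= high)
    return inside / total if total > 0 else 0.0


def roughness_lognormal(spectrum, peak_hz=100.0, sigma=0.5):
    """Share of modulation energy under a bell-shaped weight on log-frequency.

    peak_hz and sigma are hypothetical values, not parameters from the paper.
    """
    total = sum(spectrum.values())
    weighted = sum(
        e * math.exp(-0.5 * ((math.log(f) - math.log(peak_hz)) / sigma) ** 2)
        for f, e in spectrum.items()
    )
    return weighted / total if total > 0 else 0.0


fs = 8000
fast = modulation_energy(am_tone(80.0, fs=fs), fs)  # 80 Hz modulation
slow = modulation_energy(am_tone(8.0, fs=fs), fs)   # 8 Hz modulation
print("cutoff:   ", roughness_cutoff(fast), roughness_cutoff(slow))
print("lognormal:", roughness_lognormal(fast), roughness_lognormal(slow))
```

In this toy setup the tone modulated at 80 Hz scores high on both measures, while the tone modulated at 8 Hz scores close to zero. The hard cutoff treats every frequency inside the range equally; the smooth weighting instead discounts modulation energy gradually as it moves away from the assumed peak.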
