语音声学分析中的低通滤波效果。

Effects of low-pass filtering on acoustic analysis of voice.

机构信息

Department of Surgery, Division of Otolaryngology - Head and Neck Surgery, Medical Sciences Center, University of Wisconsin School of Medicine and Public Health, Madison, Wisconsin 53706-1532, USA.

出版信息

J Voice. 2011 Jan;25(1):15-20. doi: 10.1016/j.jvoice.2009.08.004. Epub 2010 Mar 25.

DOI:10.1016/j.jvoice.2009.08.004

PMID:20346621

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3018530/

Abstract

OBJECTIVE/HYPOTHESIS: Low-pass filtering is often applied to eliminate effects of environmental noise when preparing voice recordings for acoustic analysis. This study tested the effects of low-pass filter cutoff frequency on the results of acoustic voice analysis, with a particular interest in the effects of low cutoff frequencies on nonlinear dynamic parameters.

STUDY DESIGN

A crossover randomized controlled trial was performed using voice recordings of sustained vowel phonation obtained from the Disordered Voice Database.

METHODS

A second-order Butterworth filter was applied to the voices at cutoff frequencies ranging from 5000 to 40 Hz. Percent jitter, percent shimmer, fundamental frequency (F(0)), signal-to-noise ratio (SNR), correlation dimension (D(2)), and second-order entropy (K(2)) were calculated for each signal.

RESULTS

Traditional acoustic parameters were validly measured at cutoff frequencies as low as 300 Hz. The SNR and percent shimmer were improved by cutoff frequencies of 300 Hz or higher; F(0) and percent jitter were unaffected by filtering at these frequencies. D(2) and K(2) were measured stably for signals filtered at cutoff frequencies as low as 100 Hz.

CONCLUSION

To ensure accuracy in acoustic voice analysis, setting the cutoff frequency of a low-pass filter at least one octave above the F(0) (minimum of 300 Hz) is recommended. Nonlinear dynamic measures of D(2) and K(2) proved more robust and maintained accuracy at lower frequencies.

摘要

目的/假设：在为声学分析准备语音记录时，通常会应用低通滤波来消除环境噪声的影响。本研究测试了低通滤波器截止频率对声学语音分析结果的影响，特别关注低截止频率对非线性动态参数的影响。

研究设计

使用来自 Disorder Voice Database 的持续元音发声的语音记录，进行了一项交叉随机对照试验。

方法

使用二阶巴特沃斯滤波器对声音进行处理，截止频率范围从 5000 到 40 Hz。为每个信号计算抖动百分比、颤抖百分比、基频 (F(0))、信噪比 (SNR)、关联维数 (D(2)) 和二阶熵 (K(2))。

结果

在截止频率低至 300 Hz 的情况下，可以有效地测量传统声学参数。SNR 和颤抖百分比在 300 Hz 或更高的截止频率下得到改善；在这些频率下滤波对 F(0)和抖动百分比没有影响。在截止频率低至 100 Hz 的情况下，D(2)和 K(2)可以稳定地测量。

结论

为了确保声学语音分析的准确性，建议将低通滤波器的截止频率设置在 F(0)以上一个八度（最低 300 Hz）。非线性动态测量的 D(2)和 K(2)更具鲁棒性，并在较低频率下保持准确性。

相似文献

Effects of low-pass filtering on acoustic analysis of voice.

J Voice. 2011 Jan;25(1):15-20. doi: 10.1016/j.jvoice.2009.08.004. Epub 2010 Mar 25.

Do the Nonlinear Dynamic Acoustic Measurements, Nonlinear Energy Difference Ratio and Spectrum Convergence Ratio, Correlate with Perceptual Evaluation of Esophageal Voice Speakers?

J Voice. 2024 Nov;38(6):1278-1287. doi: 10.1016/j.jvoice.2022.06.004. Epub 2022 Jul 9.

Acoustic analysis of aperiodic voice: perturbation and nonlinear dynamic properties in esophageal phonation.

J Voice. 2009 May;23(3):283-90. doi: 10.1016/j.jvoice.2007.10.004. Epub 2008 Apr 14.

Quality of Voice in Patients With Partial Deafness Before and After Cochlear Implantation.

J Voice. 2024 Nov;38(6):1531.e5-1531.e11. doi: 10.1016/j.jvoice.2022.05.005. Epub 2022 Jun 3.

Vowel selection and its effects on perturbation and nonlinear dynamic measures.

Folia Phoniatr Logop. 2011;63(2):88-97. doi: 10.1159/000319786. Epub 2010 Oct 8.

Comparison of Acoustic Voice Features Derived From Mobile Devices and Studio Microphone Recordings.

J Voice. 2025 Mar;39(2):559.e1-559.e18. doi: 10.1016/j.jvoice.2022.10.006. Epub 2022 Nov 12.

Survey of Voice Acoustic Parameters in Iranian Female Teachers.

J Voice. 2016 Jul;30(4):507.e1-5. doi: 10.1016/j.jvoice.2015.05.020. Epub 2015 Aug 12.

Reliable jitter and shimmer measurements in voice clinics: the relevance of vowel, gender, vocal intensity, and fundamental frequency effects in a typical clinical task.

J Voice. 2011 Jan;25(1):44-53. doi: 10.1016/j.jvoice.2009.07.002. Epub 2010 Apr 8.

Materials of acoustic analysis: sustained vowel versus sentence.

J Voice. 2012 Sep;26(5):563-5. doi: 10.1016/j.jvoice.2011.09.007. Epub 2012 Apr 18.

A Comparison of Healthy and Disordered Voices Using Multi-Dimensional Voice Program, Praat, and TF32.

J Voice. 2024 Jul;38(4):963.e23-963.e38. doi: 10.1016/j.jvoice.2022.01.010. Epub 2022 Mar 1.

引用本文的文献

A novel speech analysis algorithm to detect cognitive impairment in a Spanish population.

Front Neurol. 2024 Apr 4;15:1342907. doi: 10.3389/fneur.2024.1342907. eCollection 2024.

[Current methods of acoustic analysis of voice: a review].

Lin Chuang Er Bi Yan Hou Tou Jing Wai Ke Za Zhi. 2022 Dec;36(12):966-970;976. doi: 10.13201/j.issn.2096-7993.2022.12.016.

Probability-Based Best Sample Selection for Acoustic Analysis of Normal and Disordered Voices.

J Voice. 2022 Jan;36(1):21-26. doi: 10.1016/j.jvoice.2020.03.011. Epub 2020 May 29.

Audiovisual integration of emotional signals from others' social interactions.

Front Psychol. 2015 May 8;9:116. doi: 10.3389/fpsyg.2015.00611. eCollection 2015.

The effect of segment selection on acoustic analysis.

J Voice. 2012 Jan;26(1):1-7. doi: 10.1016/j.jvoice.2010.10.009. Epub 2011 Sep 1.

Objective methods of sample selection in acoustic analysis of voice.

Ann Otol Rhinol Laryngol. 2011 Mar;120(3):155-61. doi: 10.1177/000348941112000303.

本文引用的文献

Adaptive usage of the Butterworth digital filter.

J Biomech. 2007;40(13):2934-43. doi: 10.1016/j.jbiomech.2007.02.019. Epub 2007 Apr 17.

Vocal tremor and vibrato in the same person: acoustic and electromyographic differences.

J Voice. 2008 Sep;22(5):541-5. doi: 10.1016/j.jvoice.2006.12.001. Epub 2007 Feb 5.

Source-filter comparison of measurements of fundamental frequency perturbation and amplitude perturbation for synthesized voice signals.

J Voice. 2008 Mar;22(2):125-37. doi: 10.1016/j.jvoice.2006.09.007. Epub 2006 Dec 4.

Acoustic analysis of vowels following glossectomy.

Clin Linguist Phon. 2006 Apr-May;20(2-3):135-40. doi: 10.1080/02699200400026694.

Adverse effects of environmental noise on acoustic voice quality measurements.

J Voice. 2005 Mar;19(1):15-28. doi: 10.1016/j.jvoice.2004.07.003.

Nonlinear dynamics of phonations in excised larynx experiments.

J Acoust Soc Am. 2003 Oct;114(4 Pt 1):2198-205. doi: 10.1121/1.1610462.

The effect of noise on computer-aided measures of voice: a comparison of CSpeechSP and the Multi-Dimensional Voice Program software using the CSL 4300B Module and Multi-Speech for Windows.

J Voice. 2003 Mar;17(1):12-20. doi: 10.1016/s0892-1997(03)00031-6.

A fractal approach to normal and pathological voices.

Acta Otolaryngol. 2000 Mar;120(2):222-4. doi: 10.1080/000164800750000964.

A study of speech fractal dimensions.

Acta Otolaryngol. 1999 Mar;119(2):261-6. doi: 10.1080/00016489950181783.

Dimension increase in filtered chaotic signals.

Phys Rev Lett. 1988 Mar 14;60(11):979-982. doi: 10.1103/PhysRevLett.60.979.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

语音声学分析中的低通滤波效果。

Effects of low-pass filtering on acoustic analysis of voice.

机构信息

出版信息

STUDY DESIGN

METHODS

RESULTS

CONCLUSION

研究设计

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献