Manfredi Claudia, Giordano Andrea, Schoentgen Jean, Fraj Samia, Bocchi Leonardo, Dejonckere Philippe
Università degli Studi di Firenze, Department of Electronics and Telecommunications, Via S. Marta 3, Firenze 50139, Italy. claudia.manfredi@unifi .it
Logoped Phoniatr Vocol. 2011 Jul;36(2):78-89. doi: 10.3109/14015439.2011.578077. Epub 2011 May 24.
In this paper the effect of noise on both perceptual and automatic evaluation of the glottal cycle length in irregular voice signals (sustained vowels) is studied. The reliability of four tools for voice analysis (MDVP, Praat, AMPEX, and BioVoice) is compared to visual inspection made by trained clinicians using two measures of voice signal irregularity: the jitter (J) and the coefficient of variation of the fundamental frequency (F0CV). The purpose is also to test to what extent of irregularity trained raters are capable of determining visually the glottal cycle length as compared to dedicated software tools. For a perfect control of the amount of jitter and noise put in, data consist of synthesized sustained vowels corrupted by increasing jitter and noise. Both jitter and noise can be varied to the desired extent according to built-in functions. All the tools give almost reliable measurements up to 15% of jitter, for low or moderate noise, while only few of them are reliable for higher jitter and noise levels and would thus be suited for perturbation measures in strongly irregular voice signals. As shown in Part I of this work, for low noise levels the results obtained by visual inspection from expert raters are comparable or better than those obtained with the tools presented here, at the expense of a larger amount of time devoted to searching visually for the glottal cycle lengths in the signal waveform. In this paper it is shown that results rapidly deteriorate with increasing noise. Hence, the use of a robust tool for voice analysis can give valid support to clinicians in term of reliability, reproducibility of results, and time-saving.
本文研究了噪声对不规则语音信号(持续元音)中声门周期长度的感知评估和自动评估的影响。将四种语音分析工具(MDVP、Praat、AMPEX和BioVoice)的可靠性与训练有素的临床医生通过两种语音信号不规则性测量方法(抖动(J)和基频变异系数(F0CV))进行的目视检查进行了比较。目的还在于测试与专用软件工具相比,训练有素的评估者在多大程度的不规则性下能够通过视觉确定声门周期长度。为了完美控制引入的抖动和噪声量,数据由因抖动和噪声增加而受损的合成持续元音组成。根据内置功能,抖动和噪声都可以变化到所需的程度。对于低噪声或中等噪声,所有工具在抖动达到15%时都能给出几乎可靠的测量结果,而对于更高的抖动和噪声水平,只有少数工具是可靠的,因此适用于强不规则语音信号中的扰动测量。如本文第一部分所示,对于低噪声水平,专家评估者通过目视检查获得的结果与使用此处介绍的工具获得的结果相当或更好,但代价是在信号波形中目视搜索声门周期长度需要花费更多时间。本文表明,随着噪声增加,结果会迅速恶化。因此,使用强大的语音分析工具可以在可靠性、结果可重复性和节省时间方面为临床医生提供有效的支持。