Maryn Youri, Dwenger Kaitlyn, Kaufmann Sidney, Barkmeier-Kraemer Julie
European Institute for Otorhinolaryngology - Head & Neck Surgery (ORL-HNS), GZA Sint-Augustinus, Antwerp, Belgium.
Department of Rehabilitation Sciences, Faculty of Medicine and Health Sciences, Ghent University, Belgium.
J Speech Lang Hear Res. 2025 Jun 5;68(6):2721-2740. doi: 10.1044/2025_JSLHR-24-00467. Epub 2025 May 5.
This study compared three methods of acoustic algorithm-supported extraction and analysis of vocal tremor properties (i.e., rate, extent, and regularity of intensity level and fundamental frequency modulation): (a) visual perception and manual data extraction, (b) semi-automated data extraction, and (c) fully automated data extraction.
Forty-five midvowel sustained [a:] and [i:] audio recordings were collected as part of a scientific project to learn about the physiologic substrates of vocal tremor. This convenience data set contained vowels with a representative variety in vocal tremor severity. First, the vocal tremor properties in intensity level and fundamental frequency tracks were visually inspected and manually measured using Praat software. Second, the vocal tremor properties were determined using two Praat scripts: automated with the script of Maryn et al. (2019) and semi-automated with an adjusted version of this script to enable the user to intervene with the signal processing. The reliability of manual vocal tremor property measurement was assessed using the intraclass correlation coefficient. The properties as measured with the two scripts (automated vs. semi-automated) were compared with the manually determined properties using correlation and diagnostic accuracy statistical methods.
With intraclass correlation coefficients between .770 and .914, the reliability of the manual method was acceptable. The semi-automated method correlated with manual property measures better and was more accurate in diagnosing vocal tremor than the automated method.
Manual acoustic measurement of vocal tremor properties can be laborious and time-consuming. Automated or semi-automated acoustic methods may improve efficiency in vocal tremor property measurement in clinical as well as research settings. Although both Praat script-supported methods in this study yielded acceptable validity with the manual data measurements as a referent, the semi-automated method showed the best outcomes.
本研究比较了三种声学算法支持的声带震颤特性(即强度水平和基频调制的速率、程度和规律性)提取与分析方法:(a)视觉感知和手动数据提取,(b)半自动数据提取,以及(c)全自动数据提取。
作为一项了解声带震颤生理基础的科研项目的一部分,收集了45个中元音持续发声[a:]和[i:]的音频记录。这个便利数据集包含了声带震颤严重程度具有代表性差异的元音。首先,使用Praat软件对强度水平和基频轨迹中的声带震颤特性进行视觉检查和手动测量。其次,使用两个Praat脚本确定声带震颤特性:一个是采用Maryn等人(2019年)的脚本进行自动分析,另一个是采用该脚本的调整版本进行半自动分析,以便用户能够干预信号处理。使用组内相关系数评估手动测量声带震颤特性的可靠性。使用相关性和诊断准确性统计方法,将两种脚本(自动与半自动)测量的特性与手动确定的特性进行比较。
组内相关系数在0.770至0.914之间,手动方法的可靠性可以接受。半自动方法与手动特性测量的相关性更好,并且在诊断声带震颤方面比自动方法更准确。
手动声学测量声带震颤特性可能既费力又耗时。自动或半自动声学方法可能会提高临床和研究环境中声带震颤特性测量的效率。尽管本研究中两种Praat脚本支持的方法以手动数据测量为参照都产生了可接受的有效性,但半自动方法显示出最佳结果。