Tran Kelvin, Xu Lingfeng, Stegmann Gabriela, Liss Julie, Berisha Visar, Utianski Rene L
Arizona State University, USA.
Aural Analytics, USA.
Interspeech. 2022 Sep;2022:2263-2267. doi: 10.21437/interspeech.2022-10817.
Acoustic analysis plays an important role in the assessment of dysarthria. Out of a public health necessity, telepractice has become increasingly adopted as the modality in which clinical care is given. While there are differences in software among telepractice platforms, they all use some form of speech compression to preserve bandwidth, with the most common algorithm being the Opus codec. Opus has been optimized for compression of speech from the general (mostly healthy) population. As a result, for speech-language pathologists, this begs the question: is the remotely transmitted speech signal a faithful representation of dysarthric speech? Existing high-fidelity audio recordings from 20 speakers of various dysarthria types were encoded at three different bit rates defined within Opus to simulate different internet bandwidth conditions. Acoustic measures of articulation, voice, and prosody were extracted, and mixed-effect models were used to evaluate the impact of bandwidth conditions on the measures. Significant differences in cepstral peak prominence, degree of voice breaks, jitter, vowel space area, pitch, and vowel space area were observed after Opus processing, providing insight into the types of acoustic measures that are susceptible to speech compression algorithms.
声学分析在构音障碍评估中起着重要作用。出于公共卫生需求,远程医疗已越来越多地被用作提供临床护理的方式。虽然远程医疗平台之间的软件存在差异,但它们都使用某种形式的语音压缩来节省带宽,最常用的算法是Opus编解码器。Opus已针对一般(大多为健康)人群的语音压缩进行了优化。因此,对于言语治疗师来说,这就引出了一个问题:远程传输的语音信号是否能如实反映构音障碍语音?从20名不同类型构音障碍患者那里获取的现有高保真音频记录,按照Opus中定义的三种不同比特率进行编码,以模拟不同的网络带宽条件。提取了清晰度、嗓音和韵律方面的声学指标,并使用混合效应模型来评估带宽条件对这些指标的影响。经过Opus处理后,观察到谐波峰值突出度、嗓音中断程度、抖动、元音空间面积、音高和谐波峰值突出度等方面存在显著差异,这为了解易受语音压缩算法影响的声学指标类型提供了依据。