Clapham Renee P, van As-Brooks Corina J, van Son Rob J J H, Hilgers Frans J M, van den Brekel Michiel W M
Amsterdam Center for Language and Communication, University of Amsterdam (ACLC/UvA), Amsterdam, The Netherlands; Department of Head and Neck Oncology and Surgery, The Netherlands Cancer Institute-Antoni van Leeuwenhoek, Amsterdam, The Netherlands.
Department of Head and Neck Oncology and Surgery, The Netherlands Cancer Institute-Antoni van Leeuwenhoek, Amsterdam, The Netherlands; Department of Marketing and Clinical Affairs, Atos Medical, Hörby, Sweden.
J Voice. 2015 Jul;29(4):517.e23-9. doi: 10.1016/j.jvoice.2014.10.002. Epub 2015 Mar 17.
To investigate the relationship between acoustic signal typing and perceptual evaluation of sustained vowels produced by tracheoesophageal (TE) speakers and the use of signal typing in the clinical setting.
Two evaluators independently categorized 1.75-second segments of narrow-band spectrograms according to acoustic signal typing and independently evaluated the recording of the same segments on a visual analog scale according to overall perceptual acoustic voice quality. The relationship between acoustic signal typing and overall voice quality (as a continuous scale and as a four-point ordinal scale) was investigated and the proportion of inter-rater agreement as well as the reliability between the two measures is reported.
The agreement between signal type (I-IV) and ordinal voice quality (four-point scale) was low but significant, and there was a significant linear relationship between the variables. Signal type correctly predicted less than half of the voice quality data. There was a significant main effect of signal type on continuous voice quality scores with significant differences in median quality scores between signal types I-IV, I-III, and I-II.
Signal typing can be used as an adjunct to perceptual and acoustic evaluation of the same stimuli for TE speech as part of a multidimensional evaluation protocol. Signal typing in its current form provides limited predictive information on voice quality, and there is significant overlap between signal types II and III and perceptual categories. Future work should consider whether the current four signal types could be refined.
研究声信号分类与经气管食管造口(TE)发声者所发持续元音的感知评估之间的关系,以及声信号分类在临床环境中的应用。
两名评估者根据声信号分类对1.75秒的窄带频谱图片段进行独立分类,并根据整体感知声学语音质量在视觉模拟量表上对相同片段的录音进行独立评估。研究了声信号分类与整体语音质量(作为连续量表和四点顺序量表)之间的关系,并报告了评估者间的一致率以及两种测量方法之间的可靠性。
信号类型(I-IV)与顺序语音质量(四点量表)之间的一致性较低但具有显著性,且变量之间存在显著的线性关系。信号类型正确预测的语音质量数据不到一半。信号类型对连续语音质量得分有显著的主效应,信号类型I-IV、I-III和I-II之间的中位数质量得分存在显著差异。
作为多维评估方案的一部分,信号分类可作为对TE语音相同刺激进行感知和声学评估的辅助手段。当前形式的信号分类提供的关于语音质量的预测信息有限,且信号类型II和III与感知类别之间存在显著重叠。未来的工作应考虑是否可以对当前的四种信号类型进行细化。