Speech Science Laboratory, Division of Speech and Hearing Sciences, University of Hong Kong, Hong Kong, China.
J Voice. 2013 Jan;27(1):101-10. doi: 10.1016/j.jvoice.2012.06.009. Epub 2012 Oct 6.
Esophageal (SE) and tracheoesophageal (TE) speech are the most commonly used alaryngeal voicing types after total laryngectomy-a surgical procedure of removing a pathological larynx. Both SE and TE voices show more aperiodicity than normal laryngeal (NL) voices, and the vocal characteristics of alaryngeal voices are notoriously difficult to extract. The present study investigated the difference in vocal characteristics among NL, SE, and TE voices using perception measures and nonlinear dynamical analysis. Correlation dimension (D(2)) and sample entropy (SampEn) were obtained from 90 voice samples produced by 10 TE, 10 SE, and 10 NL male Cantonese speakers. Correlation between nonlinear dynamical parameters and perceptual ratings of different voices was also examined. Results show that both D(2) and SampEn values were significantly higher for TE and SE voices than NL voice. The overall perceptual judgment of SE and TE voice quality was negatively correlated with D(2) and SampEn. This finding supports the validity of using nonlinear dynamical parameters in assessing voice quality. Results of the present study also indicate that nonlinear dynamical analysis could be a supplemental tool to traditional acoustic analysis, especially for analyzing the voice quality of alaryngeal speech.
食管(SE)和气管食管(TE)语音是全喉切除术后最常用的两种人工发声方式,全喉切除是一种切除病理性喉部的手术。SE 和 TE 语音的非周期性都比正常喉部(NL)语音强,而人工发声的声音特征是出了名的难以提取。本研究使用感知测量和非线性动力学分析来研究 NL、SE 和 TE 语音之间的声音特征差异。从 10 名讲粤语的男性 NL、TE 和 SE 语音样本中获得了关联维数(D(2))和样本熵(SampEn)。还检查了非线性动力学参数与不同声音感知评分之间的相关性。结果表明,TE 和 SE 语音的 D(2)和 SampEn 值均明显高于 NL 语音。SE 和 TE 语音质量的整体感知判断与 D(2)和 SampEn 呈负相关。这一发现支持了使用非线性动力学参数评估语音质量的有效性。本研究的结果还表明,非线性动力学分析可能是传统声学分析的补充工具,特别是用于分析人工发声的语音质量。