Asgari Meysam, Bayestehtashk Alireza, Shafran Izhak
Center for Spoken Language Understanding, Oregon Health & Science University, Portland, OR, USA.
Interspeech. 2013 Aug;2013:191-194.
In this paper, we report experiments on the Interspeech 2013 Autism Challenge, which comprises of two subtasks - detecting children with ASD and classifying them into four subtypes. We apply our recently developed algorithm to extract speech features that overcomes certain weaknesses of other currently available algorithms [1, 2]. From the input speech signal, we estimate the parameters of a harmonic model of the voiced speech for each frame including the fundamental frequency ( ). From the fundamental frequencies and the reconstructed noise-free signal, we compute other derived features such as Harmonic-to-Noise Ratio (HNR), shimmer, and jitter. In previous work, we found that these features detect voiced segments and speech more accurately than other algorithms and that they are useful in rating the severity of a subject's Parkinson's disease [3]. Here, we employ these features, along with standard features such as energy, cepstral, and spectral features. With these features, we detect ASD using a regression and identify the sub-type using a classifier. We find that our features improve the performance, measured in terms of unweighted average recall (UAR), of detecting autism spectrum disorder by 2.3% and classifying the disorder into four categories by 2.8% over the baseline results.
在本文中,我们报告了关于2013年国际语音会议自闭症挑战赛的实验,该挑战赛包括两个子任务——检测患有自闭症谱系障碍(ASD)的儿童并将他们分为四种亚型。我们应用我们最近开发的算法来提取语音特征,该算法克服了其他现有算法的某些弱点[1,2]。从输入的语音信号中,我们估计每一帧浊音语音的谐波模型参数,包括基频( )。从基频和重构的无噪声信号中,我们计算其他派生特征,如谐波噪声比(HNR)、微扰和抖动。在之前的工作中,我们发现这些特征比其他算法能更准确地检测浊音段和语音,并且它们在评估受试者帕金森病的严重程度方面很有用[3]。在这里,我们使用这些特征,以及诸如能量、倒谱和频谱特征等标准特征。利用这些特征,我们通过回归检测ASD,并使用分类器识别亚型。我们发现,以未加权平均召回率(UAR)衡量,我们的特征在检测自闭症谱系障碍方面比基线结果提高了2.3%,在将该障碍分为四类方面提高了2.8%。