Sharma Garima, Prasad Deepak, Umapathy Karthikeyan, Krishnan Sridhar
Annu Int Conf IEEE Eng Med Biol Soc. 2020 Jul;2020:964-967. doi: 10.1109/EMBC44109.2020.9176056.
A child having a delayed development in language skills without any reason is known to be suffering from specific language impairment (SLI). Unfortunately, almost 7% kindergarten children are reported with SLI in their childhood. The SLI could be treated if identified at an early stage, but diagnosing SLI at early stage is challenging. In this article, we propose a machine learning based system to screen the SLI speech by analyzing the texture of the speech utterances. The texture of speech signals is extracted from the popular time-frequency representation called spectrograms. These spectrogram acts like a texture image and the textural features to capture the change in audio quality such as Haralick's feature and local binary patterns (LBPs) are extracted from these textural images. The experiments are performed on 4214 utterances taken from 44 healthy and 54 SLI speakers. Experimental results with 10-fold cross validation, indicates that a very good accuracy up to 97.41% is obtained when only 14 dimensional Haralick's feature is used. The accuracy is slightly boosted up to 99% when the 59-dimensional LBPs are amalgamated with Haralick's features. The sensitivity and specificity of the whole system is up to 98.96% and 99.20% respectively. The proposed method is gender and speaker independent and invariant to examination conditions.
一个语言技能发展延迟且无任何原因的儿童被认为患有特定语言障碍(SLI)。不幸的是,据报道,近7%的幼儿园儿童在童年时期患有SLI。如果在早期阶段发现,SLI是可以治疗的,但在早期阶段诊断SLI具有挑战性。在本文中,我们提出了一种基于机器学习的系统,通过分析语音话语的纹理来筛查SLI语音。语音信号的纹理是从称为频谱图的流行时频表示中提取的。这些频谱图就像一个纹理图像,并且从这些纹理图像中提取用于捕捉音频质量变化的纹理特征,如哈拉里克特征和局部二值模式(LBP)。实验是对从44名健康人和54名SLI患者那里获取的4214个话语进行的。10折交叉验证的实验结果表明,仅使用14维哈拉里克特征时,可获得高达97.41%的非常好的准确率。当将59维LBP与哈拉里克特征合并时,准确率略有提高,达到99%。整个系统的灵敏度和特异性分别高达98.96%和99.20%。所提出的方法与性别和说话者无关,并且不受检查条件的影响。