Zhang Tao, Jiang Peipei, Zhang Yajuan, Cao Yuyang
School of Information Science and Engineering, Yanshan University, Qinhuangdao, Hebei 066004, P.R.China;Hebei Key Laboratory on Information Transmission and Signal Processing, Yanshan University, Qinhuangdao, Hebei 066004, P.R.China.
Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2021 Feb 25;38(1):21-29. doi: 10.7507/1001-5515.202001024.
For speech detection in Parkinson's patients, we proposed a method based on time-frequency domain gradient statistics to analyze speech disorders of Parkinson's patients. In this method, speech signal was first converted to time-frequency domain (time-frequency representation). In the process, the speech signal was divided into frames. Through calculation, each frame was Fourier transformed to obtain the energy spectrum, which was mapped to the image space for visualization. Secondly, deviations values of each energy data on time axis and frequency axis was counted. According to deviations values, the gradient statistical features were used to show the abrupt changes of energy value in different time-domains and frequency-domains. Finally, KNN classifier was applied to classify the extracted gradient statistical features. In this paper, experiments on different speech datasets of Parkinson's patients showed that the gradient statistical features extracted in this paper had stronger clustering in classification. Compared with the classification results based on traditional features and deep learning features, the gradient statistical features extracted in this paper were better in classification accuracy, specificity and sensitivity. The experimental results show that the gradient statistical features proposed in this paper are feasible in speech classification diagnosis of Parkinson's patients.
针对帕金森病患者的语音检测,我们提出了一种基于时频域梯度统计的方法来分析帕金森病患者的语音障碍。在该方法中,语音信号首先被转换到时频域(时频表示)。在此过程中,语音信号被划分为帧。通过计算,对每一帧进行傅里叶变换以获得能谱,并将其映射到图像空间进行可视化。其次,统计各能量数据在时间轴和频率轴上的偏差值。根据偏差值,利用梯度统计特征来展示能量值在不同时域和频域中的突变情况。最后,应用KNN分类器对提取的梯度统计特征进行分类。本文在不同的帕金森病患者语音数据集上进行的实验表明,本文提取的梯度统计特征在分类中具有更强的聚类性。与基于传统特征和深度学习特征的分类结果相比,本文提取的梯度统计特征在分类准确率、特异性和敏感性方面表现更优。实验结果表明,本文提出的梯度统计特征在帕金森病患者的语音分类诊断中是可行的。