Hoq Muntasir, Uddin Mohammed Nazim, Park Seung-Bo
Department of Computer Science and Engineering, East Delta University, Chattogram 4209, Bangladesh.
Department of Software Convergence Engineering, Inha University, Incheon 22201, Korea.
Diagnostics (Basel). 2021 Jun 11;11(6):1076. doi: 10.3390/diagnostics11061076.
As a neurodegenerative disorder, Parkinson's disease (PD) affects the nerve cells of the human brain. Early detection and treatment can help to relieve the symptoms of PD. Recent PD studies have extracted the features from vocal disorders as a harbinger for PD detection, as patients face vocal changes and impairments at the early stages of PD. In this study, two hybrid models based on a Support Vector Machine (SVM) integrating with a Principal Component Analysis (PCA) and a Sparse Autoencoder (SAE) are proposed to detect PD patients based on their vocal features. The first model extracted and reduced the principal components of vocal features based on the explained variance of each feature using PCA. For the first time, the second model used a novel Deep Neural Network (DNN) of an SAE, consisting of multiple hidden layers with L1 regularization to compress the vocal features into lower-dimensional latent space. In both models, reduced features were fed into the SVM as inputs, which performed classification by learning hyperplanes, along with projecting the data into a higher dimension. An F1-score, a Mathews Correlation Coefficient (MCC), and a Precision-Recall curve were used, along with accuracy to evaluate the proposed models due to highly imbalanced data. With its highest accuracy of 0.935, F1-score of 0.951, and MCC value of 0.788, the probing results show that the proposed model of the SAE-SVM surpassed not only the former model of the PCA-SVM and other standard models including Multilayer Perceptron (MLP), Extreme Gradient Boosting (XGBoost), K-Nearest Neighbor (KNN), and Random Forest (RF), but also surpassed two recent studies using the same dataset. Oversampling and balancing the dataset with SMOTE boosted the performance of the models.
作为一种神经退行性疾病,帕金森病(PD)会影响人类大脑的神经细胞。早期检测和治疗有助于缓解帕金森病的症状。最近的帕金森病研究已经从声音障碍中提取特征,作为帕金森病检测的先兆,因为患者在帕金森病早期会出现声音变化和损伤。在本研究中,提出了两种基于支持向量机(SVM)与主成分分析(PCA)和稀疏自编码器(SAE)相结合的混合模型,用于根据帕金森病患者的声音特征进行检测。第一个模型基于每个特征的解释方差,使用主成分分析提取并减少声音特征的主成分。第二个模型首次使用了一种新颖的稀疏自编码器深度神经网络(DNN),它由多个带有L1正则化的隐藏层组成,将声音特征压缩到低维潜在空间。在这两种模型中,减少后的特征作为输入被送入支持向量机,支持向量机通过学习超平面进行分类,并将数据投影到更高维度。由于数据高度不平衡,使用F1分数、马修斯相关系数(MCC)和精确召回曲线以及准确率来评估所提出的模型。探测结果显示,所提出的稀疏自编码器-支持向量机模型的最高准确率为0.935,F1分数为0.951,MCC值为0.788,不仅超过了主成分分析-支持向量机的前一个模型以及其他标准模型,包括多层感知器(MLP)、极端梯度提升(XGBoost)、K近邻(KNN)和随机森林(RF),还超过了最近两项使用相同数据集的研究。使用合成少数过采样技术(SMOTE)对数据集进行过采样和平衡提高了模型的性能。