Gaziantep Vocational School of Higher Education, University of Gaziantep, Gaziantep, Turkey.
J Med Syst. 2012 Aug;36(4):2141-7. doi: 10.1007/s10916-011-9678-1. Epub 2011 Mar 10.
Parkinson disease (PD) is an age-related deterioration of certain nerve systems, which affects movement, balance, and muscle control of clients. PD is one of the common diseases which affect 1% of people older than 60 years. A new classification scheme based on support vector machine (SVM) selected features to train rotation forest (RF) ensemble classifiers is presented for improving diagnosis of PD. The dataset contains records of voice measurements from 31 people, 23 with PD and each record in the dataset is defined with 22 features. The diagnosis model first makes use of a linear SVM to select ten most relevant features from 22. As a second step of the classification model, six different classifiers are trained with the subset of features. Subsequently, at the third step, the accuracies of classifiers are improved by the utilization of RF ensemble classification strategy. The results of the experiments are evaluated using three metrics; classification accuracy (ACC), Kappa Error (KE) and Area under the Receiver Operating Characteristic (ROC) Curve (AUC). Performance measures of two base classifiers, i.e. KStar and IBk, demonstrated an apparent increase in PD diagnosis accuracy compared to similar studies in literature. After all, application of RF ensemble classification scheme improved PD diagnosis in 5 of 6 classifiers significantly. We, numerically, obtained about 97% accuracy in RF ensemble of IBk (a K-Nearest Neighbor variant) algorithm, which is a quite high performance for Parkinson disease diagnosis.
帕金森病(PD)是一种与年龄相关的某些神经系统恶化,它会影响客户的运动、平衡和肌肉控制。PD 是影响 60 岁以上人群 1%的常见疾病之一。本文提出了一种基于支持向量机(SVM)选择特征来训练旋转森林(RF)集成分类器的新分类方案,以提高 PD 的诊断能力。该数据集包含 31 个人的语音测量记录,其中 23 人患有 PD,每个记录都定义了 22 个特征。诊断模型首先利用线性 SVM 从 22 个特征中选择十个最相关的特征。作为分类模型的第二步,使用特征子集训练六个不同的分类器。随后,在第三步中,利用 RF 集成分类策略来提高分类器的准确性。实验结果使用三个指标进行评估;分类准确率(ACC)、Kappa 错误(KE)和接收者操作特征曲线下的面积(ROC)(AUC)。与文献中的类似研究相比,两种基本分类器(即 KStar 和 IBk)的性能度量显示出 PD 诊断准确性的明显提高。最终,RF 集成分类方案在 6 个分类器中的 5 个中显著提高了 PD 诊断的准确性。我们使用 IBk(一种 K-最近邻变体)算法的 RF 集成算法得到了约 97%的准确率,这对于帕金森病诊断来说是一个相当高的性能。