College of Computer Science and Information Technology, University of Al-Qadisiyah, Dewaniyah, Iraq.
School of Public Health and Preventive Medicine, Faculty of Medicine, Nursing and Health Sciences, Monash University, Victoria, Australia.
Int J Med Inform. 2024 Nov;191:105583. doi: 10.1016/j.ijmedinf.2024.105583. Epub 2024 Aug 2.
Traditional classifier for the classification of diseases, such as K-Nearest Neighbors (KNN), Linear Discriminant Analysis (LDA), Random Forest (RF), Logistic Regression (LR), and Support Vector Machine (SVM), often struggle with high-dimensional medical datasets.
This study presents a novel classifier to overcome the limitations of traditional classifiers in Parkinson's disease (PD) detection based on Gower distance.
We present the Gower distance metric to handle diverse feature sets in voice recordings, which acts as a dissimilarity measure for all feature types, making the model adept at identifying subtle patterns indicative of PD. Additionally, the Cuckoo Search algorithm is employed for feature selection, reducing dimensionality by focusing on key features, thereby lessening the computational load associated with high-dimensional datasets.
The proposed classifier based on Gower distance resulted in an accuracy rate of 98.3% with feature selection and achieved an accuracy of 94.92% without the feature selection method. It outperforms traditional classifiers and recent studies in PD detection from voice recordings.
This accuracy shows the capability of the approach in the correct classification of instances and points out the potential of the approach as a reliable diagnostic tool for the medical practitioner. The findings state that the proposed approach holds promise for improving the diagnosis and monitoring of PD, both within medical institutions and at homes for the elderly.
传统的疾病分类器,如 K-最近邻(KNN)、线性判别分析(LDA)、随机森林(RF)、逻辑回归(LR)和支持向量机(SVM),在处理高维医学数据集时常常遇到困难。
本研究提出了一种新的分类器,以克服基于 Gower 距离的帕金森病(PD)检测中传统分类器的局限性。
我们提出了 Gower 距离度量来处理语音记录中的不同特征集,它作为所有特征类型的不相似性度量,使模型能够识别出提示 PD 的细微模式。此外,采用布谷鸟搜索算法进行特征选择,通过关注关键特征来降低维度,从而减轻高维数据集的计算负担。
基于 Gower 距离的分类器在进行特征选择时的准确率为 98.3%,而不使用特征选择方法时的准确率为 94.92%。它优于传统分类器和最近在语音记录中进行 PD 检测的研究。
该准确率表明了该方法在正确分类实例方面的能力,并指出了该方法作为医疗从业者可靠诊断工具的潜力。研究结果表明,该方法有望改善 PD 的诊断和监测,无论是在医疗机构还是在老年人家庭中。