Department of Otorhinolaryngology, Head and Neck Surgery, Faculty of Medicine, Academic Assembly, University of Toyama, 2630 Sugitani, Toyama City, Toyama Prefecture, 930-0194, Japan.
Sci Rep. 2022 Dec 2;12(1):20805. doi: 10.1038/s41598-022-24979-9.
Machine learning is considered a potential aid to support human decision making in disease prediction. In this study, we determined the utility of various machine learning algorithms in classifying peripheral vestibular (PV) and non-PV diseases based on the results of equilibrium function tests. A total of 1009 patients who had undergone our standardized neuro-otological examinations were recruited. We applied five supervised machine learning algorithms (random forest, adaboost, gradient boosting, support vector machine, and logistic regression). After preprocessing the data, optimizing the hyperparameters using GridSearchCV, and performing a final evaluation on the test set using scikit-learn, we evaluated the predictive capability using various performance metrics, namely, accuracy, F1-score, area under the receiver operating characteristic curve, precision, recall, and Matthews correlation coefficient (MCC). All five machine learning algorithms yielded satisfactory results; the accuracy of the algorithms ranged from 76 to 79%, with the support vector machine classifier having the highest accuracy. In cases where the predictions of the five models were consistent, the accuracy of the PV diagnostic results was improved to 83%, whereas it increased to 85% for the non-PV diagnostic results. Future research should increase the number of patients and optimize the classification methods to obtain the highest diagnostic accuracy.
机器学习被认为是一种支持人类在疾病预测中决策的潜在辅助手段。在这项研究中,我们根据平衡功能测试的结果,确定了各种机器学习算法在分类外周前庭(PV)和非-PV 疾病方面的效用。共有 1009 名接受过我们标准化神经耳科检查的患者被纳入研究。我们应用了五种监督机器学习算法(随机森林、adaboost、梯度提升、支持向量机和逻辑回归)。在对数据进行预处理、使用 GridSearchCV 优化超参数以及使用 scikit-learn 在测试集上进行最终评估之后,我们使用各种性能指标评估了预测能力,即准确性、F1 得分、接收者操作特征曲线下的面积、精度、召回率和马修斯相关系数(MCC)。所有五种机器学习算法都取得了令人满意的结果;算法的准确性在 76%到 79%之间,支持向量机分类器的准确性最高。在五个模型的预测一致的情况下,PV 诊断结果的准确性提高到了 83%,而非-PV 诊断结果的准确性提高到了 85%。未来的研究应该增加患者数量并优化分类方法,以获得最高的诊断准确性。