Innes Carrie R H, Lee Dominic, Chen Chen, Ponder-Sutton Agate M, Melzer Tracy R, Jones Richard D
Van der Veer Institute for Parkinson's and Brain Research , Christchurch , New Zealand.
Q J Exp Psychol (Hove). 2011 Sep;64(9):1714-25. doi: 10.1080/17470218.2011.555821. Epub 2011 Jun 12.
Prediction of complex behavioural tasks via relatively simple modelling techniques, such as logistic regression and discriminant analysis, often has limited success. We hypothesized that to more accurately model complex behaviour, more complex models, such as kernel-based methods, would be needed. To test this hypothesis, we assessed the value of six modelling approaches for predicting driving ability based on performance on computerized sensory-motor and cognitive tests (SMCTests™) in 501 people with brain disorders. The models included three models previously used to predict driving ability (discriminant analysis, DA; binary logistic regression, BLR; and nonlinear causal resource analysis, NCRA) and three kernel methods (support vector machine, SVM; product kernel density, PK; and kernel product density, KP). At the classification level, two kernel methods were substantially more accurate at classifying on-road pass or fail (SVM 99.6%, PK 99.8%) than the other models (DA 76%, BLR 78%, NCRA 74%, KP 81%). However, accuracy decreased substantially for all of the kernel models when cross-validation techniques were used to estimate prediction of on-road pass or fail in an independent referral group (SVM 73-76%, PK 72-73%, KP 71-72%) but decreased only slightly for DA (74-75%) and BLR (75-76%). Cross-validation of NCRA was not possible. In conclusion, while kernel-based models are successful at modelling complex data at a classification level, this is likely to be due to overfitting of the data, which does not lead to an improvement in accuracy in independent data over and above the accuracy of other less complex modelling techniques.
通过相对简单的建模技术(如逻辑回归和判别分析)来预测复杂行为任务,往往成效有限。我们假设,要更准确地对复杂行为进行建模,需要更复杂的模型,如基于核的方法。为了验证这一假设,我们评估了六种建模方法在预测驾驶能力方面的价值,这些方法基于501名脑部疾病患者在计算机化感觉运动和认知测试(SMCTests™)中的表现。这些模型包括之前用于预测驾驶能力的三种模型(判别分析,DA;二元逻辑回归,BLR;以及非线性因果资源分析,NCRA)和三种核方法(支持向量机,SVM;乘积核密度,PK;以及核乘积密度,KP)。在分类层面,两种核方法在对道路测试通过或不通过进行分类时(SVM为99.6%,PK为99.8%)比其他模型(DA为76%,BLR为78%,NCRA为74%,KP为81%)准确得多。然而,当使用交叉验证技术来估计独立转诊组中道路测试通过或不通过的预测时,所有核模型的准确率都大幅下降(SVM为73 - 76%,PK为72 - 73%,KP为71 - 72%),但DA(74 - 75%)和BLR(75 - 76%)仅略有下降。无法对NCRA进行交叉验证。总之,虽然基于核的模型在分类层面能够成功地对复杂数据进行建模,但这可能是由于数据过度拟合所致,在独立数据中,其准确率并未超过其他较简单建模技术,无法实现进一步提升。