Department of Computing & Technology, Anglia Ruskin University, Cambridge, United Kingdom.
Vision & Eye Research Unit (VERU), School of Medicine, Anglia Ruskin University, Cambridge, United Kingdom.
PLoS One. 2018 Oct 18;13(10):e0205754. doi: 10.1371/journal.pone.0205754. eCollection 2018.
In clinical neuropsychology the cognitive abilities of neurological patients are commonly estimated using well-established paper-based tests. Typically, scores on some tests remain relatively well preserved, whilst others exhibit a significant and disproportionate decline. Scores on those tests that measure preserved cognitive functions (so-called 'hold' tests) may be used to estimate premorbid abilities, including scores in non-hold tests that would have been expected prior to the onset of cognitive impairment. Many hold tests entail word reading, with each word being graded as correctly or incorrectly pronounced. Inevitably, such tests are likely to contain words that provide little or no diagnostic power (i.e., can be eliminated without negatively affecting prediction accuracy). In this paper, a genetic algorithm is developed and demonstrated, using n = 92 neurologically healthy participants, to identify optimal word subsets from the National Adult Reading Test that minimize the mean error in predicting the most widely used clinical measure of IQ and cognitive ability, the Wechsler Adult Intelligence Scale Fourth Edition IQ. In addition to requiring only 17-20 of the original 50 words (suggesting that this test could be revised to be up to 66% shorter) and minimizing mean prediction error, the algorithm increases the proportion of the variance in the predicted variable explained in comparison to using all words (from r2 = 0.46 to r2 = 0.61). In a clinical setting this would improve estimates of premorbid cognitive function and, if an abbreviated revision to this test were to be adopted, reduce the arduousness of the test for patients. The proposed method is evaluated with jackknifing and leave one out cross validation. The general approach may be used to optimize the relationship between any two psychological tests by finding the question subset in one test that minimizes the prediction error in a second test by training the genetic algorithm using data collected from participants upon whom both tests have been administered. This approach may also be used to develop new predictive tests, since it provides a method to identify an optimal subset of a set of candidate questions (for which empirical data have been collected) that maximizes prediction accuracy and the proportion of variance in the predicted variable that can be explained.
在临床神经心理学中,通常使用成熟的纸质测试来评估神经患者的认知能力。通常,某些测试的分数保持相对较好,而其他测试则表现出明显且不成比例的下降。那些衡量保留认知功能的测试(所谓的“保留”测试)的分数可用于估计发病前的能力,包括在认知障碍发作之前预期的非保留测试的分数。许多保留测试都需要进行单词阅读,每个单词都被评为正确或不正确发音。不可避免的是,此类测试可能包含提供很少或没有诊断能力的单词(即,如果不影响预测准确性,则可以将其删除)。在本文中,使用 n = 92 名神经健康参与者开发并演示了遗传算法,以从全国成人阅读测试中确定最佳单词子集,从而最大程度地减少预测最广泛使用的临床智商和认知能力测试(即韦氏成人智力测验第四版智商)的平均误差。该算法不仅仅需要原始 50 个单词中的 17-20 个(表明该测试可以修订为缩短至 66%),并且可以最小化平均预测误差,而且还可以增加预测变量的方差比例比使用所有单词(从 r2 = 0.46 增加到 r2 = 0.61)。在临床环境中,这将改善发病前认知功能的估计值,如果采用该测试的简短修订版,还将减少患者的测试难度。使用jackknifing 和 leave one out 交叉验证评估了所提出的方法。通过使用从接受过两种测试的参与者那里收集的数据来训练遗传算法,可以将该通用方法用于通过找到一个测试中的问题子集来优化任何两个心理测试之间的关系,该子集可以最小化第二个测试中的预测误差。该方法还可用于开发新的预测测试,因为它提供了一种方法来识别一组候选问题(已收集了经验数据)的最佳子集,该子集可最大程度地提高预测准确性和预测变量的方差比例。