Department of Psychosis Studies, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK.
Department of Neuroimaging, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK.
Hum Brain Mapp. 2021 Jun 1;42(8):2332-2346. doi: 10.1002/hbm.25368. Epub 2021 Mar 19.
Brain morphology varies across the ageing trajectory and the prediction of a person's age using brain features can aid the detection of abnormalities in the ageing process. Existing studies on such "brain age prediction" vary widely in terms of their methods and type of data, so at present the most accurate and generalisable methodological approach is unclear. Therefore, we used the UK Biobank data set (N = 10,824, age range 47-73) to compare the performance of the machine learning models support vector regression, relevance vector regression and Gaussian process regression on whole-brain region-based or voxel-based structural magnetic resonance imaging data with or without dimensionality reduction through principal component analysis. Performance was assessed in the validation set through cross-validation as well as an independent test set. The models achieved mean absolute errors between 3.7 and 4.7 years, with those trained on voxel-level data with principal component analysis performing best. Overall, we observed little difference in performance between models trained on the same data type, indicating that the type of input data had greater impact on performance than model choice. All code is provided online in the hope that this will aid future research.
大脑形态在衰老过程中存在差异,利用大脑特征预测一个人的年龄可以帮助检测衰老过程中的异常。现有的此类“大脑年龄预测”研究在方法和数据类型上差异很大,因此目前最准确和可推广的方法尚不清楚。因此,我们使用英国生物银行数据集(N=10824,年龄范围 47-73 岁)比较了支持向量回归、相关向量回归和高斯过程回归在基于全脑区域或体素的结构磁共振成像数据上的性能,以及是否通过主成分分析进行降维。通过交叉验证和独立测试集在验证集中评估了性能。这些模型在验证集上的平均绝对误差在 3.7 到 4.7 岁之间,经过主成分分析训练的体素级数据模型表现最好。总体而言,我们观察到在相同数据类型上训练的模型之间的性能差异很小,这表明输入数据的类型对性能的影响大于模型选择。所有代码都在线提供,希望这将有助于未来的研究。