Melbourne Neuropsychiatry Centre, Department of Psychiatry, The University of Melbourne, Australia.
Melbourne Neuropsychiatry Centre, Department of Psychiatry, The University of Melbourne, Australia; Department of Biomedical Engineering, Faculty of Engineering and Information Technology, The University of Melbourne, Australia.
Neuroimage. 2021 Dec 15;245:118648. doi: 10.1016/j.neuroimage.2021.118648. Epub 2021 Oct 19.
Cognitive performance can be predicted from an individual's functional brain connectivity with modest accuracy using machine learning approaches. As yet, however, predictive models have arguably yielded limited insight into the neurobiological processes supporting cognition. To do so, feature selection and feature weight estimation need to be reliable to ensure that important connections and circuits with high predictive utility can be reliably identified. We comprehensively investigate feature weight test-retest reliability for various predictive models of cognitive performance built from resting-state functional connectivity networks in healthy young adults (n=400). Despite achieving modest prediction accuracies (r=0.2-0.4), we find that feature weight reliability is generally poor for all predictive models (ICC< 0.3), and significantly poorer than predictive models for overt biological attributes such as sex (ICC≈0.5). Larger sample sizes (n=800), the Haufe transformation, non-sparse feature selection/regularization and smaller feature spaces marginally improve reliability (ICC< 0.4). We elucidate a tradeoff between feature weight reliability and prediction accuracy and find that univariate statistics are marginally more reliable than feature weights from predictive models. Finally, we show that measuring agreement in feature weights between cross-validation folds provides inflated estimates of feature weight reliability. We thus recommend for reliability to be estimated out-of-sample, if possible. We argue that rebalancing focus from prediction accuracy to model reliability may facilitate mechanistic understanding of cognition with machine learning approaches.
使用机器学习方法,可以从个体的功能大脑连接以中等精度预测认知表现。然而,到目前为止,预测模型在支持认知的神经生物学过程方面的洞察力有限。为了实现这一点,需要可靠的特征选择和特征权重估计,以确保可以可靠地识别具有高预测效用的重要连接和电路。我们全面研究了来自健康年轻成年人(n=400)静息状态功能连接网络构建的认知表现各种预测模型的特征权重测试-重测可靠性。尽管达到了中等的预测精度(r=0.2-0.4),但我们发现所有预测模型的特征权重可靠性普遍较差(ICC<0.3),明显低于用于显性生物学属性(如性别)的预测模型(ICC≈0.5)。更大的样本量(n=800)、Haufe 变换、非稀疏特征选择/正则化和较小的特征空间略微提高了可靠性(ICC<0.4)。我们阐明了特征权重可靠性和预测准确性之间的权衡,并发现单变量统计数据比预测模型的特征权重稍微可靠。最后,我们表明,在交叉验证折叠之间测量特征权重的一致性会高估特征权重可靠性的估计值。因此,如果可能的话,我们建议在样本外估计可靠性。我们认为,重新平衡从预测准确性到模型可靠性的焦点可能有助于通过机器学习方法对认知进行机制理解。