Dafflon Jessica, Moraczewski Dustin, Earl Eric, Nielson Dylan M, Loewinger Gabriel, McClure Patrick, Thomas Adam G, Pereira Francisco
Data Science & Sharing Team, National Institute of Mental Health, Bethesda, MD, USA.
Machine Learning Team, National Institute of Mental Health, Bethesda, MD, USA.
ArXiv. 2024 May 1:arXiv:2405.00255v1.
One of the central objectives of contemporary neuroimaging research is to create predictive models that can disentangle the connection between patterns of functional connectivity across the entire brain and various behavioral traits. Previous studies have shown that models trained to predict behavioral features from the individual's functional connectivity have modest to poor performance. In this study, we trained models that predict observable individual traits (phenotypes) and their corresponding singular value decomposition (SVD) representations - herein referred to as from resting state functional connectivity. For this task, we predicted phenotypes in two large neuroimaging datasets: the Human Connectome Project (HCP) and the Philadelphia Neurodevelopmental Cohort (PNC). We illustrate the importance of regressing out confounds, which could significantly influence phenotype prediction. Our findings reveal that both phenotypes and their corresponding latent phenotypes yield similar predictive performance. Interestingly, only the first five latent phenotypes were reliably identified, and using just these reliable phenotypes for predicting phenotypes yielded a similar performance to using all latent phenotypes. This suggests that the predictable information is present in the first latent phenotypes, allowing the remainder to be filtered out without any harm in performance. This study sheds light on the intricate relationship between functional connectivity and the predictability and reliability of phenotypic information, with potential implications for enhancing predictive modeling in the realm of neuroimaging research.
当代神经影像学研究的核心目标之一是创建预测模型,该模型能够理清全脑功能连接模式与各种行为特征之间的联系。先前的研究表明,训练用于从个体功能连接预测行为特征的模型表现中等至较差。在本研究中,我们训练了能够从静息态功能连接预测可观察到的个体特征(表型)及其相应奇异值分解(SVD)表示形式(本文称为潜在表型)的模型。对于此任务,我们在两个大型神经影像数据集:人类连接组计划(HCP)和费城神经发育队列(PNC)中预测表型。我们阐述了消除可能显著影响表型预测的混杂因素的重要性。我们的研究结果表明,表型及其相应的潜在表型都产生了相似的预测性能。有趣的是,仅可靠地识别出了前五个潜在表型,并且仅使用这些可靠的潜在表型来预测表型,其性能与使用所有潜在表型相似。这表明可预测信息存在于前几个潜在表型中,从而可以滤除其余部分而不会对性能造成任何损害。本研究揭示了功能连接与表型信息的可预测性和可靠性之间的复杂关系,对增强神经影像学研究领域的预测建模具有潜在意义。