Interdepartmental Neuroscience Program, Yale University, New Haven, CT 06520, USA.
Department of Psychiatry, College of Physicians and Surgeons, Columbia University, New York, NY 10032, USA.
Cereb Cortex. 2017 Nov 1;27(11):5415-5429. doi: 10.1093/cercor/bhx230.
Best practices are currently being developed for the acquisition and processing of resting-state magnetic resonance imaging data used to estimate brain functional organization, or "functional connectivity." Standards have been proposed based on test-retest reliability, but open questions remain. These include how the amount of data per subject influences whole-brain reliability, the effect of adding runs versus adding sessions, the spatial distribution of reliability, the reliability of multivariate methods, and, crucially, how reliability maps onto prediction of behavior. We collected a dataset of 12 extensively sampled individuals (144 min of data each across 2 identically configured scanners) to assess test-retest reliability of whole-brain connectivity within the generalizability theory framework. We used Human Connectome Project data to replicate these analyses and relate reliability to behavioral prediction. Overall, the historical 5-min scan produced poor reliability averaged across connections. Increasing the number of sessions was more beneficial than increasing the number of runs. Reliability was lowest for subcortical connections and highest for within-network cortical connections. Multivariate reliability was greater than univariate reliability. Finally, reliability could not be used to improve prediction; these findings are among the first to underscore this distinction for functional connectivity. A comprehensive understanding of test-retest reliability, including its limitations, supports the development of best practices in the field.
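To illustrate how a generalizability theory analysis can weigh adding sessions against adding runs, the following is a minimal sketch of the standard dependability coefficient for a fully crossed person x session x run design. This simplified two-facet design is an assumption made here for illustration (the abstract does not specify the exact design), and the variance components are hypothetical values, not results from the study.

```python
def dependability(var, n_sessions, n_runs):
    """Dependability coefficient (Phi) for a crossed person x session x run design.

    `var` maps each variance component to its estimate:
    p (person), s (session), r (run), ps, pr, sr, and psr_e (residual).
    Absolute error variance divides each component by the number of
    conditions it is averaged over.
    """
    error = (
        (var["s"] + var["ps"]) / n_sessions
        + (var["r"] + var["pr"]) / n_runs
        + (var["sr"] + var["psr_e"]) / (n_sessions * n_runs)
    )
    return var["p"] / (var["p"] + error)


# Hypothetical variance components (NOT taken from the paper), chosen so that
# the person-by-session interaction exceeds the person-by-run interaction.
HYPOTHETICAL = {
    "p": 0.30, "s": 0.00, "r": 0.00,
    "ps": 0.20, "pr": 0.02, "sr": 0.00, "psr_e": 0.48,
}

if __name__ == "__main__":
    for n_sessions, n_runs in [(1, 1), (1, 2), (2, 1), (4, 1), (1, 4)]:
        phi = dependability(HYPOTHETICAL, n_sessions, n_runs)
        print(f"sessions={n_sessions} runs={n_runs} Phi={phi:.3f}")
```

Under these assumed components, doubling the number of sessions raises the coefficient more than doubling the number of runs, because the person-by-session term is divided only by the number of sessions. With different variance components the ordering could reverse.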