Trial and error: A hierarchical modeling approach to test-retest reliability.

Affiliations

Scientific and Statistical Computing Core, National Institute of Mental Health, USA.

Section on Development and Affective Neuroscience, National Institute of Mental Health, USA.

Publication information

Neuroimage. 2021 Dec 15;245:118647. doi: 10.1016/j.neuroimage.2021.118647. Epub 2021 Oct 22.

DOI:10.1016/j.neuroimage.2021.118647

PMID:34688897

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10241320/

Abstract

The concept of test-retest reliability indexes the consistency of a measurement across time. High reliability is critical for any scientific study, but specifically for the study of individual differences. Evidence of poor reliability of commonly used behavioral and functional neuroimaging tasks is mounting. Reports on low reliability of task-based fMRI have called into question the adequacy of using even the most common, well-characterized cognitive tasks with robust population-level effects, to measure individual differences. Here, we lay out a hierarchical framework that estimates reliability as a correlation divorced from trial-level variability, and show that reliability tends to be underestimated under the conventional intraclass correlation framework through summary statistics based on condition-level modeling. In addition, we examine how reliability estimation between the two statistical frameworks diverges and assess how different factors (e.g., trial and subject sample sizes, relative magnitude of cross-trial variability) impact reliability estimates. As empirical data indicate that cross-trial variability is large in most tasks, this work highlights that a large number of trials (e.g., greater than 100) may be required to achieve precise reliability estimates. We reference the tools TRR and 3dLMEr for the community to apply trial-level models to behavior and neuroimaging data and discuss how to make these new measurements most useful for future studies.
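The attenuation described in the abstract can be illustrated with a toy simulation. This is only a sketch under stated assumptions, not the authors' hierarchical model and not the TRR or 3dLMEr implementation. It assumes each subject has a session-level effect with standard deviation `tau`, that the two sessions correlate at `rho` (the "true" reliability), and that every trial adds independent Gaussian noise with standard deviation `sigma`. A Pearson correlation between per-session trial averages stands in for the conventional condition-level estimate. All names and parameter values are hypothetical.

```python
import math
import random


def expected_attenuated_corr(rho, tau, sigma, n_trials):
    """Correlation between session-level trial averages implied by the toy model."""
    return rho * tau**2 / (tau**2 + sigma**2 / n_trials)


def pearson(x, y):
    """Plain Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def simulate_observed_corr(rho, tau, sigma, n_trials, n_subjects, seed=0):
    """Simulate two sessions and correlate the per-session trial averages."""
    rng = random.Random(seed)
    session1, session2 = [], []
    for _ in range(n_subjects):
        # Subject-level effects for the two sessions, correlated at rho.
        z1, z2 = rng.gauss(0, 1), rng.gauss(0, 1)
        mu1 = tau * z1
        mu2 = tau * (rho * z1 + math.sqrt(1 - rho**2) * z2)
        # Condition-level summary: average of noisy trial-level effects.
        session1.append(
            sum(mu1 + rng.gauss(0, sigma) for _ in range(n_trials)) / n_trials
        )
        session2.append(
            sum(mu2 + rng.gauss(0, sigma) for _ in range(n_trials)) / n_trials
        )
    return pearson(session1, session2)


# Hypothetical setting: cross-trial noise ten times the cross-subject spread.
for n_trials in (10, 100, 500):
    theory = expected_attenuated_corr(0.8, 1.0, 10.0, n_trials)
    observed = simulate_observed_corr(0.8, 1.0, 10.0, n_trials, n_subjects=200)
    print(f"trials={n_trials:4d}  expected={theory:.3f}  simulated={observed:.3f}")
```

With these assumed values, the formula gives roughly 0.07, 0.40, and 0.67 for 10, 100, and 500 trials, all below the assumed true reliability of 0.8. The gap shrinks only as the trial count grows, which is the mechanism behind the abstract's point that more than 100 trials may be needed when cross-trial variability is large.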

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7020/10241320/95b757e35a07/nihms-1897383-f0001.jpg

Similar articles

Test-retest reliability and sample size estimates after MRI scanner relocation.

Neuroimage. 2020 May 1;211:116608. doi: 10.1016/j.neuroimage.2020.116608. Epub 2020 Feb 4.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Hyperbolic trade-off: The importance of balancing trial and subject sample sizes in neuroimaging.

Neuroimage. 2022 Feb 15;247:118786. doi: 10.1016/j.neuroimage.2021.118786. Epub 2021 Dec 11.

Assessing reliability in neuroimaging research through intra-class effect decomposition (ICED).

Elife. 2018 Jul 2;7:e35718. doi: 10.7554/eLife.35718.

Test-retest reliability of FreeSurfer automated hippocampal subfield segmentation within and across scanners.

Neuroimage. 2020 Apr 15;210:116563. doi: 10.1016/j.neuroimage.2020.116563. Epub 2020 Jan 21.

Test-retest reliability of evoked BOLD signals from a cognitive-emotive fMRI test battery.

Neuroimage. 2012 Apr 15;60(3):1746-58. doi: 10.1016/j.neuroimage.2012.01.129. Epub 2012 Feb 8.

To pool or not to pool: Can we ignore cross-trial variability in FMRI?

Neuroimage. 2021 Jan 15;225:117496. doi: 10.1016/j.neuroimage.2020.117496. Epub 2020 Oct 24.

Test-retest reliability of fMRI verbal episodic memory paradigms in healthy older adults and in persons with mild cognitive impairment.

Hum Brain Mapp. 2009 Dec;30(12):4033-47. doi: 10.1002/hbm.20827.

Test-retest reliability of dynamic functional connectivity in naturalistic paradigm functional magnetic resonance imaging.

Hum Brain Mapp. 2022 Mar;43(4):1463-1476. doi: 10.1002/hbm.25736. Epub 2021 Dec 6.

Cited by

Sources of information waste in neuroimaging: mishandling structures, thinking dichotomously, and over-reducing data.

Apert Neuro. 2022;2. doi: 10.52294/apertureneuro.2022.2.zrji8542.

Habenula alterations in resting state functional connectivity among autistic individuals.

bioRxiv. 2025 May 14:2025.05.14.653992. doi: 10.1101/2025.05.14.653992.

Go Figure: Transparency in neuroscience images preserves context and clarifies interpretation.

ArXiv. 2025 Apr 10:arXiv:2504.07824v1.

Improving accuracy and precision of heritability estimation in twin studies through hierarchical modeling: reassessing the measurement error assumption.

Front Genet. 2025 Apr 2;16:1522729. doi: 10.3389/fgene.2025.1522729. eCollection 2025.

Using Machine Learning to Determine a Functional Classifier of Retaliation and Its Association With Aggression.

JAACAP Open. 2024 Jun 8;3(1):137-146. doi: 10.1016/j.jaacop.2024.04.007. eCollection 2025 Mar.

Trait state occasion (TSO) modeling of event-related potentials (ERPs).

Biol Psychol. 2025 Mar;196:109000. doi: 10.1016/j.biopsycho.2025.109000. Epub 2025 Mar 8.

Complementary benefits of multivariate and hierarchical models for identifying individual differences in cognitive control.

Imaging Neurosci (Camb). 2025 Feb 10;3. doi: 10.1162/imag_a_00447. eCollection 2025 Feb 1.

Group-to-individual generalizability and individual-level inferences in cognitive neuroscience.

Neurosci Biobehav Rev. 2025 Feb;169:106024. doi: 10.1016/j.neubiorev.2025.106024. Epub 2025 Jan 30.

Using machine learning to determine a functional classifier of reward responsiveness and its association with adolescent psychiatric symptomatology.

Psychol Med. 2024 Nov 18;54(15):1-10. doi: 10.1017/S003329172400240X.

Childhood neglect is associated with alterations in neural prediction error signaling and the response to novelty.

Psychol Med. 2024 Oct 24;54(14):1-9. doi: 10.1017/S0033291724002411.
