The complexity of measuring reliability in learning tasks: An illustration using the Alternating Serial Reaction Time Task.

Affiliations

Université Paris-Saclay, UVSQ, Inserm, CESP, 94807, Villejuif, France.

Institut du Psychotraumatisme de l'Enfant et de l'Adolescent, Conseil Départemental Yvelines et Hauts-de-Seine, CH Versailles, 78000, Versailles, France.

Publication information

Behav Res Methods. 2024 Jan;56(1):301-317. doi: 10.3758/s13428-022-02038-5. Epub 2023 Jan 5.

DOI:10.3758/s13428-022-02038-5

PMID:36604378

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC10794483/

Abstract

Despite the fact that reliability estimation is crucial for robust inference, it is underutilized in neuroscience and cognitive psychology. Appreciating reliability can help researchers increase statistical power, effect sizes, and reproducibility, decrease the impact of measurement error, and inform methodological choices. However, accurately calculating reliability for many experimental learning tasks is challenging. In this study, we highlight a number of these issues, and estimate multiple metrics of internal consistency and split-half reliability of a widely used learning task on a large sample of 180 subjects. We show how pre-processing choices, task length, and sample size can affect reliability and its estimation. Our results show that the Alternating Serial Reaction Time Task has respectable reliability, especially when learning scores are calculated based on reaction times and two-stage averaging. We also show that a task length of 25 blocks can be sufficient to meet the usual thresholds for minimally acceptable reliability. We further illustrate how relying on a single point estimate of reliability can be misleading, and the calculation of multiple metrics, along with their uncertainties, can lead to a more complete characterization of the psychometric properties of tasks.
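The abstract mentions split-half reliability, the effect of task length, and the value of reporting uncertainty alongside a point estimate. The following is a minimal Python sketch of those ideas, not the authors' analysis pipeline. It assumes a hypothetical subjects-by-blocks matrix of per-block learning scores, uses purely synthetic data (the simulation parameters are arbitrary), and all function names are illustrative. Random splits are corrected with the Spearman-Brown formula, and a bootstrap over subjects gives a rough interval.

```python
import numpy as np


def spearman_brown(r, factor=2.0):
    """Projected reliability when the task length is multiplied by `factor`."""
    return factor * r / (1.0 + (factor - 1.0) * r)


def split_half_estimates(scores, n_splits=100, rng=None):
    """Spearman-Brown corrected split-half reliability for random splits.

    `scores` is a hypothetical (n_subjects, n_blocks) array of learning scores.
    Returns one corrected estimate per random split of the blocks.
    """
    rng = np.random.default_rng(rng)
    n_blocks = scores.shape[1]
    half = n_blocks // 2  # with an odd block count, one block is left out
    estimates = np.empty(n_splits)
    for i in range(n_splits):
        order = rng.permutation(n_blocks)
        first = scores[:, order[:half]].mean(axis=1)
        second = scores[:, order[half:2 * half]].mean(axis=1)
        r = np.corrcoef(first, second)[0, 1]
        estimates[i] = spearman_brown(r)  # correct half-length r to full length
    return estimates


def bootstrap_interval(scores, n_boot=200, n_splits=20, seed=0):
    """Percentile interval from resampling subjects with replacement."""
    rng = np.random.default_rng(seed)
    n_subjects = scores.shape[0]
    boot = np.empty(n_boot)
    for b in range(n_boot):
        rows = rng.integers(0, n_subjects, size=n_subjects)
        boot[b] = split_half_estimates(scores[rows], n_splits, rng).mean()
    return np.percentile(boot, [2.5, 97.5])


def simulate_scores(n_subjects=180, n_blocks=24, noise_sd=3.0, seed=1):
    """Synthetic data: a stable per-subject score plus independent block noise."""
    rng = np.random.default_rng(seed)
    true_score = rng.normal(0.0, 1.0, size=(n_subjects, 1))
    noise = rng.normal(0.0, noise_sd, size=(n_subjects, n_blocks))
    return true_score + noise


if __name__ == "__main__":
    scores = simulate_scores()
    est = split_half_estimates(scores, n_splits=500, rng=0)
    low, high = bootstrap_interval(scores)
    print(f"mean corrected split-half: {est.mean():.2f}")
    print(f"spread across splits (SD): {est.std():.2f}")
    print(f"95% bootstrap interval: [{low:.2f}, {high:.2f}]")
    print(f"projected if task length doubled: {spearman_brown(est.mean()):.2f}")
```

Averaging over many random splits avoids depending on one arbitrary partition such as odd versus even blocks, and the bootstrap interval shows how much the estimate could move with a different sample of subjects.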

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/740f/10794483/f984ceafff08/13428_2022_2038_Fig1_HTML.jpg

Similar articles

Reaction-time task reliability is more accurately computed with permutation-based split-half correlations than with Cronbach's alpha.

Psychon Bull Rev. 2025 Apr;32(2):652-673. doi: 10.3758/s13423-024-02597-y. Epub 2024 Oct 23.

The reliability of the serial reaction time task: meta-analysis of test-retest correlations.

R Soc Open Sci. 2023 Jul 19;10(7):221542. doi: 10.1098/rsos.221542. eCollection 2023 Jul.

Individual differences in implicit motor learning: task specificity in sensorimotor adaptation and sequence learning.

J Neurophysiol. 2017 Jan 1;117(1):412-428. doi: 10.1152/jn.01141.2015. Epub 2016 Nov 2.

Psychometric Reliability of ERN and Pe Across Flanker, Stroop, and Go/No-Go Tasks: A Direct and Conceptual Replication.

Psychophysiology. 2025 Apr;62(4):e70042. doi: 10.1111/psyp.70042.

Use of internal consistency coefficients for estimating reliability of experimental task scores.

Psychon Bull Rev. 2016 Jun;23(3):750-63. doi: 10.3758/s13423-015-0968-3.

Correction: The complexity of measuring reliability in learning tasks: An illustration using the Alternating Serial Reaction Time Task.

Behav Res Methods. 2025 Jun 3;57(7):185. doi: 10.3758/s13428-025-02670-x.

Psychometric properties of reaction time based experimental paradigms measuring anxiety-related information-processing biases in children.

J Anxiety Disord. 2014 Jan;28(1):97-107. doi: 10.1016/j.janxdis.2013.11.004. Epub 2013 Nov 25.

An Internal Consistency Reliability Study of the Catalyst Datafinch Applied Behavior Analysis Data Collection Application With Autistic Individuals.

Cureus. 2024 Apr 16;16(4):e58379. doi: 10.7759/cureus.58379. eCollection 2024 Apr.

Psychometric properties of the Chinese version of scales of knowledge, attitude, and practice of self-care for patients with arteriovenous fistula: a translation and verification study.

Front Public Health. 2025 Apr 28;13:1588271. doi: 10.3389/fpubh.2025.1588271. eCollection 2025.

Cited by

The interplay between executive functions and updating predictive representations.

Sci Rep. 2025 Aug 20;15(1):30555. doi: 10.1038/s41598-025-14876-2.

Individual differences in probabilistic learning and updating predictive representations in individuals with obsessive-compulsive tendencies.

BMC Psychiatry. 2025 Apr 11;25(1):368. doi: 10.1186/s12888-025-06786-4.

Enhancing retrieval capacity of the predictive brain through dorsolateral prefrontal cortex intervention.

Cereb Cortex. 2025 Feb 5;35(2). doi: 10.1093/cercor/bhaf005.

Identifying Transfer Learning in the Reshaping of Inductive Biases.

Open Mind (Camb). 2024 Sep 15;8:1107-1128. doi: 10.1162/opmi_a_00158. eCollection 2024.

Top-down and bottom-up oscillatory dynamics regulate implicit visuomotor sequence learning.

Cereb Cortex. 2024 Jul 3;34(7). doi: 10.1093/cercor/bhae266.

Evidence for a competitive relationship between executive functions and statistical learning.

NPJ Sci Learn. 2024 Apr 12;9(1):30. doi: 10.1038/s41539-024-00243-9.

Resting network architecture of theta oscillations reflects hyper-learning of sensorimotor information in Gilles de la Tourette syndrome.

Brain Commun. 2024 Mar 14;6(2):fcae092. doi: 10.1093/braincomms/fcae092. eCollection 2024.

Reliability of the serial reaction time task: If at first you don't succeed, try, try, try again.

Q J Exp Psychol (Hove). 2024 Nov;77(11):2256-2282. doi: 10.1177/17470218241232347. Epub 2024 Mar 7.

Reliability of individual differences in distractor suppression driven by statistical learning.

Behav Res Methods. 2024 Mar;56(3):2437-2451. doi: 10.3758/s13428-023-02157-7. Epub 2023 Jul 25.

Intact predictive processing in autistic adults: evidence from statistical learning.

Sci Rep. 2023 Jul 22;13(1):11873. doi: 10.1038/s41598-023-38708-3.

