• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

与使用克朗巴哈系数相比,反应时间任务的信度通过基于排列的折半相关计算更为准确。

Reaction-time task reliability is more accurately computed with permutation-based split-half correlations than with Cronbach's alpha.

作者信息

Kahveci Sercan, Bathke Arne C, Blechert Jens

机构信息

Department of Psychology, Paris-Lodron-University of Salzburg, Hellbrunner Straße 34, 5020, Salzburg, Austria.

Centre for Cognitive Neuroscience, Paris-Lodron-University of Salzburg, Hellbrunner Straße 34, 5020, Salzburg, Austria.

出版信息

Psychon Bull Rev. 2025 Apr;32(2):652-673. doi: 10.3758/s13423-024-02597-y. Epub 2024 Oct 23.

DOI:10.3758/s13423-024-02597-y
PMID:39443394
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12000231/
Abstract

While it has become standard practice to report the reliability of self-report scales, it remains uncommon to do the same for experimental paradigms. To facilitate this practice, we review old and new ways to compute reliability in reaction-time tasks, and we compare their accuracy using a simulation study. Highly inaccurate and negatively biased reliability estimates are obtained through the common practice of averaging sets of trials and submitting them to Cronbach's alpha. Much more accurate reliability estimates are obtained using split-half reliability methods, especially by computing many random split-half correlations and aggregating them in a metric known as permutation-based split-half reliability. Through reanalysis of existing data and comparison of reliability values reported in the literature, we confirm that Cronbach's alpha also tends to be lower than split-half reliability in real data. We further establish a set of practices to maximize the accuracy of the permutation-based split-half reliability coefficient through simulations. We find that its accuracy is improved by ensuring each split-half dataset contains an approximately equal number of trials for each stimulus, by correcting the averaged correlation for test length using a modified variant of the Spearman-Brown formula, and by computing a sufficient number of split-half correlations: around 5,400 are needed to obtain a stable estimate for median-based double-difference scores computed from 30 participants and 256 trials. To conclude, we review the available software for computing this coefficient.

摘要

虽然报告自我报告量表的信度已成为标准做法,但对实验范式进行同样的操作仍不常见。为了促进这种做法,我们回顾了计算反应时任务中信度的新旧方法,并通过模拟研究比较了它们的准确性。通过对试验集进行平均并将其提交给克朗巴哈系数这一常见做法,会得到高度不准确且有负偏差的信度估计值。使用分半信度方法可获得更准确的信度估计值,特别是通过计算许多随机分半相关性并将它们汇总到一种称为基于排列的分半信度的度量中。通过对现有数据的重新分析以及对文献中报告的信度值的比较,我们证实,在实际数据中,克朗巴哈系数也往往低于分半信度。我们通过模拟进一步确立了一套做法,以最大限度地提高基于排列的分半信度系数的准确性。我们发现,通过确保每个分半数据集对每个刺激包含大致相等数量的试验、使用斯皮尔曼 - 布朗公式的修正变体校正平均相关性的测试长度以及计算足够数量的分半相关性,可提高其准确性:对于由30名参与者和256次试验计算出的基于中位数的双差异分数,需要大约5400次分半相关性才能获得稳定的估计值。最后,我们回顾了用于计算该系数的可用软件。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b6b3/12000231/4ab3299af24c/13423_2024_2597_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b6b3/12000231/99fb21342867/13423_2024_2597_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b6b3/12000231/161e9a60488c/13423_2024_2597_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b6b3/12000231/a818c6ee8dae/13423_2024_2597_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b6b3/12000231/e60582758dbf/13423_2024_2597_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b6b3/12000231/86fe4dc123d0/13423_2024_2597_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b6b3/12000231/aad4b2cc1987/13423_2024_2597_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b6b3/12000231/32150a46329d/13423_2024_2597_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b6b3/12000231/af7fb9d91fdf/13423_2024_2597_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b6b3/12000231/4ab3299af24c/13423_2024_2597_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b6b3/12000231/99fb21342867/13423_2024_2597_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b6b3/12000231/161e9a60488c/13423_2024_2597_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b6b3/12000231/a818c6ee8dae/13423_2024_2597_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b6b3/12000231/e60582758dbf/13423_2024_2597_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b6b3/12000231/86fe4dc123d0/13423_2024_2597_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b6b3/12000231/aad4b2cc1987/13423_2024_2597_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b6b3/12000231/32150a46329d/13423_2024_2597_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b6b3/12000231/af7fb9d91fdf/13423_2024_2597_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b6b3/12000231/4ab3299af24c/13423_2024_2597_Fig9_HTML.jpg

相似文献

1
Reaction-time task reliability is more accurately computed with permutation-based split-half correlations than with Cronbach's alpha.与使用克朗巴哈系数相比,反应时间任务的信度通过基于排列的折半相关计算更为准确。
Psychon Bull Rev. 2025 Apr;32(2):652-673. doi: 10.3758/s13423-024-02597-y. Epub 2024 Oct 23.
2
An Internal Consistency Reliability Study of the Catalyst Datafinch Applied Behavior Analysis Data Collection Application With Autistic Individuals.针对自闭症个体的Catalyst Datafinch应用行为分析数据收集应用程序的内部一致性信度研究。
Cureus. 2024 Apr 16;16(4):e58379. doi: 10.7759/cureus.58379. eCollection 2024 Apr.
3
Spearman-Brown prophecy formula and Cronbach's alpha: different faces of reliability and opportunities for new applications.斯皮尔曼-布朗预测公式与克朗巴哈α系数:可靠性的不同面貌及新应用机遇
J Clin Epidemiol. 2017 May;85:45-49. doi: 10.1016/j.jclinepi.2017.01.013. Epub 2017 Mar 22.
4
The complexity of measuring reliability in learning tasks: An illustration using the Alternating Serial Reaction Time Task.学习任务中可靠性测量的复杂性:以交替序列反应时任务为例的说明。
Behav Res Methods. 2024 Jan;56(1):301-317. doi: 10.3758/s13428-022-02038-5. Epub 2023 Jan 5.
5
Thanks coefficient alpha, we'll take it from here.谢谢克朗巴哈系数,接下来我们自己来。
Psychol Methods. 2018 Sep;23(3):412-433. doi: 10.1037/met0000144. Epub 2017 May 29.
6
Estimating the reliability of a test split into two parts of equal or unequal length.估计分为长度相等或不等的两部分的测试的可靠性。
Psychol Methods. 2003 Mar;8(1):102-9. doi: 10.1037/1082-989x.8.1.102.
7
A better estimate of the internal consistency reliability of frontal EEG asymmetry scores.对额叶脑电图不对称性评分的内部一致性信度的更好估计。
Psychophysiology. 2009 Jan;46(1):132-42. doi: 10.1111/j.1469-8986.2008.00759.x. Epub 2008 Nov 26.
8
The Outpatient Experience Questionnaire of comprehensive public hospital in China: development, validity and reliability.中国综合公立医院门诊患者体验问卷:编制、效度与信度
Int J Qual Health Care. 2017 Feb 1;29(1):40-46. doi: 10.1093/intqhc/mzw133.
9
The Turkish version of nursing practice readiness scale: Cross-cultural adaptation and psychometric evaluation.护理实践准备就绪量表的土耳其语版本:跨文化调适和心理计量评估。
Int J Nurs Pract. 2024 Aug;30(4):e13254. doi: 10.1111/ijn.13254. Epub 2024 Apr 4.
10
Validity and reliability of the Turkish version of the Implementation Leadership Scale in the context of nurses.护士情境下实施领导力量表土耳其语版的有效性和信度。
BMC Health Serv Res. 2024 Nov 5;24(1):1347. doi: 10.1186/s12913-024-11721-6.

引用本文的文献

1
Anxiety Symptom Severity and Implicit and Explicit Self-As-Anxious Associations in a Large Online Sample of U.S. Adults: Trends From 2011 to 2022.美国成年人大型在线样本中的焦虑症状严重程度以及内隐和外显的自我焦虑关联:2011年至2022年的趋势
Clin Psychol Sci. 2025 Aug 12. doi: 10.1177/21677026251359621.

本文引用的文献

1
Measuring attentional bias using the dot-probe task in young women: Psychometric properties and feasibility of response-based computations, dwell time, and the N2pc component.使用点探测任务测量年轻女性的注意偏向:基于反应的计算、注视时间和N2pc成分的心理测量特性及可行性
J Behav Ther Exp Psychiatry. 2025 Sep;88:102036. doi: 10.1016/j.jbtep.2025.102036. Epub 2025 Apr 10.
2
A multiverse assessment of the reliability of the self-matching task as a measurement of the self-prioritization effect.作为自我优先效应测量手段的自我匹配任务可靠性的多宇宙评估。
Behav Res Methods. 2025 Jan 2;57(1):37. doi: 10.3758/s13428-024-02538-6.
3
Avoidance biases for vaping stimuli among college students with electronic-cigarette use.
大学生电子烟使用者对蒸气类产品刺激的回避偏见。
Addict Behav. 2024 Apr;151:107934. doi: 10.1016/j.addbeh.2023.107934. Epub 2023 Dec 10.
4
How pre-processing decisions affect the reliability and validity of the approach-avoidance task: Evidence from simulations and multiverse analyses with six datasets.预处理决策如何影响趋近回避任务的可靠性和有效性:来自六个数据集的模拟和多元宇宙分析的证据。
Behav Res Methods. 2024 Mar;56(3):1551-1582. doi: 10.3758/s13428-023-02109-1. Epub 2023 May 23.
5
Effect of data leakage in brain MRI classification using 2D convolutional neural networks.二维卷积神经网络在脑 MRI 分类中数据泄露的影响。
Sci Rep. 2021 Nov 19;11(1):22544. doi: 10.1038/s41598-021-01681-w.
6
Emotional Infant Face Processing in Women With Major Depression and Expecting Parents With Depressive Symptoms.患有重度抑郁症的女性及有抑郁症状的准父母对婴儿情绪面孔的加工
Front Psychol. 2021 Jul 2;12:657269. doi: 10.3389/fpsyg.2021.657269. eCollection 2021.
7
Grab it or not? Measuring avoidance of spiders with touchscreen-based hand movements.抓还是不抓?用基于触摸屏的手部运动来测量对蜘蛛的回避行为。
J Behav Ther Exp Psychiatry. 2021 Dec;73:101670. doi: 10.1016/j.jbtep.2021.101670. Epub 2021 Jun 16.
8
Methods to split cognitive task data for estimating split-half reliability: A comprehensive review and systematic assessment.将认知任务数据分割以估计分半可靠性的方法:全面综述与系统评估。
Psychon Bull Rev. 2022 Feb;29(1):44-54. doi: 10.3758/s13423-021-01948-3. Epub 2021 Jun 7.
9
Food approach bias is moderated by the desire to eat specific foods.食物趋近偏差受特定食物进食欲望的调节。
Appetite. 2020 Nov 1;154:104758. doi: 10.1016/j.appet.2020.104758. Epub 2020 Jun 11.
10
Reliability and validity of measures of attentional bias towards threat in unselected student samples: seek, but will you find?未选择学生样本中对威胁的注意偏向的测量的信度和效度:寻求,但你会发现吗?
Cogn Emot. 2020 Mar;34(2):217-228. doi: 10.1080/02699931.2019.1609423. Epub 2019 May 2.