• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

贝叶斯检验用于量化复制尝试的结果。

Bayesian tests to quantify the result of a replication attempt.

机构信息

Department of Psychology, University of Amsterdam.

出版信息

J Exp Psychol Gen. 2014 Aug;143(4):1457-75. doi: 10.1037/a0036731. Epub 2014 May 26.

DOI:10.1037/a0036731
PMID:24867486
Abstract

Replication attempts are essential to the empirical sciences. Successful replication attempts increase researchers' confidence in the presence of an effect, whereas failed replication attempts induce skepticism and doubt. However, it is often unclear to what extent a replication attempt results in success or failure. To quantify replication outcomes we propose a novel Bayesian replication test that compares the adequacy of 2 competing hypotheses. The 1st hypothesis is that of the skeptic and holds that the effect is spurious; this is the null hypothesis that postulates a zero effect size, H₀ : δ = 0. The 2nd hypothesis is that of the proponent and holds that the effect is consistent with the one found in the original study, an effect that can be quantified by a posterior distribution. Hence, the 2nd hypothesis-the replication hypothesis-is given by Hr : δ ∼ "posterior distribution from original study." The weighted-likelihood ratio between H₀ and Hr quantifies the evidence that the data provide for replication success and failure. In addition to the new test, we present several other Bayesian tests that address different but related questions concerning a replication study. These tests pertain to the independent conclusions of the separate experiments, the difference in effect size between the original experiment and the replication attempt, and the overall conclusion based on the pooled results. Together, this suite of Bayesian tests allows a relatively complete formalization of the way in which the result of a replication attempt alters our knowledge of the phenomenon at hand. The use of all Bayesian replication tests is illustrated with 3 examples from the literature. For experiments analyzed using the t test, computation of the new replication test only requires the t values and the numbers of participants from the original study and the replication study.

摘要

复制尝试对于经验科学至关重要。成功的复制尝试会增加研究人员对存在效应的信心,而失败的复制尝试则会引起怀疑和质疑。然而,通常不清楚复制尝试在多大程度上取得成功或失败。为了量化复制结果,我们提出了一种新的贝叶斯复制检验方法,该方法比较了两个竞争假设的充分性。第一个假设是怀疑论者的假设,认为效应是虚假的;这是零效应大小的零假设,H₀:δ=0。第二个假设是支持者的假设,认为效应与原始研究中发现的效应一致,该效应可以通过后验分布来量化。因此,第二个假设——复制假设——由 Hr:δ∼“原始研究的后验分布”给出。H₀和 Hr 之间的加权似然比量化了数据为复制成功和失败提供的证据。除了新的检验方法,我们还提出了其他几种贝叶斯检验方法,这些方法解决了与复制研究相关的不同但相关的问题。这些检验方法涉及到独立实验的独立结论、原始实验和复制尝试之间的效应大小差异,以及基于汇总结果的总体结论。这一系列贝叶斯检验方法一起允许相对完整地形式化复制尝试的结果如何改变我们对当前现象的认识。使用三个来自文献的例子说明了所有贝叶斯复制检验的用法。对于使用 t 检验进行分析的实验,新复制检验的计算仅需要原始研究和复制研究的 t 值和参与者数量。

相似文献

1
Bayesian tests to quantify the result of a replication attempt.贝叶斯检验用于量化复制尝试的结果。
J Exp Psychol Gen. 2014 Aug;143(4):1457-75. doi: 10.1037/a0036731. Epub 2014 May 26.
2
A Bayesian Perspective on the Reproducibility Project: Psychology.关于“可重复性项目:心理学”的贝叶斯视角
PLoS One. 2016 Feb 26;11(2):e0149794. doi: 10.1371/journal.pone.0149794. eCollection 2016.
3
Bayesian design of single-arm phase II clinical trials with continuous monitoring.具有连续监测的单臂II期临床试验的贝叶斯设计
Clin Trials. 2009 Jun;6(3):217-26. doi: 10.1177/1740774509105221.
4
A purely confirmatory replication study of structural brain-behavior correlations.一项关于大脑结构与行为相关性的纯验证性重复研究。
Cortex. 2015 May;66:115-33. doi: 10.1016/j.cortex.2014.11.019. Epub 2015 Jan 14.
5
Replication of null results: Absence of evidence or evidence of absence?重复零结果:无证据还是缺乏证据?
Elife. 2024 May 13;12:RP92311. doi: 10.7554/eLife.92311.
6
Empirical Bayes interval estimates that are conditionally equal to unadjusted confidence intervals or to default prior credibility intervals.经验贝叶斯区间估计在条件上等同于未调整的置信区间或默认的先验可信区间。
Stat Appl Genet Mol Biol. 2012 Feb 21;11(3):Article 7. doi: 10.1515/1544-6115.1765.
7
Bayesian hypothesis testing for single-subject designs.贝叶斯假设检验在单被试设计中的应用。
Psychol Methods. 2013 Jun;18(2):165-85. doi: 10.1037/a0031037. Epub 2013 Mar 4.
8
Bayes factor approaches for testing interval null hypotheses.贝叶斯因子方法在区间零假设检验中的应用。
Psychol Methods. 2011 Dec;16(4):406-19. doi: 10.1037/a0024377. Epub 2011 Jul 25.
9
Bayesian hypothesis testing for psychologists: a tutorial on the Savage-Dickey method.贝叶斯假设检验对心理学家来说:萨维奇-迪基方法教程。
Cogn Psychol. 2010 May;60(3):158-89. doi: 10.1016/j.cogpsych.2009.12.001. Epub 2010 Jan 12.
10
How to quantify the evidence for the absence of a correlation.如何量化不存在相关性的证据。
Behav Res Methods. 2016 Jun;48(2):413-26. doi: 10.3758/s13428-015-0593-0.

引用本文的文献

1
A scoping review on metrics to quantify reproducibility: a multitude of questions leads to a multitude of metrics.关于量化可重复性指标的范围综述:众多问题催生众多指标。
R Soc Open Sci. 2025 Jul 15;12(7):242076. doi: 10.1098/rsos.242076. eCollection 2025 Jul.
2
Trunk kinematics during seated functional activities in individuals with spinal cord injury: a systematic review and meta-analysis.脊髓损伤患者坐位功能活动期间的躯干运动学:一项系统综述和荟萃分析。
Sci Rep. 2025 Jul 1;15(1):22276. doi: 10.1038/s41598-025-06765-5.
3
The neural signatures of the psychological construct "flow": A replication study.
心理建构“心流”的神经特征:一项重复研究。
Neuroimage Rep. 2022 Oct 10;2(4):100139. doi: 10.1016/j.ynirp.2022.100139. eCollection 2022 Dec.
4
A longitudinal replication study testing migration from video game loot boxes to gambling in British Columbia, Canada.一项在加拿大不列颠哥伦比亚省进行的纵向重复研究,测试从电子游戏开箱到赌博的转变情况。
BMC Psychol. 2025 Apr 30;13(1):459. doi: 10.1186/s40359-025-02766-1.
5
Do infants use cues of saliva-sharing to infer close relationships? A replication of Thomas . (2022).婴儿会利用唾液共享的线索来推断亲密关系吗?托马斯的一项复现研究(2022年)
R Soc Open Sci. 2025 Apr 9;12(4):240229. doi: 10.1098/rsos.240229. eCollection 2025 Apr.
6
How can we make sound replication decisions?我们如何做出合理的复制决策?
Proc Natl Acad Sci U S A. 2025 Feb 4;122(5):e2401236121. doi: 10.1073/pnas.2401236121. Epub 2025 Jan 27.
7
Evaluating meta-analysis as a replication success measure.评估元分析作为一种复制成功的衡量标准。
PLoS One. 2024 Dec 11;19(12):e0308495. doi: 10.1371/journal.pone.0308495. eCollection 2024.
8
Electrophysiological dynamics of salience, default mode, and frontoparietal networks during episodic memory formation and recall revealed through multi-experiment iEEG replication.通过多实验 iEEG 复制揭示了情景记忆形成和回忆过程中突显、默认模式和额顶网络的电生理动力学。
Elife. 2024 Nov 18;13:RP99018. doi: 10.7554/eLife.99018.
9
Model-averaged Bayesian t tests.模型平均贝叶斯t检验
Psychon Bull Rev. 2025 Jun;32(3):1007-1031. doi: 10.3758/s13423-024-02590-5. Epub 2024 Nov 7.
10
The quantitative paradigm and the nature of the human mind. The replication crisis as an epistemological crisis of quantitative psychology in view of the ontic nature of the psyche.定量范式与人类心智的本质。鉴于心理的本体性质,复制危机作为定量心理学的一种认识论危机。
Front Psychol. 2024 Sep 12;15:1390233. doi: 10.3389/fpsyg.2024.1390233. eCollection 2024.