
Correspondence measures for assessing replication success.

Authors

Steiner Peter M, Sheehan Patrick, Wong Vivian C

Affiliations

University of Maryland, College Park.

University of Virginia.

Publication information

Psychol Methods. 2025 Aug;30(4):793-814. doi: 10.1037/met0000597. Epub 2023 Jul 27.

DOI:10.1037/met0000597
PMID:37498693
Abstract

Given recent evidence challenging the replicability of results in the social and behavioral sciences, critical questions have been raised about appropriate measures for determining replication success in comparing effect estimates across studies. At issue is the fact that conclusions about replication success often depend on the measure used for evaluating correspondence in results. Despite the importance of choosing an appropriate measure, there is still no widespread agreement about which measures should be used. This article addresses these questions by describing formally the most commonly used measures for assessing replication success, and by comparing their performance in different contexts according to their replication probabilities, that is, the probability of obtaining replication success given study-specific settings. The measures may be characterized broadly as conclusion-based approaches, which assess the congruence of two independent studies' conclusions about the presence of an effect, and distance-based approaches, which test for a significant difference or equivalence of two effect estimates. We also introduce a new measure for assessing replication success called the correspondence test, which combines a difference and equivalence test in the same framework. To help researchers plan prospective replication efforts, we provide closed formulas for power calculations that can be used to determine the minimum detectable effect size (and thus, sample sizes) for each study so that a predetermined minimum replication probability can be achieved. Finally, we use a replication data set from the Open Science Collaboration (2015) to demonstrate the extent to which conclusions about replication success depend on the correspondence measure selected. (PsycInfo Database Record (c) 2025 APA, all rights reserved).
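
The idea of combining a difference test and an equivalence test for two effect estimates can be sketched as follows. This is only an illustrative sketch under stated assumptions, not the authors' formulas: it assumes two independent, approximately normal estimates with known standard errors, uses a z-test for the difference and two one-sided tests (TOST) for equivalence, and the function name `correspondence_test`, the `tolerance` parameter, and the four outcome labels are hypothetical names chosen for this example. The article's exact statistics and terminology may differ.

```python
from statistics import NormalDist


def correspondence_test(est1, se1, est2, se2, tolerance, alpha=0.05):
    """Classify agreement between two independent effect estimates.

    Illustrative sketch only (hypothetical helper, not the article's code).
    Assumes both estimates are approximately normal and independent.
    `tolerance` is the largest absolute difference treated as negligible.
    """
    if se1 <= 0 or se2 <= 0 or tolerance <= 0:
        raise ValueError("standard errors and tolerance must be positive")

    std_normal = NormalDist()
    diff = est1 - est2
    se_diff = (se1 ** 2 + se2 ** 2) ** 0.5

    # Difference test: two-sided z-test of H0: true difference = 0.
    p_difference = 2 * (1 - std_normal.cdf(abs(diff) / se_diff))

    # Equivalence test (TOST): H0: |true difference| >= tolerance.
    # Both one-sided tests must reject, so the larger p-value decides.
    p_lower = 1 - std_normal.cdf((diff + tolerance) / se_diff)  # H0: diff <= -tolerance
    p_upper = std_normal.cdf((diff - tolerance) / se_diff)      # H0: diff >= +tolerance
    p_equivalence = max(p_lower, p_upper)

    different = p_difference < alpha
    equivalent = p_equivalence < alpha

    # Outcome labels are assumptions made for this sketch.
    if equivalent and not different:
        outcome = "equivalence"
    elif different and not equivalent:
        outcome = "difference"
    elif different and equivalent:
        outcome = "trivial difference"
    else:
        outcome = "indeterminate"

    return {
        "outcome": outcome,
        "p_difference": p_difference,
        "p_equivalence": p_equivalence,
    }


if __name__ == "__main__":
    # Made-up numbers purely for demonstration.
    print(correspondence_test(0.30, 0.05, 0.32, 0.05, tolerance=0.2))
```

With this kind of setup, the conclusion depends on the chosen `tolerance` and `alpha` as well as on the precision of both studies: imprecise estimates can leave both tests nonsignificant, while very precise estimates can make both significant at once.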

Similar articles

1
Correspondence measures for assessing replication success.
Psychol Methods. 2025 Aug;30(4):793-814. doi: 10.1037/met0000597. Epub 2023 Jul 27.
2
Behavioral interventions to reduce risk for sexual transmission of HIV among men who have sex with men.
Cochrane Database Syst Rev. 2008 Jul 16(3):CD001230. doi: 10.1002/14651858.CD001230.pub2.
3
Sexual Harassment and Prevention Training
4
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
5
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.
Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.
6
Assessing the comparative effects of interventions in COPD: a tutorial on network meta-analysis for clinicians.
Respir Res. 2024 Dec 21;25(1):438. doi: 10.1186/s12931-024-03056-x.
7
Interventions to improve safe and effective medicines use by consumers: an overview of systematic reviews.
Cochrane Database Syst Rev. 2014 Apr 29;2014(4):CD007768. doi: 10.1002/14651858.CD007768.pub3.
8
The Black Book of Psychotropic Dosing and Monitoring.
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.
9
Eliciting adverse effects data from participants in clinical trials.
Cochrane Database Syst Rev. 2018 Jan 16;1(1):MR000039. doi: 10.1002/14651858.MR000039.pub2.
10
Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.
Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.

Cited by

1
A scoping review on metrics to quantify reproducibility: a multitude of questions leads to a multitude of metrics.
R Soc Open Sci. 2025 Jul 15;12(7):242076. doi: 10.1098/rsos.242076. eCollection 2025 Jul.