科学不是一个信号检测问题。

Science is not a signal detection problem.

机构信息

Department of Psychology, University of California San Diego, La Jolla, CA 92093

Department of Psychology, University of California San Diego, La Jolla, CA 92093.

出版信息

Proc Natl Acad Sci U S A. 2020 Mar 17;117(11):5559-5567. doi: 10.1073/pnas.1914237117. Epub 2020 Mar 3.

DOI:10.1073/pnas.1914237117

PMID:32127477

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7084063/

Abstract

The perceived replication crisis and the reforms designed to address it are grounded in the notion that science is a binary signal detection problem. However, contrary to null hypothesis significance testing (NHST) logic, the magnitude of the underlying effect size for a given experiment is best conceptualized as a random draw from a continuous distribution, not as a random draw from a dichotomous distribution (null vs. alternative). Moreover, because continuously distributed effects selected using a < 0.05 filter must be inflated, the fact that they are smaller when replicated (reflecting regression to the mean) is no reason to sound the alarm. Considered from this perspective, recent replication efforts suggest that most published < 0.05 scientific findings are "true" (i.e., in the correct direction), with observed effect sizes that are inflated to varying degrees. We propose that original science is a screening process, one that adopts NHST logic as a useful fiction for selecting true effects that are potentially large enough to be of interest to other scientists. Unlike original science, replication science seeks to precisely measure the underlying effect size associated with an experimental protocol via large- direct replication, without regard for statistical significance. Registered reports are well suited to (often resource-intensive) direct replications, which should focus on influential findings and be published regardless of outcome. Conceptual replications play an important but separate role in validating theories. However, because they are part of NHST-based original science, conceptual replications cannot serve as the field's self-correction mechanism. Only direct replications can do that.

摘要

被感知的复制危机以及为解决这一危机而进行的改革，其基础是这样一种观念，即科学是一个二元信号检测问题。然而，与零假设显著性检验（NHST）逻辑相反，给定实验的潜在效应大小最好被概念化为来自连续分布的随机抽取，而不是来自二分分布（零假设与备择假设）的随机抽取。此外，由于使用 < 0.05 滤波器选择的连续分布效应必然会被夸大，因此当它们被复制时（反映出向均值回归）较小，这并不是发出警报的理由。从这个角度来看，最近的复制努力表明，大多数已发表的 < 0.05 的科学发现都是“真实的”（即，朝着正确的方向），观察到的效应大小在不同程度上被夸大了。我们提出，原始科学是一个筛选过程，它采用 NHST 逻辑作为一种有用的虚构，以选择潜在足够大、可能引起其他科学家兴趣的真实效应。与原始科学不同，复制科学旨在通过大型直接复制，精确测量与实验方案相关的潜在效应大小，而不考虑统计显著性。注册报告非常适合（通常资源密集型）直接复制，直接复制应侧重于有影响力的发现，无论结果如何都应发表。概念复制在验证理论方面发挥着重要但独立的作用。然而，由于它们是基于 NHST 的原始科学的一部分，概念复制不能作为该领域的自我修正机制。只有直接复制才能做到这一点。

相似文献

Science is not a signal detection problem.科学不是一个信号检测问题。

Proc Natl Acad Sci U S A. 2020 Mar 17;117(11):5559-5567. doi: 10.1073/pnas.1914237117. Epub 2020 Mar 3.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区，服用抗叶酸抗疟药物的人群中，叶酸补充剂与疟疾易感性和严重程度的关系。

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Small class sizes for improving student achievement in primary and secondary schools: a systematic review.小班教学对提高中小学学生成绩的影响：一项系统综述。

Campbell Syst Rev. 2018 Oct 11;14(1):1-107. doi: 10.4073/csr.2018.10. eCollection 2018.

Bayesian inference of population prevalence.贝叶斯推断种群流行率。

Elife. 2021 Oct 6;10:e62461. doi: 10.7554/eLife.62461.

When Null Hypothesis Significance Testing Is Unsuitable for Research: A Reassessment.当零假设显著性检验不适用于研究时：重新评估

Front Hum Neurosci. 2017 Aug 3;11:390. doi: 10.3389/fnhum.2017.00390. eCollection 2017.

Erratum: Eyestalk Ablation to Increase Ovarian Maturation in Mud Crabs.勘误：切除眼柄以增加泥蟹的卵巢成熟度。

J Vis Exp. 2023 May 26(195). doi: 10.3791/6561.

Neuroscience Needs to Test Both Statistical and Scientific Hypotheses.神经科学需要同时检验统计假设和科学假设。

J Neurosci. 2022 Nov 9;42(45):8432-8438. doi: 10.1523/JNEUROSCI.1134-22.2022.

Replication marketplaces would help science to become more self-correcting.复制市场将有助于科学变得更具自我纠错能力。

R Soc Open Sci. 2024 Oct 2;11(10):240850. doi: 10.1098/rsos.240850. eCollection 2024 Oct.

The continuing misuse of null hypothesis significance testing in biological anthropology.生物人类学中持续存在的对零假设显著性检验的误用。

Am J Phys Anthropol. 2018 May;166(1):236-245. doi: 10.1002/ajpa.23399. Epub 2018 Jan 18.

Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).大分子拥挤现象：化学与物理邂逅生物学（瑞士阿斯科纳，2012年6月10日至14日）

Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2.

引用本文的文献

Behavioral interventions for waste reduction: a systematic review of experimental studies.减少废物产生的行为干预措施：实验研究的系统评价

Front Psychol. 2025 Jun 24;16:1561467. doi: 10.3389/fpsyg.2025.1561467. eCollection 2025.

Did he or didn't he? Mixed evidence for the continued influence of retracted misinformation on person impressions.他到底有没有？关于撤回的错误信息对人物印象持续影响的混合证据。

PLoS One. 2025 May 7;20(5):e0322045. doi: 10.1371/journal.pone.0322045. eCollection 2025.

Can expected error costs justify testing a hypothesis at multiple alpha levels rather than searching for an elusive optimal alpha?可以用预期误差成本来证明在多个 α 水平上检验假设是合理的，而不是寻找难以捉摸的最优 α 吗？

PLoS One. 2024 Sep 25;19(9):e0304675. doi: 10.1371/journal.pone.0304675. eCollection 2024.

Developmental psychologists should adopt citizen science to improve generalization and reproducibility.发展心理学家应采用公民科学来提高普遍性和可重复性。

Infant Child Dev. 2024 Jan-Feb;33(1). doi: 10.1002/icd.2348. Epub 2022 Aug 2.

Finding the right power balance: Better study design and collaboration can reduce dependence on statistical power.找到合适的权力平衡：更好的研究设计和协作可以减少对统计功效的依赖。

PLoS Biol. 2024 Jan 8;22(1):e3002423. doi: 10.1371/journal.pbio.3002423. eCollection 2024 Jan.

High replicability of newly discovered social-behavioural findings is achievable.新发现的社会行为研究结果具有高度可复制性。

Nat Hum Behav. 2024 Feb;8(2):311-319. doi: 10.1038/s41562-023-01749-9. Epub 2023 Nov 9.

Are most published research findings false in a continuous universe?在一个连续的宇宙中，大多数已发表的研究结果都是错误的吗？

PLoS One. 2022 Dec 20;17(12):e0277935. doi: 10.1371/journal.pone.0277935. eCollection 2022.

Sleep deprivation and memory: Meta-analytic reviews of studies on sleep deprivation before and after learning.睡眠剥夺与记忆：学习前后进行睡眠剥夺的研究的元分析综述。

Psychol Bull. 2021 Nov;147(11):1215-1240. doi: 10.1037/bul0000348.

The tainted altruism effect: a successful pre-registered replication.受污染的利他主义效应：一项成功的预注册复制研究

R Soc Open Sci. 2022 Jan 26;9(1):211152. doi: 10.1098/rsos.211152. eCollection 2022 Jan.

Acetonitrile Adducts of Tranexamic Acid as Sensitive Ions for Quantification at Residue Levels in Human Plasma by UHPLC-MS/MS.氨甲环酸的乙腈加合物作为通过超高效液相色谱-串联质谱法测定人血浆中残留水平的灵敏离子。

Pharmaceuticals (Basel). 2021 Nov 23;14(12):1205. doi: 10.3390/ph14121205.

本文引用的文献

A manifesto for reproducible science.可重复科学宣言。

Nat Hum Behav. 2017 Jan 10;1(1):0021. doi: 10.1038/s41562-016-0021.

Low replicability can support robust and efficient science.低可重复性可以支持稳健且高效的科学。

Nat Commun. 2020 Jan 17;11(1):358. doi: 10.1038/s41467-019-14203-0.

The impact of sleep on eyewitness identifications.睡眠对目击证人辨认的影响。

R Soc Open Sci. 2019 Dec 4;6(12):170501. doi: 10.1098/rsos.170501. eCollection 2019 Dec.

Comparing meta-analyses and preregistered multiple-laboratory replication projects.比较荟萃分析和预先注册的多实验室复制项目。

Nat Hum Behav. 2020 Apr;4(4):423-434. doi: 10.1038/s41562-019-0787-z. Epub 2019 Dec 23.

What's next for psychology's embattled field of social priming.陷入困境的心理学社会启动领域的下一步走向是什么。

Nature. 2019 Dec;576(7786):200-202. doi: 10.1038/d41586-019-03755-2.

Replicator degrees of freedom allow publication of misleading failures to replicate.复制者自由度允许发表具有误导性的未能复制的结果。

Proc Natl Acad Sci U S A. 2019 Dec 17;116(51):25535-25545. doi: 10.1073/pnas.1910951116. Epub 2019 Nov 25.

What's next for Registered Reports?注册报告的下一步是什么？

Nature. 2019 Sep;573(7773):187-189. doi: 10.1038/d41586-019-02674-6.

The forgotten history of signal detection theory.信号检测理论的遗忘历史。

J Exp Psychol Learn Mem Cogn. 2020 Feb;46(2):201-233. doi: 10.1037/xlm0000732. Epub 2019 Jun 27.

Improving social and behavioral science by making replication mainstream: A response to commentaries.通过将复制作为主流来改进社会和行为科学：对评论的回应。

Behav Brain Sci. 2018 Jan;41:e157. doi: 10.1017/S0140525X18000961.

Rein in the four horsemen of irreproducibility.控制住不可重复性的四大因素。

Nature. 2019 Apr;568(7753):435. doi: 10.1038/d41586-019-01307-2.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验