普遍性危机。

The generalizability crisis.

机构信息

Department of Psychology, The University of Texas at Austin, Austin, TX78712-1043,

出版信息

Behav Brain Sci. 2020 Dec 21;45:e1. doi: 10.1017/S0140525X20001685.

DOI:10.1017/S0140525X20001685

PMID:33342451

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10681374/

Abstract

Most theories and hypotheses in psychology are verbal in nature, yet their evaluation overwhelmingly relies on inferential statistical procedures. The validity of the move from qualitative to quantitative analysis depends on the verbal and statistical expressions of a hypothesis being closely aligned - that is, that the two must refer to roughly the same set of hypothetical observations. Here, I argue that many applications of statistical inference in psychology fail to meet this basic condition. Focusing on the most widely used class of model in psychology - the linear mixed model - I explore the consequences of failing to statistically operationalize verbal hypotheses in a way that respects researchers' actual generalization intentions. I demonstrate that although the "random effect" formalism is used pervasively in psychology to model intersubject variability, few researchers accord the same treatment to other variables they clearly intend to generalize over (e.g., stimuli, tasks, or research sites). The under-specification of random effects imposes far stronger constraints on the generalizability of results than most researchers appreciate. Ignoring these constraints can dramatically inflate false-positive rates, and often leads researchers to draw sweeping verbal generalizations that lack a meaningful connection to the statistical quantities they are putatively based on. I argue that failure to take the alignment between verbal and statistical expressions seriously lies at the heart of many of psychology's ongoing problems (e.g., the replication crisis), and conclude with a discussion of several potential avenues for improvement.

摘要

心理学中的大多数理论和假设都是口头的，但它们的评估主要依赖于推理统计程序。从定性分析到定量分析的转变的有效性取决于假设的口头和统计表达是否紧密一致，也就是说，两者必须大致指的是相同的假设观测集。在这里，我认为心理学中统计推断的许多应用都未能满足这一基本条件。我专注于心理学中最广泛使用的模型类别——线性混合模型，探讨了未能以尊重研究人员实际推广意图的方式对口头假设进行统计操作化的后果。我证明了尽管“随机效应”形式在心理学中被广泛用于对个体间变异性进行建模，但很少有研究人员对他们明确打算推广的其他变量（例如，刺激、任务或研究地点）给予相同的处理。随机效应的不充分指定对结果的可推广性施加了比大多数研究人员意识到的更严格的限制。忽略这些限制会极大地增加假阳性率，并且常常导致研究人员得出缺乏与他们所依据的统计数量有意义联系的全面口头概括。我认为，未能认真对待口头和统计表达之间的一致性是心理学许多持续存在的问题（例如，复制危机）的核心所在，并最后讨论了几个潜在的改进途径。

相似文献

The generalizability crisis.

Behav Brain Sci. 2020 Dec 21;45:e1. doi: 10.1017/S0140525X20001685.

Is psychology suffering from a replication crisis? What does "failure to replicate" really mean?

Am Psychol. 2015 Sep;70(6):487-98. doi: 10.1037/a0039400.

Kinds of Replication: Examining the Meanings of "Conceptual Replication" and "Direct Replication".

Perspect Psychol Sci. 2022 Sep;17(5):1490-1505. doi: 10.1177/17456916211041116. Epub 2022 Mar 4.

Why Hypothesis Testers Should Spend Less Time Testing Hypotheses.

Perspect Psychol Sci. 2021 Jul;16(4):744-755. doi: 10.1177/1745691620966795. Epub 2020 Dec 16.

Practicing what we preach in humanistic and positive psychology.

Am Psychol. 2014 Jan;69(1):90-2. doi: 10.1037/a0034868.

The Theory Crisis in Psychology: How to Move Forward.

Perspect Psychol Sci. 2021 Jul;16(4):779-788. doi: 10.1177/1745691620970586. Epub 2021 Jan 29.

Small is beautiful: In defense of the small-N design.

Psychon Bull Rev. 2018 Dec;25(6):2083-2101. doi: 10.3758/s13423-018-1451-8.

Constraints on Generality (COG): A Proposed Addition to All Empirical Papers.

Perspect Psychol Sci. 2017 Nov;12(6):1123-1128. doi: 10.1177/1745691617708630. Epub 2017 Aug 30.

A Review of Multisite Replication Projects in Social Psychology: Is It Viable to Sustain Any Confidence in Social Psychology's Knowledge Base?

Perspect Psychol Sci. 2023 Jul;18(4):912-935. doi: 10.1177/17456916221121815. Epub 2022 Nov 28.

Psychology's Replication Crisis and the Grant Culture: Righting the Ship.

Perspect Psychol Sci. 2017 Jul;12(4):660-664. doi: 10.1177/1745691616687745.

引用本文的文献

Identifying dynamic reproducible brain states using a predictive modelling approach.

Imaging Neurosci (Camb). 2025 Apr 17;3. doi: 10.1162/imag_a_00540. eCollection 2025.

The Voxelwise Encoding Model framework: A tutorial introduction to fitting encoding models to fMRI data.

Imaging Neurosci (Camb). 2025 May 9;3. doi: 10.1162/imag_a_00575. eCollection 2025.

Replicability and generalizability of the repeated exposure effect on moral condemnation of fake news.

Nat Commun. 2025 Aug 5;16(1):7206. doi: 10.1038/s41467-025-62462-x.

Predicting OCD severity from religiosity and personality: A machine learning and neural network approach.

J Mood Anxiety Disord. 2024 Oct 2;8:100089. doi: 10.1016/j.xjmad.2024.100089. eCollection 2024 Dec.

Towards collaborative data science in mental health research: The ECNP neuroimaging network accessible data repository.

Neurosci Appl. 2024 Dec 9;4:105407. doi: 10.1016/j.nsa.2024.105407. eCollection 2025.

Comparing Likert and visual analogue scales in ecological momentary assessment.

Behav Res Methods. 2025 Jul 2;57(8):217. doi: 10.3758/s13428-025-02706-2.

Big team science reveals promises and limitations of machine learning efforts to model physiological markers of affective experience.

R Soc Open Sci. 2025 Jun 25;12(6):241778. doi: 10.1098/rsos.241778. eCollection 2025 Jun.

Little evidence for a reduced late positive potential to unpleasant stimuli in major depressive disorder.

Neuroimage Rep. 2022 Jan 17;2(1):100077. doi: 10.1016/j.ynirp.2022.100077. eCollection 2022 Mar.

Affective neural signatures do not distinguish women with emotion dysregulation from healthy controls: A mega-analysis across three task-based fMRI studies.

Neuroimage Rep. 2021 May 29;1(2):100019. doi: 10.1016/j.ynirp.2021.100019. eCollection 2021 Jun.

Simplified Chinese lexicon project: A lexical decision database with 8105 characters and 4864 pseudocharacters.

Behav Res Methods. 2025 Jun 23;57(7):206. doi: 10.3758/s13428-025-02701-7.

本文引用的文献

PyMC: a modern, and comprehensive probabilistic programming framework in Python.

PeerJ Comput Sci. 2023 Sep 1;9:e1516. doi: 10.7717/peerj-cs.1516. eCollection 2023.

Stan: A Probabilistic Programming Language.

J Stat Softw. 2017;76. doi: 10.18637/jss.v076.i01. Epub 2017 Jan 11.

The revolution will not be controlled: natural stimuli in speech neuroscience.

Lang Cogn Neurosci. 2018 Jul 22;35(5):573-582. doi: 10.1080/23273798.2018.1499946. eCollection 2020.

The Psychological Science Accelerator: Advancing Psychology through a Distributed Collaborative Network.

Adv Methods Pract Psychol Sci. 2018 Dec;1(4):501-515. doi: 10.1177/2515245918797607. Epub 2018 Oct 1.

Redefine statistical significance.

Nat Hum Behav. 2018 Jan;2(1):6-10. doi: 10.1038/s41562-017-0189-z.

Genome-wide association meta-analysis in 269,867 individuals identifies new genetic and functional links to intelligence.

Nat Genet. 2018 Jul;50(7):912-919. doi: 10.1038/s41588-018-0152-6. Epub 2018 Jun 25.

Meta-analysis of genome-wide association studies for neuroticism in 449,484 individuals identifies novel genetic loci and pathways.

Nat Genet. 2018 Jul;50(7):920-927. doi: 10.1038/s41588-018-0151-7. Epub 2018 Jun 25.

Genome-wide association analyses identify 44 risk variants and refine the genetic architecture of major depression.

Nat Genet. 2018 May;50(5):668-681. doi: 10.1038/s41588-018-0090-3. Epub 2018 Apr 26.

Metastudies for robust tests of theory.

Proc Natl Acad Sci U S A. 2018 Mar 13;115(11):2607-2612. doi: 10.1073/pnas.1708285114. Epub 2018 Mar 12.

Psychology, Science, and Knowledge Construction: Broadening Perspectives from the Replication Crisis.

Annu Rev Psychol. 2018 Jan 4;69:487-510. doi: 10.1146/annurev-psych-122216-011845.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

普遍性危机。

The generalizability crisis.

机构信息

Department of Psychology, The University of Texas at Austin, Austin, TX78712-1043,

出版信息

Behav Brain Sci. 2020 Dec 21;45:e1. doi: 10.1017/S0140525X20001685.

DOI:10.1017/S0140525X20001685

PMID:33342451

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10681374/

Abstract

摘要

普遍性危机。

The generalizability crisis.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

普遍性危机。

The generalizability crisis.

机构信息

出版信息