In models we trust: preregistration, large samples, and replication may not suffice.

Author Information

Martin Spiess, Pascal Jordan

Affiliations

Institute of Psychology, Department of Psychology and Human Movement Science, University of Hamburg, Hamburg, Germany.

Publication Information

Front Psychol. 2023 Sep 21;14:1266447. doi: 10.3389/fpsyg.2023.1266447. eCollection 2023.

Abstract

Despite discussions about the replicability of findings in psychological research, two issues have been largely ignored: selection mechanisms and model assumptions. Both topics address the same fundamental question: Does the chosen statistical analysis tool adequately model the data generation process? In this article, we address both issues and show, in a first step, that in the face of selective samples, and contrary to common practice, the validity of inferences, even those based on experimental designs, can be claimed without further justification or adaptation of standard methods only in very specific situations. We then broaden our perspective to discuss the consequences of violated assumptions in linear models in the context of psychological research in general, and in generalized linear mixed models as used in item response theory. These types of misspecification are often ignored in the psychological research literature. We emphasize that the above problems cannot be overcome by strategies such as preregistration, large samples, replications, or a ban on testing null hypotheses. To avoid biased conclusions, we briefly discuss tools such as model diagnostics, statistical methods to compensate for selectivity, and semi- or non-parametric estimation. At a more fundamental level, however, a twofold strategy seems indispensable: (1) iterative, cumulative theory development based on statistical methods with theoretically justified assumptions, and (2) empirical research on variables that affect (self-)selection into the observed part of the sample, and the use of this information to compensate for selectivity.
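
The following is a minimal simulation sketch (not from the paper; the variable names and numeric settings are illustrative assumptions) of two points made in the abstract: outcome-dependent (self-)selection biases a regression slope no matter how large the sample grows, and knowledge of the variables that drive selection can be used to compensate, here via inverse-probability weighting.

```python
# Minimal sketch (illustrative assumptions, not the authors' method):
# units self-select into the observed sample with probability depending
# on the outcome y, which biases the naive least-squares slope at every
# sample size; inverse-probability weights built from the (here known)
# selection model compensate for the selectivity.
import numpy as np

rng = np.random.default_rng(0)
TRUE_SLOPE = 1.0

def slope(x, y, w=None):
    """(Weighted) least-squares slope of y on x."""
    w = np.ones_like(x) if w is None else w
    xm, ym = np.average(x, weights=w), np.average(y, weights=w)
    return float(np.sum(w * (x - xm) * (y - ym)) / np.sum(w * (x - xm) ** 2))

for n in (1_000, 100_000, 1_000_000):
    x = rng.normal(size=n)
    y = TRUE_SLOPE * x + rng.normal(size=n)
    p = 1.0 / (1.0 + np.exp(-2.0 * y))  # selection probability depends on y
    s = rng.random(n) < p               # observed part of the sample
    naive = slope(x[s], y[s])               # biased, regardless of n
    ipw = slope(x[s], y[s], w=1.0 / p[s])   # compensates for selectivity
    print(f"n={n:>9,}: naive={naive:.3f}, ipw={ipw:.3f}, true={TRUE_SLOPE}")
```

Under this setup the naive slope stays attenuated at every sample size, while the inverse-probability-weighted estimate recovers the true value, mirroring the claim that large samples alone do not suffice but that information about the selection mechanism can be used to compensate.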

Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a26c/10551181/3967db3080b9/fpsyg-14-1266447-g0001.jpg
