文献检索，用中文搜 PubMed

This article provides a comprehensive critique of psychology's overreliance on statistical modelling at the expense of epistemologically grounded measurement processes. It highlights that statistics deals with structural relations in data regardless of what these data represent, whereas measurement establishes traceable empirical relations between the phenomena studied and the data representing information about them. These crucial epistemic differences are elaborated using Rosen's general model of measurement, involving the coherent modelling of the (1) objects of research, (2) data generation (encoding), (3) formal manipulation (e.g., statistical analysis) and (4) result interpretation regarding the objects studied (decoding). This system of interrelated modelling relations is shown to underlie metrologists' approaches for tackling the problem of epistemic circularity in physical measurement, illustrated in the special cases of measurement coordination and calibration. The article then explicates psychology's challenges for establishing genuine analogues of measurement, which arise from the peculiarities of its study phenomena (e.g., higher-order complexity, non-ergodicity) and language-based methods (e.g., inbuilt semantics). It demonstrates that psychometrics cannot establish coordinated and calibrated modelling relations, thus generating only pragmatic quantifications with predictive power but precluding epistemically justified inferences on the phenomena studied. This epistemic gap is often overlooked, however, because many psychologists mistake their methods' inbuilt semantics-thus, descriptions of their study phenomena (e.g., in rating scales, item variables, statistical models)-for the phenomena described. This blurs the epistemically necessary distinction between the phenomena studied and those used as means of investigation, thereby confusing ontological with epistemological concepts-psychologists' cardinal error. Therefore, many mistake judgements of verbal statements for measurements of the phenomena described and overlook that statistics can neither establish nor analyze a model's relations to the phenomena explored. The article elaborates epistemological and methodological fundamentals to establish coherent modelling relations between real and formal study system and to distinguish the epistemic components involved, considering psychology's peculiarities. It shows that epistemically justified inferences necessitate methods for analysing individuals' unrestricted verbal responses, now advanced through artificial intelligence systems modelling natural language (e.g., NLP algorithms, LLMs). Their increasing use to generate standardised descriptions of study phenomena for rating scales and constructs, by contrast, will only perpetuate psychologists' cardinal error-and thus, psychology's crisis.

本文全面批判了心理学过度依赖统计建模而牺牲了基于认识论的测量过程的现象。文章强调，统计学处理数据中的结构关系，而不考虑这些数据代表什么，而测量则在研究的现象与表示有关这些现象信息的数据之间建立可追溯的经验关系。利用罗森的一般测量模型阐述了这些关键的认识论差异，该模型涉及对以下方面的连贯建模：（1）研究对象；（2）数据生成（编码）；（3）形式操作（如统计分析）；（4）关于所研究对象的结果解释（解码）。事实表明，这种相互关联的建模关系系统是计量学家解决物理测量中认识论循环问题方法的基础，在测量协调和校准的特殊情况下得到了说明。文章接着阐述了心理学在建立真正的测量类似物方面所面临的挑战，这些挑战源于其研究现象的特殊性（如高阶复杂性、非遍历性）和基于语言的方法（如内在语义）。文章表明，心理测量学无法建立协调和校准的建模关系，从而只能产生具有预测力的实用量化，但排除了对所研究现象的认识论上合理的推断。然而，这种认识论上的差距常常被忽视，因为许多心理学家将其方法的内在语义——即对其研究现象的描述（如在评分量表、项目变量、统计模型中）——误认为是所描述的现象。这模糊了所研究现象与用作调查手段的现象之间在认识论上必要的区别，从而将本体论概念与认识论概念混淆——这是心理学家的主要错误。因此，许多人将言语陈述的判断误认为是对所描述现象的测量，并忽视了统计学既不能建立也不能分析模型与所探索现象之间的关系。考虑到心理学的特殊性，文章阐述了认识论和方法论的基本原理，以在真实和形式化的研究系统之间建立连贯的建模关系，并区分其中涉及的认识论成分。文章表明，认识论上合理的推断需要分析个体不受限制的言语反应的方法，目前通过对自然语言进行建模的人工智能系统（如自然语言处理算法、大语言模型）取得了进展。相比之下，越来越多地使用这些系统来生成用于评分量表和结构的研究现象的标准化描述，只会使心理学家的主要错误永久化——从而导致心理学的危机。

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

统计学并非测量：心理测量量表和基于语言的模型的内在语义掩盖了关键的认知差异。

Statistics is not measurement: The inbuilt semantics of psychometric scales and language-based models obscures crucial epistemic differences.

作者信息

机构信息

出版信息

相似文献

本文引用的文献