• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

统计学并非测量:心理测量量表和基于语言的模型的内在语义掩盖了关键的认知差异。

Statistics is not measurement: The inbuilt semantics of psychometric scales and language-based models obscures crucial epistemic differences.

作者信息

Uher Jana

机构信息

School of Human Sciences, University of Greenwich, London, United Kingdom.

出版信息

Front Psychol. 2025 Jun 26;16:1534270. doi: 10.3389/fpsyg.2025.1534270. eCollection 2025.

DOI:10.3389/fpsyg.2025.1534270
PMID:40642039
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12244230/
Abstract

This article provides a comprehensive critique of psychology's overreliance on statistical modelling at the expense of epistemologically grounded measurement processes. It highlights that statistics deals with structural relations in data regardless of what these data represent, whereas measurement establishes traceable empirical relations between the phenomena studied and the data representing information about them. These crucial epistemic differences are elaborated using Rosen's general model of measurement, involving the coherent modelling of the (1) objects of research, (2) data generation (encoding), (3) formal manipulation (e.g., statistical analysis) and (4) result interpretation regarding the objects studied (decoding). This system of interrelated modelling relations is shown to underlie metrologists' approaches for tackling the problem of epistemic circularity in physical measurement, illustrated in the special cases of measurement coordination and calibration. The article then explicates psychology's challenges for establishing genuine analogues of measurement, which arise from the peculiarities of its study phenomena (e.g., higher-order complexity, non-ergodicity) and language-based methods (e.g., inbuilt semantics). It demonstrates that psychometrics cannot establish coordinated and calibrated modelling relations, thus generating only pragmatic quantifications with predictive power but precluding epistemically justified inferences on the phenomena studied. This epistemic gap is often overlooked, however, because many psychologists mistake their methods' inbuilt semantics-thus, descriptions of their study phenomena (e.g., in rating scales, item variables, statistical models)-for the phenomena described. This blurs the epistemically necessary distinction between the phenomena studied and those used as means of investigation, thereby confusing ontological with epistemological concepts-psychologists' cardinal error. Therefore, many mistake judgements of verbal statements for measurements of the phenomena described and overlook that statistics can neither establish nor analyze a model's relations to the phenomena explored. The article elaborates epistemological and methodological fundamentals to establish coherent modelling relations between real and formal study system and to distinguish the epistemic components involved, considering psychology's peculiarities. It shows that epistemically justified inferences necessitate methods for analysing individuals' unrestricted verbal responses, now advanced through artificial intelligence systems modelling natural language (e.g., NLP algorithms, LLMs). Their increasing use to generate standardised descriptions of study phenomena for rating scales and constructs, by contrast, will only perpetuate psychologists' cardinal error-and thus, psychology's crisis.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d68/12244230/07c512df7660/fpsyg-16-1534270-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d68/12244230/28265ddc8d6d/fpsyg-16-1534270-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d68/12244230/54cafa5e50fa/fpsyg-16-1534270-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d68/12244230/5da9236b1bf3/fpsyg-16-1534270-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d68/12244230/b8206d203b5d/fpsyg-16-1534270-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d68/12244230/bc356d9c6340/fpsyg-16-1534270-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d68/12244230/83187fd0deaf/fpsyg-16-1534270-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d68/12244230/07c512df7660/fpsyg-16-1534270-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d68/12244230/28265ddc8d6d/fpsyg-16-1534270-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d68/12244230/54cafa5e50fa/fpsyg-16-1534270-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d68/12244230/5da9236b1bf3/fpsyg-16-1534270-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d68/12244230/b8206d203b5d/fpsyg-16-1534270-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d68/12244230/bc356d9c6340/fpsyg-16-1534270-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d68/12244230/83187fd0deaf/fpsyg-16-1534270-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d68/12244230/07c512df7660/fpsyg-16-1534270-g0008.jpg
摘要

本文全面批判了心理学过度依赖统计建模而牺牲了基于认识论的测量过程的现象。文章强调,统计学处理数据中的结构关系,而不考虑这些数据代表什么,而测量则在研究的现象与表示有关这些现象信息的数据之间建立可追溯的经验关系。利用罗森的一般测量模型阐述了这些关键的认识论差异,该模型涉及对以下方面的连贯建模:(1)研究对象;(2)数据生成(编码);(3)形式操作(如统计分析);(4)关于所研究对象的结果解释(解码)。事实表明,这种相互关联的建模关系系统是计量学家解决物理测量中认识论循环问题方法的基础,在测量协调和校准的特殊情况下得到了说明。文章接着阐述了心理学在建立真正的测量类似物方面所面临的挑战,这些挑战源于其研究现象的特殊性(如高阶复杂性、非遍历性)和基于语言的方法(如内在语义)。文章表明,心理测量学无法建立协调和校准的建模关系,从而只能产生具有预测力的实用量化,但排除了对所研究现象的认识论上合理的推断。然而,这种认识论上的差距常常被忽视,因为许多心理学家将其方法的内在语义——即对其研究现象的描述(如在评分量表、项目变量、统计模型中)——误认为是所描述的现象。这模糊了所研究现象与用作调查手段的现象之间在认识论上必要的区别,从而将本体论概念与认识论概念混淆——这是心理学家的主要错误。因此,许多人将言语陈述的判断误认为是对所描述现象的测量,并忽视了统计学既不能建立也不能分析模型与所探索现象之间的关系。考虑到心理学的特殊性,文章阐述了认识论和方法论的基本原理,以在真实和形式化的研究系统之间建立连贯的建模关系,并区分其中涉及的认识论成分。文章表明,认识论上合理的推断需要分析个体不受限制的言语反应的方法,目前通过对自然语言进行建模的人工智能系统(如自然语言处理算法、大语言模型)取得了进展。相比之下,越来越多地使用这些系统来生成用于评分量表和结构的研究现象的标准化描述,只会使心理学家的主要错误永久化——从而导致心理学的危机。

相似文献

1
Statistics is not measurement: The inbuilt semantics of psychometric scales and language-based models obscures crucial epistemic differences.统计学并非测量:心理测量量表和基于语言的模型的内在语义掩盖了关键的认知差异。
Front Psychol. 2025 Jun 26;16:1534270. doi: 10.3389/fpsyg.2025.1534270. eCollection 2025.
2
Short-Term Memory Impairment短期记忆障碍
3
Sexual Harassment and Prevention Training性骚扰与预防培训
4
Evidence-based toxicology: a comprehensive framework for causation.循证毒理学:因果关系的综合框架。
Hum Exp Toxicol. 2005 Apr;24(4):161-201. doi: 10.1191/0960327105ht517oa.
5
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
6
Eliciting adverse effects data from participants in clinical trials.从临床试验参与者中获取不良反应数据。
Cochrane Database Syst Rev. 2018 Jan 16;1(1):MR000039. doi: 10.1002/14651858.MR000039.pub2.
7
"In a State of Flow": A Qualitative Examination of Autistic Adults' Phenomenological Experiences of Task Immersion.“心流状态”:对自闭症成年人任务沉浸现象学体验的质性研究
Autism Adulthood. 2024 Sep 16;6(3):362-373. doi: 10.1089/aut.2023.0032. eCollection 2024 Sep.
8
Surveillance of Barrett's oesophagus: exploring the uncertainty through systematic review, expert workshop and economic modelling.巴雷特食管的监测:通过系统评价、专家研讨会和经济模型探索不确定性
Health Technol Assess. 2006 Mar;10(8):1-142, iii-iv. doi: 10.3310/hta10080.
9
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
10
Access Points: Understanding Special Interests Through Autistic Narratives.切入点:通过自闭症患者的叙述理解特殊利益群体。
Autism Adulthood. 2025 Feb 5;7(1):100-111. doi: 10.1089/aut.2023.0157. eCollection 2025 Feb.

本文引用的文献

1
Question-based computational language approach outperform ratings scale in discriminating between anxiety and depression.基于问题的计算语言方法在区分焦虑和抑郁方面比评定量表表现更优。
J Anxiety Disord. 2025 Jun;112:103020. doi: 10.1016/j.janxdis.2025.103020. Epub 2025 Apr 16.
2
Dynamic Mathematical Processing Through Symbolic, Situational, and Verbal Representations.通过符号、情境和语言表征进行动态数学处理。
Integr Psychol Behav Sci. 2025 Mar 12;59(1):33. doi: 10.1007/s12124-025-09899-3.
3
Semantic embeddings reveal and address taxonomic incommensurability in psychological measurement.
语义嵌入揭示并解决了心理测量中的分类不可通约性问题。
Nat Hum Behav. 2025 Mar 11. doi: 10.1038/s41562-024-02089-y.
4
An Embedding-Based Semantic Analysis Approach: A Preliminary Study on Redundancy Detection in Psychological Concepts Operationalized by Scales.一种基于嵌入的语义分析方法:对量表所操作化的心理概念中冗余检测的初步研究
J Intell. 2025 Jan 16;13(1):11. doi: 10.3390/jintelligence13010011.
5
The Minds We Make: A Philosophical Inquiry into Theory of Mind and Artificial Intelligence.我们塑造的心灵:对心智理论与人工智能的哲学探究
Integr Psychol Behav Sci. 2025 Jan 2;59(1):10. doi: 10.1007/s12124-024-09876-2.
6
Agential realism as an alternative philosophy of science perspective for quantitative psychology.能动实在论作为定量心理学的一种替代科学哲学视角。
Front Psychol. 2024 Nov 13;15:1410047. doi: 10.3389/fpsyg.2024.1410047. eCollection 2024.
7
Qualitative (pure) mathematics as an alternative to measurement.定性(纯)数学作为测量的替代方法。
Front Psychol. 2024 Oct 2;15:1374308. doi: 10.3389/fpsyg.2024.1374308. eCollection 2024.
8
Patient-driven research priorities for patient-centered measurement.以患者为中心的测量的患者驱动型研究重点。
BMC Health Serv Res. 2024 Jun 14;24(1):735. doi: 10.1186/s12913-024-11182-x.
9
Epistemic circularity and measurement validity in quantitative psychology: insights from Fechner's psychophysics.定量心理学中的认知循环与测量效度:来自费希纳心理物理学的见解
Front Psychol. 2024 May 21;15:1354392. doi: 10.3389/fpsyg.2024.1354392. eCollection 2024.
10
Measuring the menu, not the food: "psychometric" data may instead measure "lingometrics" (and miss its greatest potential).衡量的是菜单,而非食物:“心理测量”数据或许反而衡量的是“语言测量”(并错失其最大潜力)。
Front Psychol. 2024 Mar 21;15:1308098. doi: 10.3389/fpsyg.2024.1308098. eCollection 2024.