Managing validity versus reliability trade-offs in scale-building decisions.

Affiliation

University of Pennsylvania.

Publication information

Psychol Methods. 2020 Jun;25(3):259-270. doi: 10.1037/met0000236. Epub 2019 Aug 15.

DOI:10.1037/met0000236
PMID:31414848
Abstract

Scale builders strive to maximize dual priorities: validity and reliability. While the literature is full of tips for increasing one, the other, or both simultaneously, how to navigate tensions between them is less clear. Confusion shrouds the nature, prevalence, and practical implications of trade-offs between validity and reliability (formerly called paradoxes). This confusion results in most trade-offs being resolved de facto at validity's expense despite validity being de jure the higher priority. Decades-long battles against clear measurement malpractice persist because unspecified trade-offs render scale-building decisions favoring validity perennially unattractive to scale builders. In light of this confusion, the goal of this article is to make plain that the source of validity versus reliability trade-offs is systematic error that contributes to item communality. Moreover, straightforward, nontrivial trade-offs pervade the scale-building process. This article highlights common trade-offs in 6 contexts: item content, item construction, item difficulty, item scoring, item order, and item analysis. I end with 5 recommendations for managing trade-offs and out 7 "dirty tricks" often used to exploit them when nobody's looking. In short, reviewers should require scale builders to declare how validity and reliability will be prioritized and penalize those who resolve trade-offs in goal-inconsistent ways. (PsycInfo Database Record (c) 2020 APA, all rights reserved).
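
The abstract's central claim, that systematic error shared across items raises item communality and so can raise reliability while lowering validity, can be illustrated with a small simulation. The sketch below is not taken from the article: the data-generating model (each item = construct + weighted nuisance factor + random noise), the parameter values, and the function names `cronbach_alpha`, `pearson`, and `simulate` are all assumptions chosen for illustration. Reliability is approximated by Cronbach's alpha and validity by the correlation between the sum score and the simulated construct.

```python
import random
import statistics


def cronbach_alpha(items):
    """Cronbach's alpha for k item columns, each a list of n respondent scores."""
    k = len(items)
    totals = [sum(row) for row in zip(*items)]
    item_var_sum = sum(statistics.variance(col) for col in items)
    return k / (k - 1) * (1 - item_var_sum / statistics.variance(totals))


def pearson(x, y):
    """Pearson correlation between two equal-length lists."""
    mx, my = statistics.fmean(x), statistics.fmean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)


def simulate(nuisance_weight, n=5000, k=6, noise_sd=2 ** 0.5, seed=0):
    """Return (alpha, validity) for a hypothetical k-item scale.

    Assumed model: item = construct + nuisance_weight * nuisance + noise,
    where the nuisance factor (e.g. a response style) is shared by all
    items and is independent of the construct.
    """
    rng = random.Random(seed)
    construct = [rng.gauss(0, 1) for _ in range(n)]
    nuisance = [rng.gauss(0, 1) for _ in range(n)]
    items = [
        [t + nuisance_weight * s + rng.gauss(0, noise_sd)
         for t, s in zip(construct, nuisance)]
        for _ in range(k)
    ]
    totals = [sum(row) for row in zip(*items)]
    return cronbach_alpha(items), pearson(totals, construct)


if __name__ == "__main__":
    for weight in (0.0, 1.0):
        alpha, validity = simulate(weight)
        print(f"nuisance weight {weight}: alpha={alpha:.3f}, validity={validity:.3f}")
```

Under these assumed parameters the population values work out to roughly alpha 0.75 and validity 0.87 without the shared nuisance factor, and roughly alpha 0.86 and validity 0.65 with it. The shared systematic error makes the scale look more reliable while the sum score tracks the construct less closely.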


Similar articles

1
Managing validity versus reliability trade-offs in scale-building decisions.
Psychol Methods. 2020 Jun;25(3):259-270. doi: 10.1037/met0000236. Epub 2019 Aug 15.
2
A causal theory of error scores.
Psychol Methods. 2024 Aug;29(4):807-826. doi: 10.1037/met0000521. Epub 2022 Jul 25.
3
Reliability and Validity of a Turkish version of the Prenatal Breastfeeding Self-Efficacy Scale.
Midwifery. 2018 Sep;64:11-16. doi: 10.1016/j.midw.2018.05.007. Epub 2018 May 18.
4
Composite reliability of multilevel data: It's about observed scores and construct meanings.
Psychol Methods. 2021 Feb;26(1):90-102. doi: 10.1037/met0000287. Epub 2020 Jul 16.
5
The accuracy of reliability coefficients: A reanalysis of existing simulations.
Psychol Methods. 2024 Apr;29(2):331-349. doi: 10.1037/met0000475. Epub 2022 Jan 27.
6
The development and psychometric properties of the bipolar disorders knowledge scale.
J Affect Disord. 2018 Oct 1;238:645-650. doi: 10.1016/j.jad.2018.05.043. Epub 2018 Jun 26.
7
Introduction to the special issue on methodological and statistical advancements in clinical assessment.
Psychol Assess. 2019 Dec;31(12):1383-1385. doi: 10.1037/pas0000786.
8
Reliability and validity of the Korean version of the 15-item Dispositional Resilience Scale.
Psychol Health Med. 2018 Jan-Dec;23(sup1):1-12. doi: 10.1080/13548506.2017.1417612.
9
Development of palliative care attitude and knowledge (PCAK) questionnaire for physicians in Kuwait.
BMC Palliat Care. 2019 Jun 6;18(1):49. doi: 10.1186/s12904-019-0430-9.
10
Development and psychometric testing of the toxic leadership behaviors of nurse managers (ToxBH-NM) scale.
J Nurs Manag. 2020 May;28(4):840-850. doi: 10.1111/jonm.13008. Epub 2020 Apr 19.

Cited by

1
Ambulatory physiological measures obtained under naturalistic urban mobility conditions have acceptable reliability.
Sci Rep. 2025 Aug 7;15(1):28940. doi: 10.1038/s41598-025-13216-8.
2
Engaging people with lived experience of psychological disorders: Current research and future directions for community-engaged measure development in psychological science.
Clin Psychol Sci. 2025 Jul;13(4):720-739. doi: 10.1177/21677026241304339. Epub 2025 Feb 5.
3
Application of the ant colony optimization algorithm for the construction of a short version of the German alcohol decisional balance scale.
Sci Rep. 2025 Jul 25;15(1):27122. doi: 10.1038/s41598-025-12087-3.
4
What makes an individual inclusive of others? Development of the individual inclusiveness inventory.
Front Psychol. 2025 May 7;16:1473120. doi: 10.3389/fpsyg.2025.1473120. eCollection 2025.
5
Within-Person Fluctuations in Ethnic-Racial Affect and Discrimination-Based Stress: Moderation by Average Ethnic-Racial Affect and Stress.
J Adolesc. 2025 Jul;97(5):1173-1185. doi: 10.1002/jad.12484. Epub 2025 Feb 16.
6
The brief mind wandering three-factor scale (BMW-3).
Behav Res Methods. 2024 Dec;56(8):8720-8744. doi: 10.3758/s13428-024-02500-6. Epub 2024 Sep 11.
7
Model Specification Searches in Structural Equation Modeling Using Bee Swarm Optimization.
Educ Psychol Meas. 2024 Feb;84(1):40-61. doi: 10.1177/00131644231160552. Epub 2023 Mar 29.
8
How Does Parental Monitoring Reduce Adolescent Substance Use? Preliminary Tests of Two Potential Mechanisms.
J Stud Alcohol Drugs. 2024 May;85(3):389-394. doi: 10.15288/jsad.23-00297. Epub 2024 Jan 16.
9
Why Do Regular and Reversed Items Load on Separate Factors? Response Difficulty vs. Item Extremity.
Educ Psychol Meas. 2023 Dec;83(6):1085-1112. doi: 10.1177/00131644221143972. Epub 2023 Jan 2.
10
Using artificial intelligence to assess personal qualities in college admissions.
Sci Adv. 2023 Oct 13;9(41):eadg9405. doi: 10.1126/sciadv.adg9405. Epub 2023 Oct 12.