• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

李克特量表中项目措辞如何影响参与者的回答?来自项目反应理论分析的证据。

How does item wording affect participants' responses in Likert scale? Evidence from IRT analysis.

作者信息

Zeng Biao, Jeon Minjeong, Wen Hongbo

机构信息

Collaborative Innovation Center of Assessment toward Basic Education Quality, Beijing Normal University, Beijing, China.

Department of Education, School of Education and Information Studies, University of California, Los Angeles, CA, United States.

出版信息

Front Psychol. 2024 Oct 4;15:1304870. doi: 10.3389/fpsyg.2024.1304870. eCollection 2024.

DOI:10.3389/fpsyg.2024.1304870
PMID:39430906
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11486723/
Abstract

Researchers often combine both positively and negatively worded items when constructing Likert scales. This combination, however, may introduce method effects due to the variances in item wording. Although previous studies have tried to quantify these effects by using factor analysis on scales with different content, the impact of varied item wording on participants' choices among specific options remains unexplored. To address this gap, we utilized four versions of the Undergraduate Learning Burnout (ULB) scale, each characterized by a unique valence of item wording. After collecting responses from 1,131 college students, we employed unidimensional, multidimensional, and bi-factor Graded Response Models for analysis. The results suggested that the ULB scale supports a unidimensional structure for the learning burnout trait. However, the inclusion of different valences of wording within items introduced additional method factors, explaining a considerable degree of variance. Notably, positively worded items demonstrated greater discriminative power and more effectively counteracted the biased outcomes associated with negatively worded items, especially between the "Strongly Disagree" and "Disagree" options. While there were no substantial differences in the overall learning burnout traits among respondents of different scale versions, slight variations were noted in their distributions. The integration of both positive and negative wordings reduced the reliability of the learning burnout trait measurement. Consequently, it is recommended to use exclusively positively worded items and avoid a mix in item wording during scale construction. If a combination is essential, the bi-factor IRT model might help segregate the method effects resulting from the wording valence.

摘要

研究人员在构建李克特量表时,常常会同时纳入正向和负向表述的项目。然而,这种组合可能会因项目措辞的差异而引入方法效应。尽管先前的研究试图通过对具有不同内容的量表进行因子分析来量化这些效应,但不同的项目措辞对参与者在特定选项中的选择所产生的影响仍未得到探讨。为了填补这一空白,我们使用了本科学习倦怠(ULB)量表的四个版本,每个版本的项目措辞都具有独特的效价。在收集了1131名大学生的回答后,我们采用了单维、多维和双因素等级反应模型进行分析。结果表明,ULB量表支持学习倦怠特质的单维结构。然而,项目中纳入不同效价的措辞引入了额外的方法因素,解释了相当程度的方差。值得注意的是,正向表述的项目表现出更大的区分能力,并且更有效地抵消了与负向表述项目相关的偏差结果,尤其是在“强烈不同意”和“不同意”选项之间。虽然不同量表版本的受访者在总体学习倦怠特质上没有实质性差异,但在其分布上发现了细微变化。正负措辞的结合降低了学习倦怠特质测量的可靠性。因此,建议在量表构建过程中仅使用正向表述的项目,避免项目措辞的混合。如果必须进行组合,双因素IRT模型可能有助于分离由措辞效价产生的方法效应。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb47/11486723/7fc3e8f87c86/fpsyg-15-1304870-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb47/11486723/d7be34f6e180/fpsyg-15-1304870-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb47/11486723/7fc3e8f87c86/fpsyg-15-1304870-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb47/11486723/d7be34f6e180/fpsyg-15-1304870-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb47/11486723/7fc3e8f87c86/fpsyg-15-1304870-g002.jpg

相似文献

1
How does item wording affect participants' responses in Likert scale? Evidence from IRT analysis.李克特量表中项目措辞如何影响参与者的回答?来自项目反应理论分析的证据。
Front Psychol. 2024 Oct 4;15:1304870. doi: 10.3389/fpsyg.2024.1304870. eCollection 2024.
2
How Does the Valence of Wording Affect Features of a Scale? The Method Effects in the Undergraduate Learning Burnout Scale.措辞的效价如何影响量表的特征?大学生学习倦怠量表中的方法效应。
Front Psychol. 2020 Sep 28;11:585179. doi: 10.3389/fpsyg.2020.585179. eCollection 2020.
3
Item Response Theory Models for Wording Effects in Mixed-Format Scales.混合格式量表中措辞效应的项目反应理论模型
Educ Psychol Meas. 2015 Feb;75(1):157-178. doi: 10.1177/0013164414528209. Epub 2014 Apr 6.
4
Response tendencies due to item wording using eye-tracking methodology accounting for individual differences and item characteristics.使用眼动追踪方法,考虑个体差异和项目特征,研究因项目措辞导致的反应倾向。
Behav Res Methods. 2022 Oct;54(5):2252-2270. doi: 10.3758/s13428-021-01719-x. Epub 2022 Jan 14.
5
Direction of wording and responses to items in oral health-related quality of life questionnaires for children and their parents.儿童及其家长口腔健康相关生活质量问卷中条目的措辞方向及回答方式
Community Dent Oral Epidemiol. 2007 Aug;35(4):255-62. doi: 10.1111/j.1600-0528.2007.00320.x.
6
The Impact of Item Feature and Response Preference in a Mixed-Format Design.混合格式设计中项目特征和反应偏好的影响
Multivariate Behav Res. 2022 Mar-May;57(2-3):208-222. doi: 10.1080/00273171.2020.1820308. Epub 2020 Oct 1.
7
A Study of Reverse-Worded Matched Item Pairs Using the Generalized Partial Credit and Nominal Response Models.一项使用广义部分计分模型和名义反应模型的反向配对项目研究。
Educ Psychol Meas. 2018 Feb;78(1):103-127. doi: 10.1177/0013164416670211. Epub 2016 Oct 6.
8
Wording Effects in Assessment: Missing the Trees for the Forest.评估中的措辞效应:只见树木,不见森林。
Multivariate Behav Res. 2022 Sep-Oct;57(5):718-734. doi: 10.1080/00273171.2021.1925075. Epub 2021 May 28.
9
Modeling Wording Effects Does Not Help in Recovering Uncontaminated Person Scores: A Systematic Evaluation With Random Intercept Item Factor Analysis.建模措辞效应无助于恢复未受污染的个人得分:基于随机截距项目因素分析的系统评价
Front Psychol. 2021 Jun 2;12:685326. doi: 10.3389/fpsyg.2021.685326. eCollection 2021.
10
Methodological issues in measuring subjective well-being and quality-of-life: Applications to assessment of affect in older, chronically and cognitively impaired, ethnically diverse groups using the Feeling Tone Questionnaire.测量主观幸福感和生活质量中的方法学问题:使用情感基调问卷在老年、慢性和认知受损、种族多样化群体中评估情感的应用。
Appl Res Qual Life. 2017 Jun;12(2):251-288. doi: 10.1007/s11482-017-9516-9. Epub 2017 Apr 4.

引用本文的文献

1
Exploring faculty perceptions and concerns regarding artificial intelligence Chatbots in nursing education: potential benefits and limitations.探索教师对护理教育中人工智能聊天机器人的看法和担忧:潜在的益处和局限性。
BMC Nurs. 2025 Apr 18;24(1):440. doi: 10.1186/s12912-025-03082-0.
2
Construct-irrelevant item attributes: a framework to classifying items based on context and referent.与结构无关的项目属性:一种基于上下文和参照对项目进行分类的框架。
Front Psychol. 2025 Mar 12;16:1446798. doi: 10.3389/fpsyg.2025.1446798. eCollection 2025.

本文引用的文献

1
How Does the Valence of Wording Affect Features of a Scale? The Method Effects in the Undergraduate Learning Burnout Scale.措辞的效价如何影响量表的特征?大学生学习倦怠量表中的方法效应。
Front Psychol. 2020 Sep 28;11:585179. doi: 10.3389/fpsyg.2020.585179. eCollection 2020.
2
A little garbage in, lots of garbage out: Assessing the impact of careless responding in personality survey data.一入调查深似海,数据垃圾全都来:评估人格调查数据中草率作答的影响。
Behav Res Methods. 2020 Dec;52(6):2489-2505. doi: 10.3758/s13428-020-01401-8.
3
Simple Structure Detection Through Bayesian Exploratory Multidimensional IRT Models.
基于贝叶斯探索性多维IRT 模型的简单结构检测。
Multivariate Behav Res. 2019 Jan-Feb;54(1):100-112. doi: 10.1080/00273171.2018.1496317. Epub 2018 Nov 7.
4
Item Response Theory Models for Wording Effects in Mixed-Format Scales.混合格式量表中措辞效应的项目反应理论模型
Educ Psychol Meas. 2015 Feb;75(1):157-178. doi: 10.1177/0013164414528209. Epub 2014 Apr 6.
5
Is the Bifactor Model a Better Model or Is It Just Better at Modeling Implausible Responses? Application of Iteratively Reweighted Least Squares to the Rosenberg Self-Esteem Scale.双因素模型是一个更好的模型,还是仅仅在对不合理反应进行建模方面表现更好?迭代加权最小二乘法在罗森伯格自尊量表中的应用。
Multivariate Behav Res. 2016 Nov-Dec;51(6):818-838. doi: 10.1080/00273171.2016.1243461. Epub 2016 Nov 11.
6
Method effects: the problem with negatively versus positively keyed items.方法效应:负向与正向计分项目的问题。
J Pers Assess. 2012;94(2):196-204. doi: 10.1080/00223891.2011.645936.
7
Longitudinal tests of competing factor structures for the Rosenberg Self-Esteem Scale: traits, ephemeral artifacts, and stable response styles.罗森伯格自尊量表竞争因素结构的纵向测试:特质、短暂人工制品和稳定的反应风格。
Psychol Assess. 2010 Jun;22(2):366-81. doi: 10.1037/a0019225.
8
Psychometric evaluation of the Taiwanese version of the Kiddo-kINDL generic children's health-related quality of life instrument.台湾版儿童生活质量通用量表Kiddo-kINDL的心理测量学评估
Qual Life Res. 2008 May;17(4):603-11. doi: 10.1007/s11136-008-9328-3.
9
Common method biases in behavioral research: a critical review of the literature and recommended remedies.行为研究中的共同方法偏差:文献综述与建议补救措施
J Appl Psychol. 2003 Oct;88(5):879-903. doi: 10.1037/0021-9010.88.5.879.
10
Convergent and discriminant validation by the multitrait-multimethod matrix.通过多特质-多方法矩阵进行收敛效度和区分效度检验。
Psychol Bull. 1959 Mar;56(2):81-105.