
Comparing Attitudes Across Groups: An IRT-Based Item-Fit Statistic for the Analysis of Measurement Invariance.

Authors

Buchholz Janine, Hartig Johannes

Affiliation

Deutsches Institut für Internationale Pädagogische Forschung, Frankfurt, Germany.

Publication information

Appl Psychol Meas. 2019 May;43(3):241-250. doi: 10.1177/0146621617748323. Epub 2017 Dec 27.

DOI: 10.1177/0146621617748323
PMID: 31019359
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6463271/
Abstract

Questionnaires for the assessment of attitudes and other psychological traits are crucial in educational and psychological research, and item response theory (IRT) has become a viable tool for scaling such data. Many international large-scale assessments aim at comparing these constructs across countries, and the invariance of measures across countries is thus required. In its most recent cycle, the Programme for International Student Assessment (PISA 2015) implemented an innovative approach for testing the invariance of IRT-scaled constructs in the context questionnaires administered to students, parents, school principals, and teachers. On the basis of a concurrent calibration with equal item parameters across all groups (i.e., languages within countries), a group-specific item-fit statistic (root mean square deviance [RMSD]) was used as a measure of the invariance of item parameters for individual groups. The present simulation study examines the statistic's distribution under different types and extents of (non)invariance in polytomous items. Responses to five 4-point Likert-type items were generated under the generalized partial credit model (GPCM) for 1,000 simulees in each of 50 groups. For one of the five items, either location or discrimination parameters were drawn from a normal distribution. In addition to the type of noninvariance, the extent of noninvariance was varied by manipulating the variation of these distributions. The results indicate that the RMSD statistic is better at detecting noninvariance related to between-group differences in item location than in item discrimination. The study's findings may be used as a starting point for sensitivity analyses aimed at defining cutoff values for determining (non)invariance.
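
The simulation design described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code and makes several simplifying assumptions: only the single manipulated item is simulated, the common item parameters (`a0 = 1.0`, steps `-1, 0, 1`) are hypothetical values treated as known rather than estimated by concurrent calibration, and the RMSD is approximated by comparing observed and model-implied mean item scores within bins of the generating θ values instead of using posterior weights. The function names are likewise illustrative.

```python
import math
import random


def gpcm_probs(theta, a, steps):
    """Return P(X = 0..m | theta) for one GPCM item with step parameters `steps`."""
    logits = [0.0]  # category 0 corresponds to an empty sum
    for b in steps:
        logits.append(logits[-1] + a * (theta - b))
    top = max(logits)  # subtract the maximum for numerical stability
    weights = [math.exp(z - top) for z in logits]
    total = sum(weights)
    return [w / total for w in weights]


def expected_score(theta, a, steps):
    """Model-implied mean item score at a given theta."""
    return sum(k * p for k, p in enumerate(gpcm_probs(theta, a, steps)))


def simulate_group(rng, n, a, steps):
    """Draw n abilities from N(0, 1) and one GPCM response per simulee."""
    data = []
    for _ in range(n):
        theta = rng.gauss(0.0, 1.0)
        probs = gpcm_probs(theta, a, steps)
        x = rng.choices(range(len(probs)), weights=probs)[0]
        data.append((theta, x))
    return data


def rmsd(data, a, steps, n_bins=8, lo=-3.0, hi=3.0):
    """Simplified RMSD of observed vs. expected item scores across theta bins.

    Assumption: the generating theta values stand in for posterior weights.
    """
    width = (hi - lo) / n_bins
    obs = [0.0] * n_bins
    exp = [0.0] * n_bins
    counts = [0] * n_bins
    for theta, x in data:
        # Values outside [lo, hi) are assigned to the nearest outer bin.
        idx = min(n_bins - 1, max(0, int((theta - lo) / width)))
        obs[idx] += x
        exp[idx] += expected_score(theta, a, steps)
        counts[idx] += 1
    n = len(data)
    total = 0.0
    for o, e, c in zip(obs, exp, counts):
        if c:
            # Squared gap between bin means, weighted by the bin's share of simulees.
            total += (c / n) * ((o - e) / c) ** 2
    return math.sqrt(total)


def mean_rmsd(rng, sd_location=0.0, sd_discrimination=0.0, n_groups=50, n=1000):
    """Average RMSD over groups whose parameters vary around the common ones."""
    a0, steps0 = 1.0, [-1.0, 0.0, 1.0]  # hypothetical common item parameters
    values = []
    for _ in range(n_groups):
        shift = rng.gauss(0.0, sd_location)
        a_g = max(0.05, rng.gauss(a0, sd_discrimination))  # keep discrimination positive
        data = simulate_group(rng, n, a_g, [b + shift for b in steps0])
        values.append(rmsd(data, a0, steps0))
    return sum(values) / len(values)


if __name__ == "__main__":
    rng = random.Random(2019)  # arbitrary seed for reproducibility
    print("invariant:", round(mean_rmsd(rng), 3))
    print("location sd 0.5:", round(mean_rmsd(rng, sd_location=0.5), 3))
    print("discrimination sd 0.5:", round(mean_rmsd(rng, sd_discrimination=0.5), 3))
```

In the invariant condition the statistic reflects sampling noise only, which gives a baseline against which the location and discrimination conditions can be read. The standard deviations of 0.5 are placeholders, not the values used in the study.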
