• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

当评分者进行概括时:使用混合Rasch方面模型检验光环效应的来源。

When raters generalize: Examining sources of halo effects with mixture Rasch facets models.

作者信息

Jin Kuan-Yu, Eckes Thomas

机构信息

Hong Kong Examinations and Assessment Authority, 68 Gillies Avenue South, Kowloon City, Kowloon, Hong Kong.

TestDaF Institute, University of Bochum, Universitätsstr. 134, 44799, Bochum, Germany.

出版信息

Behav Res Methods. 2025 Apr 21;57(5):149. doi: 10.3758/s13428-025-02667-6.

DOI:10.3758/s13428-025-02667-6
PMID:40259154
Abstract

Halo effects are commonly considered a cognitive or judgmental bias leading to rating error when raters assign scores to persons or performances on multiple criteria. Though a long tradition of research has pointed to possible sources of halo effects, measurement models for identifying these sources and detecting halo have been lacking. In the present research, we propose a general mixture Rasch facets model for halo effects (MRFM-H) and derive two more specific models, each assuming a different psychological mechanism. According to the first model, MRFM-H(GI), persons evoke general impressions that guide raters when assigning scores on conceptually distinct criteria. The second model, MRFM-H(ID), assumes that raters fail to discriminate adequately between the criteria. We adopted a Bayesian inference approach to implement these models, conducting two simulation studies and a real-data analysis. In the simulation studies, we found that (a) the number of raters and criteria determined the accuracy of classifying persons as inducing or not inducing halo; (b) 90% classification accuracy was achieved when at least 25 ratings were available for each rater-person combination; (c) ignoring halo caused by either mechanism (general impressions or inadequate criterion discrimination) biased the criterion parameter estimates while having a negligible impact on person and rater estimates; (d) Bayesian data-model fit statistics (WAIC and WBIC) reliably identified the true, data-generating model. The real-data analysis highlighted the models' practical utility for examining the likely source of halo effects. The discussion focuses on the models' application in various assessment contexts and points to directions for future research.

摘要

晕轮效应通常被认为是一种认知或判断偏差,当评估者根据多个标准对人员或表现进行评分时,会导致评分误差。尽管长期以来的研究传统指出了晕轮效应可能的来源,但缺乏用于识别这些来源和检测晕轮效应的测量模型。在本研究中,我们提出了一种晕轮效应的通用混合Rasch方面模型(MRFM-H),并推导出另外两个更具体的模型,每个模型都假设了不同的心理机制。根据第一个模型,即MRFM-H(GI),个体唤起的总体印象会在评估者根据概念上不同的标准进行评分时指导他们。第二个模型,即MRFM-H(ID),假设评估者未能充分区分这些标准。我们采用贝叶斯推理方法来实现这些模型,进行了两项模拟研究和一项实际数据分析。在模拟研究中,我们发现:(a)评估者和标准的数量决定了将个体分类为是否引发晕轮效应的准确性;(b)当每个评估者与个体的组合至少有25个评分时,分类准确率达到90%;(c)忽略由任何一种机制(总体印象或标准区分不足)导致的晕轮效应会使标准参数估计产生偏差,而对个体和评估者估计的影响可忽略不计;(d)贝叶斯数据-模型拟合统计量(WAIC和WBIC)能够可靠地识别真实的数据生成模型。实际数据分析突出了这些模型在检查晕轮效应可能来源方面的实际效用。讨论聚焦于这些模型在各种评估情境中的应用,并指出了未来研究的方向。

相似文献

1
When raters generalize: Examining sources of halo effects with mixture Rasch facets models.当评分者进行概括时:使用混合Rasch方面模型检验光环效应的来源。
Behav Res Methods. 2025 Apr 21;57(5):149. doi: 10.3758/s13428-025-02667-6.
2
A mixture Rasch facets model for rater's illusory halo effects.一种用于评估者虚幻光环效应的混合拉施克侧面模型。
Behav Res Methods. 2022 Dec;54(6):2750-2764. doi: 10.3758/s13428-021-01721-3. Epub 2022 Jan 11.
3
Diagnosing a common rater halo effect using the polytomous Rasch model.使用多值Rasch模型诊断常见的评分者光环效应。
J Appl Meas. 2011;12(3):194-211.
4
Assessment of Differential Rater Functioning in Latent Classes with New Mixture Facets Models.使用新的混合方面模型评估潜在类别中的差异评分者功能。
Multivariate Behav Res. 2017 May-Jun;52(3):391-402. doi: 10.1080/00273171.2017.1299615. Epub 2017 Mar 22.
5
Detecting rater bias using a person-fit statistic: a Monte Carlo simulation study.使用个体拟合统计量检测评分者偏差:一项蒙特卡罗模拟研究。
Perspect Med Educ. 2018 Apr;7(2):83-92. doi: 10.1007/s40037-017-0391-8.
6
Part 2. Development of Enhanced Statistical Methods for Assessing Health Effects Associated with an Unknown Number of Major Sources of Multiple Air Pollutants.第2部分。开发增强的统计方法,以评估与多种空气污染物的未知数量主要来源相关的健康影响。
Res Rep Health Eff Inst. 2015 Jun(183 Pt 1-2):51-113.
7
Implicit versus explicit first impressions in performance-based assessment: will raters overcome their first impressions when learner performance changes?基于表现的评估中的内隐印象与外显印象:当学习者表现改变时,评价者会克服他们的第一印象吗?
Adv Health Sci Educ Theory Pract. 2024 Sep;29(4):1155-1168. doi: 10.1007/s10459-023-10302-2. Epub 2023 Nov 27.
8
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
9
Identifying subtypes in persons, situations and person-situation interactions: Categorical latent state-trait modelling approaches.识别个体、情境及个体-情境交互中的亚型:分类潜在状态-特质建模方法。
Br J Psychol. 2025 May;116(2):291-315. doi: 10.1111/bjop.12718. Epub 2024 Jun 26.
10
A mixture model approach to indexing rater agreement.一种用于索引评分者一致性的混合模型方法。
Br J Math Stat Psychol. 2002 Nov;55(Pt 2):289-303. doi: 10.1348/000711002760554598.

本文引用的文献

1
Human ratings take time: A hierarchical facets model for the joint analysis of ratings and rating times.人力评分需要时间:一种联合分析评分和评分时间的层次因素模型。
Behav Res Methods. 2024 Apr;56(4):3535-3547. doi: 10.3758/s13428-023-02259-2. Epub 2023 Nov 2.
2
Detecting Rating Scale Malfunctioning With the Partial Credit Model and Generalized Partial Credit Model.使用部分计分模型和广义部分计分模型检测评分量表故障
Educ Psychol Meas. 2023 Oct;83(5):953-983. doi: 10.1177/00131644221116292. Epub 2022 Aug 12.
3
The Impact of Sample Size and Various Other Factors on Estimation of Dichotomous Mixture IRT Models.
样本量及其他各种因素对二分混合IRT模型估计的影响
Educ Psychol Meas. 2023 Jun;83(3):520-555. doi: 10.1177/00131644221094325. Epub 2022 May 19.
4
A Bayesian many-facet Rasch model with Markov modeling for rater severity drift.贝叶斯多项 RASCH 模型与马尔可夫建模用于评分者严重偏差。
Behav Res Methods. 2023 Oct;55(7):3910-3928. doi: 10.3758/s13428-022-01997-z. Epub 2022 Oct 25.
5
Differentiation of Illusory and True Halo in Writing Scores.写作分数中虚幻光环与真实光环的区分。
Educ Psychol Meas. 2015 Feb;75(1):102-125. doi: 10.1177/0013164414530990. Epub 2014 Apr 24.
6
Assessment of Differential Rater Functioning in Latent Classes with New Mixture Facets Models.使用新的混合方面模型评估潜在类别中的差异评分者功能。
Multivariate Behav Res. 2017 May-Jun;52(3):391-402. doi: 10.1080/00273171.2017.1299615. Epub 2017 Mar 22.
7
Blinded by Beauty: Attractiveness Bias and Accurate Perceptions of Academic Performance.被美貌蒙蔽:吸引力偏差与对学术表现的准确认知
PLoS One. 2016 Feb 17;11(2):e0148284. doi: 10.1371/journal.pone.0148284. eCollection 2016.
8
A relationship between attractiveness and performance in professional cyclists.职业自行车手的吸引力与表现之间的关系。
Biol Lett. 2014 Feb 5;10(2):20130966. doi: 10.1098/rsbl.2013.0966. Print 2014 Feb.
9
Diagnosing a common rater halo effect using the polytomous Rasch model.使用多值Rasch模型诊断常见的评分者光环效应。
J Appl Meas. 2011;12(3):194-211.
10
Detecting and measuring rater effects using many-facet Rasch measurement: Part II.使用多面Rasch测量法检测和衡量评分者效应:第二部分。
J Appl Meas. 2004;5(2):189-227.