• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在评分者介导评估的项目反应理论模型下对评分顺序效应进行建模。

Modeling Rating Order Effects Under Item Response Theory Models for Rater-Mediated Assessments.

作者信息

Huang Hung-Yu

机构信息

Department of Psychology and Counseling, University of Taipei, Taipei, Taiwan.

出版信息

Appl Psychol Meas. 2023 Jun;47(4):312-327. doi: 10.1177/01466216231174566. Epub 2023 May 13.

DOI:10.1177/01466216231174566
PMID:37283589
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10240569/
Abstract

Rater effects are commonly observed in rater-mediated assessments. By using item response theory (IRT) modeling, raters can be treated as independent factors that function as instruments for measuring ratees. Most rater effects are static and can be addressed appropriately within an IRT framework, and a few models have been developed for dynamic rater effects. Operational rating projects often require human raters to continuously and repeatedly score ratees over a certain period, imposing a burden on the cognitive processing abilities and attention spans of raters that stems from judgment fatigue and thus affects the rating quality observed during the rating period. As a result, ratees' scores may be influenced by the order in which they are graded by raters in a rating sequence, and the rating order effect should be considered in new IRT models. In this study, two types of many-faceted (MF)-IRT models are developed to account for such dynamic rater effects, which assume that rater severity can drift systematically or stochastically. The results obtained from two simulation studies indicate that the parameters of the newly developed models can be estimated satisfactorily using Bayesian estimation and that disregarding the rating order effect produces biased model structure and ratee proficiency parameter estimations. A creativity assessment is outlined to demonstrate the application of the new models and to investigate the consequences of failing to detect the possible rating order effect in a real rater-mediated evaluation.

摘要

评分者效应在评分者介导的评估中普遍存在。通过使用项目反应理论(IRT)建模,评分者可被视为独立因素,充当衡量被评者的工具。大多数评分者效应是静态的,可在IRT框架内得到妥善处理,并且已经开发了一些针对动态评分者效应的模型。实际的评分项目通常要求人工评分者在一定时期内持续且反复地对被评者进行评分,这给评分者的认知处理能力和注意力跨度带来了负担,这种负担源于判断疲劳,进而影响评分期间观察到的评分质量。因此,被评者的分数可能会受到评分者在评分序列中对其评分顺序的影响,在新的IRT模型中应考虑评分顺序效应。在本研究中,开发了两种多面(MF)-IRT模型来解释这种动态评分者效应,该效应假设评分者的严格程度会系统地或随机地漂移。两项模拟研究的结果表明,使用贝叶斯估计可以令人满意地估计新开发模型的参数,而忽略评分顺序效应会产生有偏差的模型结构和被评者能力参数估计。概述了一项创造力评估,以展示新模型的应用,并研究在实际的评分者介导评估中未能检测到可能的评分顺序效应的后果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d7d/10240569/7ed95a806979/10.1177_01466216231174566-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d7d/10240569/ed2e0e54b20c/10.1177_01466216231174566-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d7d/10240569/78fdc5f8cb41/10.1177_01466216231174566-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d7d/10240569/7ed95a806979/10.1177_01466216231174566-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d7d/10240569/ed2e0e54b20c/10.1177_01466216231174566-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d7d/10240569/78fdc5f8cb41/10.1177_01466216231174566-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d7d/10240569/7ed95a806979/10.1177_01466216231174566-fig3.jpg

相似文献

1
Modeling Rating Order Effects Under Item Response Theory Models for Rater-Mediated Assessments.在评分者介导评估的项目反应理论模型下对评分顺序效应进行建模。
Appl Psychol Meas. 2023 Jun;47(4):312-327. doi: 10.1177/01466216231174566. Epub 2023 May 13.
2
Rater-based assessments as social judgments: rethinking the etiology of rater errors.基于评定者的评估即社会判断:重新思考评定者误差的病因。
Acad Med. 2011 Oct;86(10 Suppl):S1-7. doi: 10.1097/ACM.0b013e31822a6cf8.
3
Item response theory model highlighting rating scale of a rubric and rater-rubric interaction in objective structured clinical examination.项目反应理论模型突出了客观结构化临床考试中等级量表的评分和评分者-等级量表的交互作用。
PLoS One. 2024 Sep 6;19(9):e0309887. doi: 10.1371/journal.pone.0309887. eCollection 2024.
4
Exploring Within-Rater Category Ordering: A Simulation Study Using Adjacent-Categories Mokken Scale Analysis.探索评分者内类别排序:一项使用相邻类别莫肯量表分析的模拟研究
Educ Psychol Meas. 2018 Oct;78(5):887-904. doi: 10.1177/0013164417724841. Epub 2017 Aug 4.
5
Item Response Theory Modeling for Examinee-selected Items with Rater Effect.具有评分者效应的考生自选项目的项目反应理论建模
Appl Psychol Meas. 2019 Sep;43(6):435-448. doi: 10.1177/0146621618798667. Epub 2018 Oct 8.
6
A Bayesian many-facet Rasch model with Markov modeling for rater severity drift.贝叶斯多项 RASCH 模型与马尔可夫建模用于评分者严重偏差。
Behav Res Methods. 2023 Oct;55(7):3910-3928. doi: 10.3758/s13428-022-01997-z. Epub 2022 Oct 25.
7
A mixture Rasch facets model for rater's illusory halo effects.一种用于评估者虚幻光环效应的混合拉施克侧面模型。
Behav Res Methods. 2022 Dec;54(6):2750-2764. doi: 10.3758/s13428-021-01721-3. Epub 2022 Jan 11.
8
Linking essay-writing tests using many-facet models and neural automated essay scoring.运用多维模型和神经自动作文评分技术对作文考试进行关联。
Behav Res Methods. 2024 Dec;56(8):8450-8479. doi: 10.3758/s13428-024-02485-2. Epub 2024 Aug 20.
9
Reliability and Validity of Performance Evaluations of Pain Medicine Clinical Faculty by Residents and Fellows Using a Supervision Scale.住院医师和研究员使用监督量表对疼痛医学临床教师进行绩效评估的可靠性和有效性。
Anesth Analg. 2020 Sep;131(3):909-916. doi: 10.1213/ANE.0000000000004779.
10
Exploring Rating Quality in Rater-Mediated Assessments Using Mokken Scale Analysis.使用莫肯量表分析探索评分者介导评估中的评分质量
Educ Psychol Meas. 2016 Aug;76(4):685-706. doi: 10.1177/0013164415604704. Epub 2015 Sep 17.

引用本文的文献

1
Understanding Rater Cognition in Performance Assessment: A Mixed IRTree Approach.理解绩效评估中的评分者认知:一种混合IRTree方法。
Appl Psychol Meas. 2025 Apr 14:01466216251333578. doi: 10.1177/01466216251333578.
2
Exploring the Influence of Response Styles on Continuous Scale Assessments: Insights From a Novel Modeling Approach.探索反应方式对连续量表评估的影响:来自一种新型建模方法的见解。
Educ Psychol Meas. 2025 Feb;85(1):178-214. doi: 10.1177/00131644241242789. Epub 2024 Apr 17.