• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Evaluating Rater Accuracy in Rater-Mediated Assessments Using an Unfolding Model.使用展开模型评估评分者介导评估中的评分者准确性。
Educ Psychol Meas. 2016 Dec;76(6):1005-1025. doi: 10.1177/0013164415621606. Epub 2015 Dec 14.
2
Exploring the Impersonal Judgments and Personal Preferences of Raters in Rater-Mediated Assessments With Unfolding Models.使用展开模型在评分者介导评估中探究评分者的客观判断和个人偏好。
Educ Psychol Meas. 2019 Aug;79(4):773-795. doi: 10.1177/0013164419827345. Epub 2019 Feb 5.
3
Incorporating Criterion Ratings Into Model-Based Rater Monitoring Procedures Using Latent-Class Signal Detection Theory.运用潜在类别信号检测理论将标准评分纳入基于模型的评分者监测程序
Appl Psychol Meas. 2017 Sep;41(6):472-491. doi: 10.1177/0146621617698452. Epub 2017 Mar 27.
4
Differences in inter-rater reliability and accuracy for a treatment adherence scale.一种治疗依从性量表的评分者间信度和准确性差异。
Cogn Behav Ther. 2007;36(4):230-9. doi: 10.1080/16506070701584367.
5
Examining rating quality in writing assessment: rater agreement, error, and accuracy.审视写作评估中的评分质量:评分者一致性、误差与准确性。
J Appl Meas. 2012;13(4):321-35.
6
Examining rating scales using Rasch and Mokken models for rater-mediated assessments.使用拉施模型和莫肯模型检查评分量表以进行评分者介导的评估。
J Appl Meas. 2014;15(2):100-32.
7
Exploring Rating Quality in Rater-Mediated Assessments Using Mokken Scale Analysis.使用莫肯量表分析探索评分者介导评估中的评分质量
Educ Psychol Meas. 2016 Aug;76(4):685-706. doi: 10.1177/0013164415604704. Epub 2015 Sep 17.
8
Rater characteristics, response content, and scoring contexts: Decomposing the determinates of scoring accuracy.评分者特征、回答内容和评分情境:剖析评分准确性的决定因素。
Front Psychol. 2022 Aug 10;13:937097. doi: 10.3389/fpsyg.2022.937097. eCollection 2022.
9
Effects of a rater training on rating accuracy in a physical examination skills assessment.评分员培训对体格检查技能评估中评分准确性的影响。
GMS Z Med Ausbild. 2014 Nov 17;31(4):Doc41. doi: 10.3205/zma000933. eCollection 2014.
10
Using Repeated Ratings to Improve Measurement Precision in Incomplete Rating Designs.在不完全评分设计中使用重复评分提高测量精度
J Appl Meas. 2018;19(2):148-161.

引用本文的文献

1
Exploring the Impersonal Judgments and Personal Preferences of Raters in Rater-Mediated Assessments With Unfolding Models.使用展开模型在评分者介导评估中探究评分者的客观判断和个人偏好。
Educ Psychol Meas. 2019 Aug;79(4):773-795. doi: 10.1177/0013164419827345. Epub 2019 Feb 5.
2
Investigation of Rater Effects Using Social Network Analysis and Exponential Random Graph Models.使用社会网络分析和指数随机图模型对评分者效应进行调查。
Educ Psychol Meas. 2018 Jun;78(3):430-459. doi: 10.1177/0013164416689696. Epub 2017 Feb 5.
3
Incorporating Criterion Ratings Into Model-Based Rater Monitoring Procedures Using Latent-Class Signal Detection Theory.运用潜在类别信号检测理论将标准评分纳入基于模型的评分者监测程序
Appl Psychol Meas. 2017 Sep;41(6):472-491. doi: 10.1177/0146621617698452. Epub 2017 Mar 27.

本文引用的文献

1
A Family of Rater Accuracy Models.一系列评分者准确性模型。
J Appl Meas. 2015;16(2):153-60.
2
An introduction to the theory of unidimensional unfolding.单维展开理论导论
J Appl Meas. 2006;7(3):260-77.
3
Psychological scaling without a unit of measurement.无测量单位的心理量表编制
Psychol Rev. 1950 May;57(3):145-58. doi: 10.1037/h0060984.
4
Detecting and measuring rater effects using many-facet Rasch measurement: Part II.使用多面Rasch测量法检测和衡量评分者效应:第二部分。
J Appl Meas. 2004;5(2):189-227.
5
Detecting and measuring rater effects using many-facet Rasch measurement: part I.使用多面Rasch测量法检测和衡量评分者效应:第一部分。
J Appl Meas. 2003;4(4):386-422.
6
A Class of Probabilistic Unfolding Models for Polytomous Responses.一类用于多分类响应的概率展开模型。
J Math Psychol. 2001 Apr;45(2):224-248. doi: 10.1006/jmps.2000.1310.
7
A General Formulation for Unidimensional Unfolding and Pairwise Preference Models: Making Explicit the Latitude of Acceptance.
J Math Psychol. 1998 Dec;42(4):400-417. doi: 10.1006/jmps.1998.1206.

使用展开模型评估评分者介导评估中的评分者准确性。

Evaluating Rater Accuracy in Rater-Mediated Assessments Using an Unfolding Model.

作者信息

Wang Jue, Engelhard George, Wolfe Edward W

机构信息

The University of Georgia, Athens, GA, USA.

Pearson, Iowa City, IA, USA.

出版信息

Educ Psychol Meas. 2016 Dec;76(6):1005-1025. doi: 10.1177/0013164415621606. Epub 2015 Dec 14.

DOI:10.1177/0013164415621606
PMID:29795898
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5965606/
Abstract

The number of performance assessments continues to increase around the world, and it is important to explore new methods for evaluating the quality of ratings obtained from raters. This study describes an unfolding model for examining rater accuracy. Accuracy is defined as the difference between observed and expert ratings. Dichotomous accuracy ratings (0 = inaccurate, 1 = accurate) are unfolded into three latent categories: inaccurate below expert ratings, accurate ratings, and inaccurate above expert ratings. The hyperbolic cosine model (HCM) is used to examine dichotomous accuracy ratings from a statewide writing assessment. This study suggests that HCM is a promising approach for examining rater accuracy, and that the HCM can provide a useful interpretive framework for evaluating the quality of ratings obtained within the context of rater-mediated assessments.

摘要

全球范围内绩效评估的数量持续增加,探索评估评分者所给评分质量的新方法很重要。本研究描述了一种用于检验评分者准确性的展开模型。准确性被定义为观察到的评分与专家评分之间的差异。二分制准确性评分(0 = 不准确,1 = 准确)被展开为三个潜在类别:低于专家评分的不准确、准确评分以及高于专家评分的不准确。双曲余弦模型(HCM)用于检验来自全州写作评估的二分制准确性评分。本研究表明,双曲余弦模型是检验评分者准确性的一种有前景的方法,并且双曲余弦模型可以为评估在评分者介导的评估背景下所获得评分的质量提供一个有用的解释框架。