• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用莫肯量表分析探索评分者介导评估中的评分质量

Exploring Rating Quality in Rater-Mediated Assessments Using Mokken Scale Analysis.

作者信息

Wind Stefanie A, Engelhard George

机构信息

The University of Alabama, Tuscaloosa, AL, USA.

The University of Georgia, Athens, GA, USA.

出版信息

Educ Psychol Meas. 2016 Aug;76(4):685-706. doi: 10.1177/0013164415604704. Epub 2015 Sep 17.

DOI:10.1177/0013164415604704
PMID:29795883
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5965569/
Abstract

Mokken scale analysis is a probabilistic nonparametric approach that offers statistical and graphical tools for evaluating the quality of social science measurement without placing potentially inappropriate restrictions on the structure of a data set. In particular, Mokken scaling provides a useful method for evaluating important measurement properties, such as invariance, in contexts where response processes are not well understood. Because rater-mediated assessments involve complex interactions among many variables, including assessment contexts, student artifacts, rubrics, individual rater characteristics, and others, rater-assigned scores are suitable candidates for Mokken scale analysis. The purposes of this study are to describe a suite of indices that can be used to explore the psychometric quality of data from rater-mediated assessments and to illustrate the substantive interpretation of Mokken-based statistics and displays in this context. Techniques that are commonly used in polytomous applications of Mokken scaling are adapted for use with rater-mediated assessments, with a focus on the substantive interpretation related to individual raters. Overall, the findings suggest that indices of rater monotonicity, rater scalability, and invariant rater ordering based on Mokken scaling provide diagnostic information at the level of individual raters related to the requirements for invariant measurement. These Mokken-based indices serve as an additional suite of diagnostic tools for exploring the quality of data from rater-mediated assessments that can supplement rating quality indices based on parametric models.

摘要

莫肯量表分析是一种概率非参数方法,它提供了统计和图形工具,用于评估社会科学测量的质量,而无需对数据集的结构施加可能不适当的限制。特别是,莫肯量表法提供了一种有用的方法,用于在响应过程不太清楚的情况下评估重要的测量属性,如不变性。由于评分者介导的评估涉及许多变量之间的复杂相互作用,包括评估背景、学生作品、评分标准、个体评分者特征等,评分者给出的分数是莫肯量表分析的合适候选对象。本研究的目的是描述一套可用于探索评分者介导评估数据的心理测量质量的指标,并说明在此背景下基于莫肯量表的统计数据和显示的实质性解释。莫肯量表法在多分类应用中常用的技术被改编用于评分者介导的评估,重点是与个体评分者相关的实质性解释。总体而言,研究结果表明,基于莫肯量表的评分者单调性、评分者可扩展性和不变评分者排序指标,在个体评分者层面提供了与不变测量要求相关的诊断信息。这些基于莫肯量表的指标可作为另一套诊断工具,用于探索评分者介导评估数据的质量,以补充基于参数模型的评分质量指标。

相似文献

1
Exploring Rating Quality in Rater-Mediated Assessments Using Mokken Scale Analysis.使用莫肯量表分析探索评分者介导评估中的评分质量
Educ Psychol Meas. 2016 Aug;76(4):685-706. doi: 10.1177/0013164415604704. Epub 2015 Sep 17.
2
Examining rating scales using Rasch and Mokken models for rater-mediated assessments.使用拉施模型和莫肯模型检查评分量表以进行评分者介导的评估。
J Appl Meas. 2014;15(2):100-32.
3
Adjacent-Categories Mokken Models for Rater-Mediated Assessments.用于评分者介导评估的相邻类别莫肯模型
Educ Psychol Meas. 2017 Apr;77(2):330-350. doi: 10.1177/0013164416643826. Epub 2016 Apr 18.
4
Exploring Incomplete Rating Designs With Mokken Scale Analysis.运用莫肯量表分析探索不完全评分设计
Educ Psychol Meas. 2018 Apr;78(2):319-342. doi: 10.1177/0013164416675393. Epub 2016 Oct 23.
5
Exploring Within-Rater Category Ordering: A Simulation Study Using Adjacent-Categories Mokken Scale Analysis.探索评分者内类别排序:一项使用相邻类别莫肯量表分析的模拟研究
Educ Psychol Meas. 2018 Oct;78(5):887-904. doi: 10.1177/0013164417724841. Epub 2017 Aug 4.
6
Examining the Psychometric Quality of Multiple-Choice Assessment Items using Mokken Scale Analysis.使用莫肯量表分析检验多项选择题评估项目的心理测量质量。
J Appl Meas. 2016;17(2):142-165.
7
Investigating psychometric properties and dimensional structure of an educational environment measure (DREEM) using Mokken scale analysis - a pragmatic approach.采用莫克尺度分析研究教育环境量表(DREEM)的心理计量学特性和维度结构 - 一种实用方法。
BMC Med Educ. 2018 Oct 11;18(1):235. doi: 10.1186/s12909-018-1334-8.
8
Mokken scale analysis of mental health and well-being questionnaire item responses: a non-parametric IRT method in empirical research for applied health researchers.心理健康和幸福感问卷项目反应的莫肯量表分析:应用健康研究中实证研究的一种非参数 IRT 方法。
BMC Med Res Methodol. 2012 Jun 11;12:74. doi: 10.1186/1471-2288-12-74.
9
Exploring the Impersonal Judgments and Personal Preferences of Raters in Rater-Mediated Assessments With Unfolding Models.使用展开模型在评分者介导评估中探究评分者的客观判断和个人偏好。
Educ Psychol Meas. 2019 Aug;79(4):773-795. doi: 10.1177/0013164419827345. Epub 2019 Feb 5.
10
Examining Rater Judgements in Music Performance Assessment using Many-Facets Rasch Rating Scale Measurement Model.使用多面Rasch评分量表测量模型检验音乐表演评估中的评分者判断。
J Appl Meas. 2019;20(1):79-99.

引用本文的文献

1
Understanding Rater Cognition in Performance Assessment: A Mixed IRTree Approach.理解绩效评估中的评分者认知:一种混合IRTree方法。
Appl Psychol Meas. 2025 Apr 14:01466216251333578. doi: 10.1177/01466216251333578.
2
Exploring the Impersonal Judgments and Personal Preferences of Raters in Rater-Mediated Assessments With Unfolding Models.使用展开模型在评分者介导评估中探究评分者的客观判断和个人偏好。
Educ Psychol Meas. 2019 Aug;79(4):773-795. doi: 10.1177/0013164419827345. Epub 2019 Feb 5.
3
Exploring Within-Rater Category Ordering: A Simulation Study Using Adjacent-Categories Mokken Scale Analysis.探索评分者内类别排序:一项使用相邻类别莫肯量表分析的模拟研究
Educ Psychol Meas. 2018 Oct;78(5):887-904. doi: 10.1177/0013164417724841. Epub 2017 Aug 4.
4
Exploring Incomplete Rating Designs With Mokken Scale Analysis.运用莫肯量表分析探索不完全评分设计
Educ Psychol Meas. 2018 Apr;78(2):319-342. doi: 10.1177/0013164416675393. Epub 2016 Oct 23.
5
Adjacent-Categories Mokken Models for Rater-Mediated Assessments.用于评分者介导评估的相邻类别莫肯模型
Educ Psychol Meas. 2017 Apr;77(2):330-350. doi: 10.1177/0013164416643826. Epub 2016 Apr 18.

本文引用的文献

1
Fitting Item Response Theory Models to Two Personality Inventories: Issues and Insights.拟合两个人格量表的项目反应理论模型:问题与启示。
Multivariate Behav Res. 2001 Oct 1;36(4):523-62. doi: 10.1207/S15327906MBR3604_03.
2
Examining rating scales using Rasch and Mokken models for rater-mediated assessments.使用拉施模型和莫肯模型检查评分量表以进行评分者介导的评估。
J Appl Meas. 2014;15(2):100-32.
3
Examining rating quality in writing assessment: rater agreement, error, and accuracy.审视写作评估中的评分质量:评分者一致性、误差与准确性。
J Appl Meas. 2012;13(4):321-35.
4
Item and rater analysis of constructed response items via the multi-faceted Rasch model.通过多面Rasch模型对建构反应题项进行题项与评分者分析。
J Appl Meas. 2009;10(3):335-47.
5
Using classical and modern measurement theories to explore rater, domain, and gender influences on student writing ability.运用经典和现代测量理论,探究评分者、领域和性别对学生写作能力的影响。
J Appl Meas. 2009;10(3):225-46.
6
Item response theory and clinical measurement.项目反应理论与临床测量。
Annu Rev Clin Psychol. 2009;5:27-48. doi: 10.1146/annurev.clinpsy.032408.153553.
7
The technic of homogeneous tests compared with some aspects of scale analysis and factor analysis.同质性检验技术与量表分析和因子分析的某些方面的比较。
Psychol Bull. 1948 Nov;45(6):507-29. doi: 10.1037/h0055827.
8
Analyzing psychopathology items: a case for nonparametric item response theory modeling.分析精神病理学项目:非参数项目反应理论建模的一个实例
Psychol Methods. 2004 Sep;9(3):354-68. doi: 10.1037/1082-989X.9.3.354.
9
Constructing rater and task banks for performance assessments.构建用于绩效评估的评分者库和任务库。
J Outcome Meas. 1997;1(1):19-33.