• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Evaluating Equating Methods for Varying Levels of Form Difference.评估不同形式差异水平下的等值方法。
Educ Psychol Meas. 2024 Jun;84(3):510-529. doi: 10.1177/00131644231176989. Epub 2023 Jun 8.
2
Calculating Bias in Test Score Equating in a NEAT Design.计算NEAT设计中测验分数等值的偏差
Appl Psychol Meas. 2025 Mar 24:01466216251330305. doi: 10.1177/01466216251330305.
3
The NEAT Equating Via Chaining Random Forests in the Context of Small Sample Sizes: A Machine-Learning Method.小样本量情况下通过链式随机森林实现的NEAT等值性:一种机器学习方法
Educ Psychol Meas. 2023 Oct;83(5):984-1006. doi: 10.1177/00131644221120899. Epub 2022 Sep 4.
4
Comparison of proficiency in an anesthesiology course across distinct medical student cohorts: psychometric approaches to test equating.不同医学学生群体在麻醉学课程中的熟练程度比较:用于测试等值性的心理测量方法。
J Chin Med Assoc. 2014 Mar;77(3):150-4. doi: 10.1016/j.jcma.2013.10.011. Epub 2013 Nov 28.
5
A Comparison of IRT Observed Score Kernel Equating and Several Equating Methods.IRT观测分数核等值法与几种等值方法的比较
Front Psychol. 2020 Mar 6;11:308. doi: 10.3389/fpsyg.2020.00308. eCollection 2020.
6
Longitudinal stability of IRT and equivalent-groups linear and equipercentile equating.项目反应理论(IRT)以及等效组线性和等百分位等值的纵向稳定性。
Psychol Rep. 2013 Aug;113(1):1303-25. doi: 10.2466/03.10.pr0.113x11z6.
7
Evaluating Different Equating Setups in the Continuous Item Pool Calibration for Computerized Adaptive Testing.评估计算机自适应测试连续项目池校准中的不同等值设置
Front Psychol. 2019 Jun 6;10:1277. doi: 10.3389/fpsyg.2019.01277. eCollection 2019.
8
Local Equating of Cognitively Diagnostic Modeled Observed Scores.认知诊断模型观测分数的局部等值
Appl Psychol Meas. 2015 Jan;39(1):44-61. doi: 10.1177/0146621614542427. Epub 2014 Jul 23.
9
Standard Errors of Kernel Equating: Accounting for Bandwidth Estimation.核等值的标准误差:带宽估计的考量
Appl Psychol Meas. 2022 May;46(3):200-218. doi: 10.1177/01466216211066601. Epub 2022 Mar 7.
10
Investigating the Effect of Differential Rapid Guessing on Population Invariance in Equating.探究差异快速猜测对等值中总体不变性的影响。
Appl Psychol Meas. 2022 Oct;46(7):589-604. doi: 10.1177/01466216221108991. Epub 2022 Jun 16.

评估不同形式差异水平下的等值方法。

Evaluating Equating Methods for Varying Levels of Form Difference.

作者信息

Sun Ting, Kim Stella Yun

机构信息

University of Utah, Salt Lake City, USA.

University of North Carolina at Charlotte, USA.

出版信息

Educ Psychol Meas. 2024 Jun;84(3):510-529. doi: 10.1177/00131644231176989. Epub 2023 Jun 8.

DOI:10.1177/00131644231176989
PMID:38756465
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11095324/
Abstract

Equating is a statistical procedure used to adjust for the difference in form difficulty such that scores on those forms can be used and interpreted comparably. In practice, however, equating methods are often implemented without considering the extent to which two forms differ in difficulty. The study aims to examine the effect of the magnitude of a form difficulty difference on equating results under random group (RG) and common-item nonequivalent group (CINEG) designs. Specifically, this study evaluates the performance of six equating methods under a set of simulation conditions including varying levels of form difference. Results revealed that, under the RG design, mean equating was proven to be the most accurate method when there is no or small form difference, whereas equipercentile is the most accurate method when the difficulty difference is medium or large. Under the CINEG design, Tucker Linear was found to be the most accurate method when the difficulty difference is medium or small, and either chained equipercentile or frequency estimation is preferred with a large difficulty level. This study would provide practitioners with research evidence-based guidance in the choice of equating methods with varying levels of form difference. As the condition of no form difficulty difference is also included, this study would inform testing companies of appropriate equating methods when two forms are similar in difficulty level.

摘要

等值化是一种统计程序,用于调整试卷形式难度的差异,以便能够以可比的方式使用和解释这些试卷上的分数。然而,在实际操作中,实施等值化方法时往往没有考虑两种试卷在难度上的差异程度。本研究旨在考察试卷难度差异幅度对随机组(RG)设计和共同题目非等组(CINEG)设计下等值化结果的影响。具体而言,本研究在包括不同试卷差异水平的一组模拟条件下,评估了六种等值化方法的性能。结果显示,在RG设计下,当试卷差异不存在或较小时,均值等值化被证明是最准确的方法,而当难度差异为中等或较大时,等百分位等值化是最准确的方法。在CINEG设计下,当难度差异为中等或较小时,发现塔克线性等值化是最准确的方法,而当难度水平较大时,链式等百分位等值化或频率估计法更受青睐。本研究将为从业者在选择不同试卷差异水平的等值化方法时提供基于研究证据的指导。由于也包括了不存在试卷难度差异的情况,本研究将告知测试公司在两种试卷难度水平相似时适用的等值化方法。