The Use of Multivariate Generalizability Theory to Evaluate the Quality of Subscores.

Authors

Jiang Zhehan, Raymond Mark

Affiliations

The University of Alabama, Tuscaloosa, AL, USA.

National Board of Medical Examiners, Philadelphia, PA, USA.

Publication information

Appl Psychol Meas. 2018 Nov;42(8):595-612. doi: 10.1177/0146621618758698. Epub 2018 Apr 3.

DOI: 10.1177/0146621618758698
PMID: 30559569
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6291891/

Abstract

Conventional methods for evaluating the utility of subscores rely on reliability and correlation coefficients. However, correlations can overlook a notable source of variability: variation in subtest means/difficulties. Brennan introduced a reliability index for score profiles based on multivariate generalizability theory, designated as 𝒢, which is sensitive to variation in subtest difficulty. However, there has been little, if any, research evaluating the properties of this index. A series of simulation experiments, as well as analyses of real data, were conducted to investigate 𝒢 under various conditions of subtest reliability, subtest correlations, and variability in subtest means. Three pilot studies evaluated 𝒢 in the context of a single group of examinees. Results of the pilots indicated that 𝒢 indices were typically low; across the 108 experimental conditions, 𝒢 ranged from .23 to .86, with an overall mean of 0.63. The findings were consistent with previous research, indicating that subscores often do not have interpretive value. Importantly, there were many conditions for which the correlation-based method known as proportion reduction in mean-square error (PRMSE; Haberman, 2006) indicated that subscores were worth reporting, but for which values of 𝒢 fell into the .50s, .60s, and .70s. The main study investigated 𝒢 within the context of score profiles for examinee subgroups. Again, not only 𝒢 indices were generally low, but it was also found that 𝒢 can be sensitive to subgroup differences when PRMSE is not. Analyses of real data and subsequent discussion address how 𝒢 can supplement PRMSE for characterizing the quality of subscores.

Similar articles

1. The Use of Multivariate Generalizability Theory to Evaluate the Quality of Subscores.
   Appl Psychol Meas. 2018 Nov;42(8):595-612. doi: 10.1177/0146621618758698. Epub 2018 Apr 3.
2. Indices of Subscore Utility for Individuals and Subgroups Based on Multivariate Generalizability Theory.
   Educ Psychol Meas. 2020 Feb;80(1):67-90. doi: 10.1177/0013164419846936. Epub 2019 May 16.
3. Estimating Between-Person and Within-Person Subscore Reliability with Profile Analysis.
   Multivariate Behav Res. 2017 Jan-Feb;52(1):86-104. doi: 10.1080/00273171.2016.1253452. Epub 2016 Nov 29.
4. Does subgroup membership information lead to better estimation of true subscores?
   Br J Math Stat Psychol. 2013 Nov;66(3):452-69. doi: 10.1111/j.2044-8317.2012.02061.x. Epub 2012 Oct 29.
5. Reliability and validity of the psychiatry resident in-training examination.
   Acad Psychiatry. 1990 Sep;14(3):115-21. doi: 10.1007/BF03341282.
6. The Evidence for a Subscore Structure in a Test of English Language Competency for English Language Learners.
   Educ Psychol Meas. 2015 Oct;75(5):805-825. doi: 10.1177/0013164414554416. Epub 2014 Nov 6.
7. Development and validation of the nasopharyngeal cancer scale among the system of quality of life instruments for cancer patients (QLICP-NA V2.0): combined classical test theory and generalizability theory.
   Qual Life Res. 2016 Aug;25(8):2087-100. doi: 10.1007/s11136-016-1251-4. Epub 2016 Feb 29.
8. Using Generalizability Theory to Disattenuate Correlation Coefficients for Multiple Sources of Measurement Error.
   Multivariate Behav Res. 2018 Jul-Aug;53(4):481-501. doi: 10.1080/00273171.2018.1457938. Epub 2018 May 2.
9. An Investigation of the Sources of Measurement Error in the Post-Encounter Written Scores from Standardized Patient Examinations.
   Adv Health Sci Educ Theory Pract. 1998;3(2):89-100. doi: 10.1023/A:1009712810810.
10. Assessment and statistical modeling of the relationship between remotely sensed aerosol optical depth and PM2.5 in the eastern United States.
    Res Rep Health Eff Inst. 2012 May(167):5-83; discussion 85-91.

Cited by

1. Customizing Bayesian multivariate generalizability theory to mixed-format tests.
   Behav Res Methods. 2024 Oct;56(7):8080-8090. doi: 10.3758/s13428-024-02472-7. Epub 2024 Jul 29.
2. Indices of Subscore Utility for Individuals and Subgroups Based on Multivariate Generalizability Theory.
   Educ Psychol Meas. 2020 Feb;80(1):67-90. doi: 10.1177/0013164419846936. Epub 2019 May 16.
3. Integrating Differential Evolution Optimization to Cognitive Diagnostic Model Estimation.
   Front Psychol. 2018 Nov 6;9:2142. doi: 10.3389/fpsyg.2018.02142. eCollection 2018.
4. Gibbs Samplers for Logistic Item Response Models via the Pólya-Gamma Distribution: A Computationally Efficient Data-Augmentation Strategy.
   Psychometrika. 2019 Jun;84(2):358-374. doi: 10.1007/s11336-018-9641-x. Epub 2018 Oct 31.

References

1. A Bayesian approach to estimating variance components within a multivariate generalizability theory framework.
   Behav Res Methods. 2018 Dec;50(6):2193-2214. doi: 10.3758/s13428-017-0986-3.
2. Cognitive psychology meets psychometric theory: on the relation between process models for decision making and latent variable models for individual differences.
   Psychol Rev. 2011 Apr;118(2):339-356. doi: 10.1037/a0022749.
3. The validity of subscores for a credentialing test.
   Eval Health Prof. 2004 Dec;27(4):349-68. doi: 10.1177/0163278704270010.
4. Assessing similarity between profiles.
   Psychol Bull. 1953 Nov;50(6):456-73. doi: 10.1037/h0057173.