Suppr超能文献

为进展测试计划创建测试蓝图:一种配对比较方法。

Creating a test blueprint for a progress testing program: A paired-comparisons approach.

机构信息

a Education Research, Co-Chair of CTEC Assessment and Ed-Tech Subcommittee, Faculty of Dentistry , University of British Columbia , British Columbia , Canada.

b Department of Leadership, Higher and Adult Education , University of Toronto , Toronto , Canada.

出版信息

Med Teach. 2018 Mar;40(3):267-274. doi: 10.1080/0142159X.2017.1403015. Epub 2017 Nov 24.

Abstract

CONTEXT

Creating a new testing program requires the development of a test blueprint that will determine how the items on each test form are distributed across possible content areas and practice domains. To achieve validity, categories of a blueprint are typically based on the judgments of content experts. How experts judgments are elicited and combined is important to the quality of resulting test blueprints.

METHODS

Content experts in dentistry participated in a day-long faculty-wide workshop to discuss, refine, and confirm the categories and their relative weights. After reaching agreement on categories and their definitions, experts judged the relative importance between category pairs, registering their judgments anonymously using iClicker, an audience response system. Judgments were combined in two ways: a simple calculation that could be performed during the workshop and a multidimensional scaling of the judgments performed later.

RESULTS

Content experts were able to produce a set of relative weights using this approach. The multidimensional scaling yielded a three-dimensional model with the potential to provide deeper insights into the basis of the experts' judgments.

CONCLUSION

The approach developed and demonstrated in this study can be applied across academic disciplines to elicit and combine content experts judgments for the development of test blueprints.

摘要

背景

创建新的测试程序需要开发一个测试蓝图,该蓝图将决定每个测试表单上的项目如何分布在可能的内容领域和实践领域中。为了实现有效性,蓝图的类别通常基于内容专家的判断。专家判断是如何得出和组合的,对最终测试蓝图的质量很重要。

方法

牙科学的内容专家参加了为期一天的全院教员研讨会,讨论、完善和确认类别及其相对权重。在就类别及其定义达成一致意见后,专家们对类别对之间的相对重要性进行了判断,使用 iClicker(一种观众响应系统)匿名登记他们的判断。判断以两种方式进行组合:一种是在研讨会上进行的简单计算,另一种是判断的多维标度。

结果

内容专家使用这种方法能够得出一组相对权重。多维标度产生了一个具有三维模型的潜力,可以更深入地了解专家判断的基础。

结论

本研究中开发和演示的方法可以应用于各个学科领域,以征求和组合内容专家的判断,制定测试蓝图。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验