不同的书面评估方法：关于它们的优点和缺点能说些什么？

Different written assessment methods: what can be said about their strengths and weaknesses?

作者信息

Schuwirth Lambert W T, van der Vleuten Cees P M

机构信息

Department of Educational Development and Research, University of Maastricht, PO Box 616, 6200 MD Maastricht, The Netherlands.

出版信息

Med Educ. 2004 Sep;38(9):974-9. doi: 10.1111/j.1365-2929.2004.01916.x.

DOI:10.1111/j.1365-2929.2004.01916.x

PMID:15327679

Abstract

INTRODUCTION

Written assessment techniques can be subdivided according to their stimulus format--what the question asks--and their response format--how the answer is recorded. The former is more important in determining the type of competence being asked for than the latter. It is nevertheless important to consider both when selecting the most appropriate types. Some major elements to consider when making such a selection are cueing effect, reliability, validity, educational impact and resource-intensiveness.

RESPONSE FORMATS

Open-ended questions should be used solely to test aspects that cannot be tested with multiple-choice questions. In all other cases the loss of reliability and the higher resource-intensiveness represent a significant downside. In such cases, multiple-choice questions are not less valid than open-ended questions.

STIMULUS FORMAT

When making this distinction, it is important to consider whether the question is embedded within a relevant case or context and cannot be answered without the case, or not. This appears to be more or less essential according to what is being tested by the question. Context-rich questions test other cognitive skills than do context-free questions. If knowledge alone is the purpose of the test, context-free questions may be useful, but if it is the application of knowledge or knowledge as a part of problem solving that is being tested, then context is indispensable.

CONCLUSION

Every format has its (dis)advantages and a combination of formats based on rational selection is more useful than trying to find or develop a panacea. The response format is less important in this respect than the stimulus.

摘要

引言

书面评估技术可根据其刺激形式（即问题所问内容）和反应形式（即答案的记录方式）进行细分。在确定所要求的能力类型方面，前者比后者更重要。然而，在选择最合适的类型时，同时考虑两者很重要。进行这种选择时要考虑的一些主要因素包括提示效应、可靠性、有效性、教育影响和资源密集度。

反应形式

开放式问题应仅用于测试无法用多项选择题测试的方面。在所有其他情况下，可靠性的损失和更高的资源密集度是一个显著的缺点。在这种情况下，多项选择题的有效性并不低于开放式问题。

刺激形式

在进行这种区分时，重要的是要考虑问题是否嵌入在相关案例或情境中，以及没有该案例是否无法回答。根据问题所测试的内容，这似乎或多或少是必不可少的。与无情境问题相比，情境丰富的问题测试的是其他认知技能。如果测试的目的仅仅是知识，无情境问题可能有用，但如果测试的是知识的应用或作为解决问题一部分的知识，那么情境是不可或缺的。