Norcini J J
Institute for Clinical Evaluation, American Board of Internal Medicine, Philadelphia, Pennsylvania 19106-3699, USA.
Acad Med. 1999 Oct;74(10):1088-90. doi: 10.1097/00001888-199910000-00010.
The purpose of this paper is to identify situations in which two rules of thumb in evaluation do not apply. The first rule is that all standards should be absolute. When selection decisions are being made or when classroom tests are given, however, relative standards may be better. The second rule of thumb is that every test should have a reliability of .80 or better. Depending on the circumstances, though, the standard error of measurement, the consistency of pass/fail classifications, and the domain-referenced reliability coefficients may be better indicators of reproducibility.
本文的目的是识别评估中的两条经验法则不适用的情况。第一条法则是所有标准都应该是绝对的。然而,在做出选拔决定或进行课堂测试时,相对标准可能更好。第二条经验法则是每个测试的信度应该达到0.80或更高。不过,根据具体情况,测量标准误差、通过/失败分类的一致性以及领域参照信度系数可能是再现性的更好指标。