Emons Wilco H M, Sijtsma Klaas, Meijer Rob R
Department of Methodology and Statistics FSW, Tilburg University, Tilburg, Netherlands.
Psychol Methods. 2007 Mar;12(1):105-20. doi: 10.1037/1082-989X.12.1.105.
Short tests containing at most 15 items are used in clinical and health psychology, medicine, and psychiatry for making decisions about patients. Because short tests have large measurement error, the authors ask whether they are reliable enough for classifying patients into a treatment and a nontreatment group. For a given certainty level, proportions of correct classifications were computed for varying test length, cut-scores, item scoring, and choices of item parameters. Short tests were found to classify at most 50% of a group consistently. Results were much better for tests containing 20 or 40 items. Small differences were found between dichotomous and polytomous (5 ordered scores) items. It is recommended that short tests for high-stakes decision making be used in combination with other information so as to increase reliability and classification consistency.
在临床与健康心理学、医学及精神病学领域,为对患者做出决策,会使用最多包含15个项目的简短测试。由于简短测试存在较大测量误差,作者们探讨这些测试在将患者分为治疗组和非治疗组时是否足够可靠。对于给定的置信水平,针对不同的测试长度、划界分数、项目计分以及项目参数选择,计算出正确分类的比例。结果发现,简短测试最多只能始终如一地将一组中的50%进行分类。对于包含20或40个项目的测试,结果要好得多。在二分法项目和多分法(5个有序分数)项目之间发现的差异较小。建议将用于高风险决策的简短测试与其他信息结合使用,以提高可靠性和分类一致性。