Court of Audit, The Hague, Netherlands.
University of Amsterdam, Amsterdam, Netherlands.
Assessment. 2019 Oct;26(7):1207-1216. doi: 10.1177/1073191117737375. Epub 2017 Oct 31.
Test authors report sample reliability values but rarely consider the sampling error and related confidence intervals. This study investigated the truth of this conjecture for 116 tests with 1,024 reliability estimates (105 pertaining to test batteries and 919 to tests measuring a single attribute) obtained from an online database. Based on 90% confidence intervals, approximately 20% of the initial quality assessments had to be downgraded. For 95% confidence intervals, the percentage was approximately 23%. The results demonstrated that reported reliability values cannot be trusted without considering their estimation precision.
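The downgrading logic described above can be illustrated with a minimal sketch. It is not the study's procedure: the abstract does not say how the confidence intervals were computed or which quality thresholds were applied. The sketch assumes a correlation-type reliability coefficient (for example, a test-retest correlation) and uses the Fisher z-transformation to approximate the interval. The function names `reliability_ci` and `is_downgraded` and the threshold of 0.80 are hypothetical, chosen for illustration only.

```python
import math
from statistics import NormalDist


def reliability_ci(r, n, level=0.90):
    """Approximate two-sided CI for a correlation-type reliability estimate.

    Assumption: the Fisher z-transformation is used, which suits
    correlation-based coefficients; other coefficients (such as
    coefficient alpha) call for different interval methods.
    """
    if not -1.0 < r < 1.0:
        raise ValueError("r must lie strictly between -1 and 1")
    if n <= 3:
        raise ValueError("n must be greater than 3")
    z = math.atanh(r)                      # Fisher z of the sample estimate
    se = 1.0 / math.sqrt(n - 3)            # approximate standard error of z
    crit = NormalDist().inv_cdf(0.5 + level / 2.0)  # two-sided critical value
    return math.tanh(z - crit * se), math.tanh(z + crit * se)


def is_downgraded(r, n, threshold, level=0.90):
    """True if the point estimate meets the threshold but the CI lower bound does not."""
    lower, _ = reliability_ci(r, n, level)
    return r >= threshold and lower < threshold


# Hypothetical example: a reported reliability of 0.82 from 100 respondents,
# judged against an illustrative threshold of 0.80.
lower, upper = reliability_ci(0.82, 100, level=0.90)
print(f"90% CI: [{lower:.3f}, {upper:.3f}]")
print("downgraded:", is_downgraded(0.82, 100, threshold=0.80))
```

Under these assumptions, a point estimate just above a threshold can fall below it once sampling error is taken into account, and a wider interval (95% rather than 90%) can only increase the number of such cases, in line with the pattern reported above.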