使用置信区间评估真实测试的可靠性。

Court of Audit, The Hague, Netherlands.

University of Amsterdam, Amsterdam, Netherlands.

Assessment. 2019 Oct;26(7):1207-1216. doi: 10.1177/1073191117737375. Epub 2017 Oct 31.

Test authors report sample reliability values but rarely consider the sampling error and related confidence intervals. This study investigated the truth of this conjecture for 116 tests with 1,024 reliability estimates (105 pertaining to test batteries and 919 to tests measuring a single attribute) obtained from an online database. Based on 90% confidence intervals, approximately 20% of the initial quality assessments had to be downgraded. For 95% confidence intervals, the percentage was approximately 23%. The results demonstrated that reported reliability values cannot be trusted without considering their estimation precision.

测试作者报告样本可靠性值，但很少考虑抽样误差和相关置信区间。本研究调查了来自在线数据库的 116 项测试的 105 项测试组合和 919 项单项属性测试的 1024 个可靠性估计值的这一推测的真实性。基于 90%置信区间，初始质量评估中约有 20%需要降级。对于 95%置信区间，该百分比约为 23%。结果表明，如果不考虑估计精度，就不能信任报告的可靠性值。

Suppr 超能文献

核心技术专利：CN118964589B侵权必究

Using Confidence Intervals for Assessing Reliability of Real Tests.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献