Charter Richard A
Department of Veterans Affairs, VA Long Beach Healthcare System, CA 90822, USA.
J Gen Psychol. 2003 Jul;130(3):290-304. doi: 10.1080/00221300309601160.
The author presented descriptive statistics for 937 reliability coefficients for various reliability methods (e.g., alpha) and test types (e.g., intelligence). He compared the average reliability coefficients with the reliability standards that are suggested by experts and found that most average reliabilities were less than ideal. Correlations showed that over the past several decades there has been neither a rise nor a decline in the value of internal consistency, retest, or interjudge reliability coefficients. Of the internal consistency approaches, there has been an increase in the use of coefficient alpha, whereas use of the split-half method has decreased over time. Decision analysis and true-score confidence intervals showed how low reliability can result in clinical decision errors.
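The abstract's closing point, that low reliability widens true-score confidence intervals and can lead to decision errors, can be illustrated with a minimal sketch. It assumes the standard classical test theory formulas: the estimated true score is M + r(X - M), and the standard error of estimation is SD * sqrt(r(1 - r)). The abstract does not say which formulas the article used, so the function name, parameters, and the example numbers (score 70, mean 100, SD 15) are hypothetical and for illustration only.

```python
import math


def true_score_interval(observed, mean, sd, reliability, z=1.96):
    """Return a (lower, upper) confidence interval for the true score.

    Assumption: regression-based classical test theory formulas,
    not necessarily the exact procedure used in the article.
    """
    if not 0.0 <= reliability <= 1.0:
        raise ValueError("reliability must be between 0 and 1")
    # Estimated true score: the observed score regressed toward the mean.
    estimated_true = mean + reliability * (observed - mean)
    # Standard error of estimation for the true score.
    se_estimation = sd * math.sqrt(reliability * (1.0 - reliability))
    margin = z * se_estimation
    return estimated_true - margin, estimated_true + margin


if __name__ == "__main__":
    # Hypothetical example: observed score of 70 on a scale with mean 100, SD 15.
    for r in (0.95, 0.70):
        low, high = true_score_interval(70, 100, 15, r)
        print(f"reliability={r:.2f}: 95% interval = ({low:.1f}, {high:.1f})")
```

With these illustrative inputs, the interval at a reliability of .70 is roughly twice as wide as at .95, so a decision based on a fixed cutoff score is much less certain at the lower reliability.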