Putka Dan J, Le Huy, McCloy Rodney A, Diaz Tirso
Human Resources Research Organization, Alexandria, Virginia 22314-1591, USA.
J Appl Psychol. 2008 Sep;93(5):959-81. doi: 10.1037/0021-9010.93.5.959.
Organizational research and practice involving ratings are rife with what the authors term ill-structured measurement designs (ISMDs)--designs in which raters and ratees are neither fully crossed nor nested. This article explores the implications of ISMDs for estimating interrater reliability. The authors first provide a mock example that illustrates potential problems that ISMDs create for common reliability estimators (e.g., Pearson correlations, intraclass correlations). Next, the authors propose an alternative reliability estimator--G(q,k)--that resolves problems with traditional estimators and is equally appropriate for crossed, nested, and ill-structured designs. By using Monte Carlo simulation, the authors evaluate the accuracy of traditional reliability estimators compared with that of G(q,k) for ratings arising from ISMDs. Regardless of condition, G(q,k) yielded estimates as precise or more precise than those of traditional estimators. The advantage of G(q,k) over the traditional estimators became more pronounced with increases in the (a) overlap between the sets of raters that rated each ratee and (b) ratio of rater main effect variance to true score variance. Discussion focuses on implications of this work for organizational research and practice.
涉及评级的组织研究与实践中充斥着作者所称的结构不良测量设计(ISMDs),即评级者与被评级者既非完全交叉也非嵌套的设计。本文探讨了ISMDs对估计评分者间信度的影响。作者首先给出一个模拟示例,说明ISMDs给常用信度估计方法(如皮尔逊相关系数、组内相关系数)带来的潜在问题。接下来,作者提出一种替代信度估计方法——G(q,k),它解决了传统估计方法存在的问题,同样适用于交叉、嵌套和结构不良设计。通过蒙特卡洛模拟,作者评估了传统信度估计方法与G(q,k)对于ISMDs产生的评级的准确性。无论在何种条件下,G(q,k)得出的估计值都与传统估计方法一样精确或更精确。随着(a)对每个被评级者进行评级的评级者集合之间的重叠程度以及(b)评级者主效应方差与真分数方差之比的增加,G(q,k)相对于传统估计方法的优势变得更加明显。讨论聚焦于这项工作对组织研究与实践的影响。