KU Leuven Department of Development and Regeneration, Leuven, Belgium.
BMC Med Res Methodol. 2013 Oct 23;13:128. doi: 10.1186/1471-2288-13-128.
In multicenter studies, center-specific variations in measurements may arise for various reasons, such as low interrater reliability, differences in equipment, deviations from the protocol, sociocultural characteristics, and differences in patient populations due to e.g. local referral patterns. The aim of this research is to derive measures for the degree of clustering. We present a method to detect heavily clustered variables and to identify physicians with outlying measurements.
We use regression models with fixed effects to account for patient case-mix and a random cluster intercept to study clustering by physicians. We propose to use the residual intraclass correlation (RICC), the proportion of residual variance that is situated at the cluster level, to detect variables that are influenced by clustering. An RICC of 0 indicates that the variance in the measurements is not due to variation between clusters. We further suggest, where appropriate, to evaluate RICC in combination with R2, the proportion of variance that is explained by the fixed effects. Variables with a high R2 may have benefits that outweigh the disadvantages of clustering in terms of statistical analysis. We apply the proposed methods to a dataset collected for the development of models for ovarian tumor diagnosis. We study the variability in 18 tumor characteristics collected through ultrasound examination, 4 patient characteristics, and the serum marker CA-125 measured by 40 physicians on 2407 patients.
The RICC showed large variation between variables: from 2.2% for age to 25.1% for the amount of fluid in the pouch of Douglas. Seven variables had an RICC above 15%, indicating that a considerable part of the variance is due to systematic differences at the physician level, rather than random differences at the patient level. Accounting for differences in ultrasound machine quality reduced the RICC for a number of blood flow measurements.
We recommend that the degree of data clustering is addressed during the monitoring and analysis of multicenter studies. The RICC is a useful tool that expresses the degree of clustering as a percentage. Specific applications are data quality monitoring and variable screening prior to the development of a prediction model.
在多中心研究中,由于各种原因,如观察者间可靠性低、设备差异、偏离方案、社会文化特征以及由于当地转诊模式等导致的患者人群差异,可能会出现中心特异性测量值变化。本研究旨在得出衡量聚类程度的指标。我们提出了一种检测严重聚类变量和识别异常测量值医生的方法。
我们使用具有固定效应的回归模型来解释患者病例组合,并使用随机聚类截距来研究医生的聚类。我们建议使用残差组内相关系数(RICC),即位于聚类水平的残差方差比例,来检测受聚类影响的变量。RICC 为 0 表示测量值的方差不是由于聚类之间的差异引起的。我们进一步建议,在适当的情况下,将 RICC 与固定效应解释的方差比例 R2 结合起来评估。具有高 R2 的变量可能具有统计学分析中聚类的优势。我们将提出的方法应用于为卵巢肿瘤诊断模型开发而收集的数据集。我们研究了通过超声检查收集的 18 种肿瘤特征、4 种患者特征以及 40 位医生在 2407 位患者上测量的血清标志物 CA-125 的变异性。
RICC 在变量之间变化很大:年龄的 RICC 为 2.2%,而Douglas 袋中液体量的 RICC 为 25.1%。有 7 个变量的 RICC 高于 15%,这表明相当一部分方差是由于医生水平的系统差异,而不是患者水平的随机差异。考虑到超声机质量的差异,一些血流测量的 RICC 有所降低。
我们建议在多中心研究的监测和分析过程中解决数据聚类程度的问题。RICC 是一个有用的工具,它以百分比的形式表示聚类程度。具体应用包括数据质量监测和预测模型开发前的变量筛选。