Ma Xiaoye, Nie Lei, Cole Stephen R, Chu Haitao
Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, MN, USA.
Division of Biometrics IV, Office of Biometrics/OTS/CDER /FDA, Silver Spring, MD, USA.
Stat Methods Med Res. 2016 Aug;25(4):1596-619. doi: 10.1177/0962280213492588. Epub 2013 Jun 26.
In this article, we present an overview and tutorial of statistical methods for meta-analysis of diagnostic tests under two scenarios: (1) when the reference test can be considered a gold standard and (2) when the reference test cannot be considered a gold standard. In the first scenario, we first review the conventional summary receiver operating characteristics approach and a bivariate approach using linear mixed models. Both approaches require direct calculations of study-specific sensitivities and specificities. We next discuss the hierarchical summary receiver operating characteristics curve approach for jointly modeling positivity criteria and accuracy parameters, and the bivariate generalized linear mixed models for jointly modeling sensitivities and specificities. We further discuss the trivariate generalized linear mixed models for jointly modeling prevalence, sensitivities and specificities, which allows us to assess the correlations among the three parameters. These approaches are based on the exact binomial distribution and thus do not require an ad hoc continuity correction. Lastly, we discuss a latent class random effects model for meta-analysis of diagnostic tests when the reference test itself is imperfect for the second scenario. A number of case studies with detailed annotated SAS code in MIXED and NLMIXED procedures are presented to facilitate the implementation of these approaches.
在本文中,我们概述并讲解了两种情况下诊断试验荟萃分析的统计方法:(1)当参考试验可被视为金标准时;(2)当参考试验不能被视为金标准时。在第一种情况下,我们首先回顾传统的汇总接受者操作特征方法和使用线性混合模型的双变量方法。这两种方法都需要直接计算特定研究的敏感性和特异性。接下来,我们讨论用于联合建模阳性标准和准确性参数的分层汇总接受者操作特征曲线方法,以及用于联合建模敏感性和特异性的双变量广义线性混合模型。我们进一步讨论用于联合建模患病率、敏感性和特异性的三变量广义线性混合模型,这使我们能够评估这三个参数之间的相关性。这些方法基于精确的二项分布,因此不需要特别的连续性校正。最后,对于第二种情况,当参考试验本身不完善时,我们讨论用于诊断试验荟萃分析的潜在类别随机效应模型。本文还给出了一些案例研究,并带有MIXED和NLMIXED过程中详细注释的SAS代码,以促进这些方法的实施。