Tang Li, Lyles Robert H, King Caroline C, Hogan Joseph W, Lo Yungtai
Department of Biostatistics, St. Jude Children's Research Hospital, Memphis, Tennessee 38105, U.S.A.
Department of Biostatistics and Bioinformatics, Rollins School of Public Health of Emory University, Atlanta, Georgia 30322, U.S.A.
J R Stat Soc Ser C Appl Stat. 2015 Apr;64(3):433-449. doi: 10.1111/rssc.12081.
In many epidemiological and clinical studies, misclassification may arise in one or several variables, resulting in potentially invalid analytic results (e.g., estimates of odds ratios of interest) when no correction is made. Here we consider the situation in which correlated binary response variables are subject to misclassification. Building upon prior work, we provide an approach to adjust for potentially complex differential misclassification via internal validation sampling applied at multiple study time points. We seek to estimate the parameters of a primary generalized linear mixed model (GLMM) that accounts for baseline and/or time-dependent covariates. The misclassification process is modeled via a second generalized linear model that captures variations in sensitivity and specificity parameters according to time and a set of subject-specific covariates that may or may not overlap with those in the primary model. Simulation studies demonstrate the precision and validity of the proposed method. An application is presented based on longitudinal assessments of bacterial vaginosis conducted in the HIV Epidemiology Research (HER) Study.
在许多流行病学和临床研究中,一个或多个变量可能会出现错误分类,若不进行校正,可能会导致分析结果无效(例如,感兴趣的比值比估计值)。在此,我们考虑相关二元反应变量存在错误分类的情况。基于先前的工作,我们提供了一种方法,通过在多个研究时间点应用内部验证抽样来调整潜在的复杂差异错误分类。我们试图估计一个主要的广义线性混合模型(GLMM)的参数,该模型考虑了基线和/或随时间变化的协变量。错误分类过程通过第二个广义线性模型进行建模,该模型根据时间以及一组可能与主要模型中的协变量重叠或不重叠的个体特定协变量来捕捉灵敏度和特异度参数的变化。模拟研究证明了所提方法的精度和有效性。基于在HIV流行病学研究(HER)中对细菌性阴道病进行的纵向评估给出了一个应用实例。