Department of Epidemiology and Population Health, Albert Einstein College of Medicine, Bronx, NY 10461, USA.
Stat Med. 2012 Sep 20;31(21):2275-89. doi: 10.1002/sim.5371. Epub 2012 Jun 19.
Statistical approaches for estimating and drawing inference on the correlation between two biomarkers that are repeatedly assessed over time and subject to left-censoring because minimum detection levels are lacking. We propose a linear mixed-effects model and estimate the parameters with the Monte Carlo expectation maximization (MCEM) method. Inferences regarding the model parameters and the correlation between the biomarkers are performed by applying Louis's method and the delta method. Simulation studies were conducted to compare the proposed MCEM method with existing methods including the maximum likelihood estimation method, the multiple imputation method, and two widely used ad hoc approaches: replacing the censored values with the detection limit or with half of the detection limit. The results show that the performance of the MCEM with respect to relative bias and coverage probability for the 95% confidence interval is superior to the detection limit and half of the detection limit approaches and exceeds that of the multiple imputation method at medium to high levels of censoring, and the standard error estimates from the MCEM method are close to ideal. The maximum likelihood estimation method can estimate the parameters accurately; however, a nonpositive definite information matrix can occur so that the variances are not estimable. These five methods are illustrated with data from a longitudinal human immunodeficiency virus study to estimate and draw inference on the correlation between human immunodeficiency virus RNA levels measured in plasma and in cervical secretions at multiple time points.
用于估计和推断两个生物标志物之间相关性的统计方法,这些标志物是随着时间的推移而反复评估的,并且由于缺乏最低检测水平而受到左截断。我们提出了一个线性混合效应模型,并使用蒙特卡罗期望最大化(MCEM)方法估计参数。通过应用 Louis 方法和 delta 方法,对模型参数和生物标志物之间的相关性进行推断。进行了模拟研究,以比较提出的 MCEM 方法与现有的方法,包括最大似然估计方法、多重插补方法以及两种广泛使用的特定方法:用检测限或检测限的一半替换截断值。结果表明,对于 95%置信区间的相对偏差和覆盖率概率,MCEM 的性能优于检测限和检测限一半的方法,并且在中高截断水平下超过了多重插补方法,并且 MCEM 方法的标准误差估计接近理想。最大似然估计方法可以准确估计参数;但是,可能会出现非正定信息矩阵,从而无法估计方差。这些方法在来自纵向人类免疫缺陷病毒研究的数据中得到了说明,以估计和推断在血浆和宫颈分泌物中多次测量的人类免疫缺陷病毒 RNA 水平之间的相关性。