Ghosh Arpita, Wright Fred A, Zou Fei
Public Health Foundation of India, New Delhi, India.
Department of Biostatistics, University of North Carolina at Chapel Hill, North Carolina, USA.
J Am Stat Assoc. 2013;108(502). doi: 10.1080/01621459.2013.793121.
It has been repeatedly shown that in case-control association studies, analysis of a secondary trait which ignores the original sampling scheme can produce highly biased risk estimates. Although a number of approaches have been proposed to properly analyze secondary traits, most approaches fail to reproduce the marginal logistic model assumed for the original case-control trait and/or do not allow for interaction between secondary trait and genotype marker on primary disease risk. In addition, the flexible handling of covariates remains challenging. We present a general retrospective likelihood framework to perform association testing for both binary and continuous secondary traits which respects marginal models and incorporates the interaction term. We provide a computational algorithm, based on a reparameterized approximate profile likelihood, for obtaining the maximum likelihood (ML) estimate and its standard error for the genetic effect on secondary trait, in presence of covariates. For completeness we also present an alternative pseudo-likelihood method for handling covariates. We describe extensive simulations to evaluate the performance of the ML estimator in comparison with the pseudo-likelihood and other competing methods.
反复表明,在病例对照关联研究中,对次要性状进行分析时若忽略原始抽样方案,可能会产生高度有偏的风险估计。尽管已提出多种方法来正确分析次要性状,但大多数方法无法重现为原始病例对照性状假设的边际逻辑模型,和/或不考虑次要性状与基因型标记对原发性疾病风险的相互作用。此外,协变量的灵活处理仍然具有挑战性。我们提出了一个通用的回顾性似然框架,用于对二元和连续次要性状进行关联检验,该框架尊重边际模型并纳入了交互项。我们提供了一种基于重新参数化的近似轮廓似然的计算算法,用于在存在协变量的情况下获得次要性状遗传效应的最大似然(ML)估计及其标准误差。为了完整性,我们还提出了一种处理协变量的替代伪似然方法。我们描述了广泛的模拟,以评估ML估计器与伪似然及其他竞争方法相比的性能。