Department of Epidemiology and Biostatistics, School of Public Health, Indiana University Bloomington, Bloomington, IN 47401, USA.
Department of Biostatistics and Health Data Science, School of Medicine, Indiana University, Indianapolis, IN 46202, USA.
Genes (Basel). 2021 Oct 28;12(11):1723. doi: 10.3390/genes12111723.
The additive genetic model as implemented in logistic regression has been widely used in genome-wide association studies (GWASs) for binary outcomes. Unfortunately, for many complex diseases, the underlying genetic models are generally unknown and a mis-specification of the genetic model can result in a substantial loss of power. To address this issue, the MAX3 test (the maximum of three separate test statistics) has been proposed as a robust test that performs plausibly regardless of the underlying genetic model. However, the original implementation of MAX3 utilizes the trend test so it cannot adjust for any covariates such as age and gender. This drawback has significantly limited the application of the MAX3 in GWASs, as covariates account for a considerable amount of variability in these disorders. In this paper, we extended the MAX3 and proposed the CMAX3 (covariate-adjusted MAX3) based on logistic regression. The proposed test yielded a similar robust efficiency as the original MAX3 while easily adjusting for any covariate based on the likelihood framework. The asymptotic formula to calculate the -value of the proposed test was also developed in this paper. The simulation results showed that the proposed test performed desirably under both the null and alternative hypotheses. For the purpose of illustration, we applied the proposed test to re-analyze a case-control GWAS dataset from the Collaborative Studies on Genetics of Alcoholism (COGA). The R code to implement the proposed test is also introduced in this paper and is available for free download.
逻辑回归中实现的加性遗传模型已广泛应用于二元结局的全基因组关联研究(GWAS)中。不幸的是,对于许多复杂疾病,潜在的遗传模型通常是未知的,遗传模型的错误指定可能会导致大量的效力损失。为了解决这个问题,已经提出了 MAX3 检验(三个单独检验统计量的最大值)作为一种稳健的检验方法,无论潜在的遗传模型如何,它的表现都似乎是合理的。然而,MAX3 的原始实现利用了趋势检验,因此它不能调整任何协变量,如年龄和性别。这一缺点极大地限制了 MAX3 在 GWAS 中的应用,因为协变量在这些疾病中占相当大的变异量。在本文中,我们扩展了 MAX3,并提出了基于逻辑回归的 CMAX3(协变量调整的 MAX3)。与原始的 MAX3 相比,所提出的检验在容易调整任何协变量的同时,产生了类似的稳健效率,这是基于似然框架实现的。本文还开发了计算提议检验的 -值的渐近公式。模拟结果表明,在零假设和备择假设下,所提出的检验都表现良好。为了说明目的,我们将所提出的检验应用于重新分析来自酒精中毒遗传合作研究(COGA)的病例对照 GWAS 数据集。本文还介绍了实现所提出的检验的 R 代码,并可免费下载。