Ray Debashree, Pankow James S, Basu Saonli
Division of Biostatistics, School of Public Health, University of Minnesota, Minnesota, United States of America.
Division of Epidemiology and Community Health, School of Public Health, University of Minnesota, Minnesota, United States of America.
Genet Epidemiol. 2016 Jan;40(1):20-34. doi: 10.1002/gepi.21937. Epub 2015 Dec 7.
Genome-wide association studies (GWASs) for complex diseases often collect data on multiple correlated endo-phenotypes. Multivariate analysis of these correlated phenotypes can improve the power to detect genetic variants. Multivariate analysis of variance (MANOVA) can perform such association analysis at a GWAS level, but the behavior of MANOVA under different trait models has not been carefully investigated. In this paper, we show that MANOVA is generally very powerful for detecting association but there are situations, such as when a genetic variant is associated with all the traits, where MANOVA may not have any detection power. In these situations, marginal model based methods, however, perform much better than multivariate methods. We investigate the behavior of MANOVA, both theoretically and using simulations, and derive the conditions where MANOVA loses power. Based on our findings, we propose a unified score-based test statistic USAT that can perform better than MANOVA in such situations and nearly as well as MANOVA elsewhere. Our proposed test reports an approximate asymptotic P-value for association and is computationally very efficient to implement at a GWAS level. We have studied through extensive simulations the performance of USAT, MANOVA, and other existing approaches and demonstrated the advantage of using the USAT approach to detect association between a genetic variant and multivariate phenotypes. We applied USAT to data from three correlated traits collected on 5, 816 Caucasian individuals from the Atherosclerosis Risk in Communities (ARIC, The ARIC Investigators []) Study and detected some interesting associations.
复杂疾病的全基因组关联研究(GWAS)通常会收集多个相关内表型的数据。对这些相关表型进行多变量分析可以提高检测遗传变异的效能。多变量方差分析(MANOVA)可以在GWAS层面进行此类关联分析,但尚未仔细研究MANOVA在不同性状模型下的表现。在本文中,我们表明MANOVA通常在检测关联方面非常有效,但存在一些情况,例如当一个遗传变异与所有性状都相关时,MANOVA可能没有任何检测效能。然而,在这些情况下,基于边际模型的方法比多变量方法表现得更好。我们从理论和模拟两方面研究了MANOVA的表现,并推导了MANOVA失去效能的条件。基于我们的发现,我们提出了一种基于分数的统一检验统计量USAT,在这种情况下它比MANOVA表现更好,在其他情况下与MANOVA表现相近。我们提出的检验报告了关联的近似渐近P值,并且在GWAS层面计算实现非常高效。我们通过广泛的模拟研究了USAT、MANOVA和其他现有方法的性能,并证明了使用USAT方法检测遗传变异与多变量表型之间关联的优势。我们将USAT应用于从社区动脉粥样硬化风险(ARIC,ARIC研究调查组)研究中5816名白种人个体收集的三个相关性状的数据,并检测到了一些有趣的关联。