Van der Sluis Sophie, Dolan Conor V, Li Jiang, Song Youqiang, Sham Pak, Posthuma Danielle, Li Miao-Xin
Department of Complex Trait Genetics, Section Clinical Genetics, Center for Neurogenomics and Cognitive Research (CNCR), VU Medical Center, Amsterdam, The Netherlands, Department of Biological Psychology, VU University Amsterdam, Amsterdam, The Netherlands,Department of Biochemistry, State Key Laboratory for Cognitive and Brain Sciences, The Centre for Reproduction, Development and Growth, The Centre for Genomic Sciences and Department of Psychiatry, The University of Hong Kong, Pokfulam, Hong Kong and Department of Complex Trait Genetics, Center for Neurogenomics and Cognitive Research (CNCR), VU University Amsterdam, Amsterdam, The Netherlands.
Department of Complex Trait Genetics, Section Clinical Genetics, Center for Neurogenomics and Cognitive Research (CNCR), VU Medical Center, Amsterdam, The Netherlands, Department of Biological Psychology, VU University Amsterdam, Amsterdam, The Netherlands,Department of Biochemistry, State Key Laboratory for Cognitive and Brain Sciences, The Centre for Reproduction, Development and Growth, The Centre for Genomic Sciences and Department of Psychiatry, The University of Hong Kong, Pokfulam, Hong Kong and Department of Complex Trait Genetics, Center for Neurogenomics and Cognitive Research (CNCR), VU University Amsterdam, Amsterdam, The Netherlands Department of Complex Trait Genetics, Section Clinical Genetics, Center for Neurogenomics and Cognitive Research (CNCR), VU Medical Center, Amsterdam, The Netherlands, Department of Biological Psychology, VU University Amsterdam, Amsterdam, The Netherlands,Department of Biochemistry, State Key Laboratory for Cognitive and Brain Sciences, The Centre for Reproduction, Development and Growth, The Centre for Genomic Sciences and Department of Psychiatry, The University of Hong Kong, Pokfulam, Hong Kong and Department of Complex Trait Genetics, Center for Neurogenomics and Cognitive Research (CNCR), VU University Amsterdam, Amsterdam, The Netherlands Department of Complex Trait Genetics, Section Clinical Genetics, Center for Neurogenomics and Cognitive Research (CNCR), VU Medical Center, Amsterdam, The Netherlands, Department of Biological Psychology, VU University Amsterdam, Amsterdam, The Netherlands,Department of Biochemistry, State Key Laboratory for Cognitive and Brain Sciences, The Centre for Reproduction, Development and Growth, The Centre for Genomic Sciences and Department of Psychiatry, The University of Hong Kong, Pokfulam, Hong Kong and Department of Complex Trait Genetics, Center for Neurogenomics and Cognitive Research (CNCR), VU University Amsterdam, Amsterdam, The Netherlands Department of Complex Trait Genetics,
Bioinformatics. 2015 Apr 1;31(7):1007-15. doi: 10.1093/bioinformatics/btu783. Epub 2014 Nov 26.
Standard genome-wide association studies, testing the association between one phenotype and a large number of single nucleotide polymorphisms (SNPs), are limited in two ways: (i) traits are often multivariate, and analysis of composite scores entails loss in statistical power and (ii) gene-based analyses may be preferred, e.g. to decrease the multiple testing problem.
Here we present a new method, multivariate gene-based association test by extended Simes procedure (MGAS), that allows gene-based testing of multivariate phenotypes in unrelated individuals. Through extensive simulation, we show that under most trait-generating genotype-phenotype models MGAS has superior statistical power to detect associated genes compared with gene-based analyses of univariate phenotypic composite scores (i.e. GATES, multiple regression), and multivariate analysis of variance (MANOVA). Re-analysis of metabolic data revealed 32 False Discovery Rate controlled genome-wide significant genes, and 12 regions harboring multiple genes; of these 44 regions, 30 were not reported in the original analysis.
MGAS allows researchers to conduct their multivariate gene-based analyses efficiently, and without the loss of power that is often associated with an incorrectly specified genotype-phenotype models.
MGAS is freely available in KGG v3.0 (http://statgenpro.psychiatry.hku.hk/limx/kgg/download.php). Access to the metabolic dataset can be requested at dbGaP (https://dbgap.ncbi.nlm.nih.gov/). The R-simulation code is available from http://ctglab.nl/people/sophie_van_der_sluis.
Supplementary data are available at Bioinformatics online.
标准的全基因组关联研究通过测试一种表型与大量单核苷酸多态性(SNP)之间的关联来进行,但存在两方面的局限性:(i)性状通常是多变量的,对综合得分的分析会导致统计功效的损失;(ii)基于基因的分析可能更受青睐,例如为了减少多重检验问题。
在此,我们提出了一种新方法——基于扩展西姆斯程序的多变量基因关联检验(MGAS),该方法允许对无关个体的多变量表型进行基于基因的检验。通过广泛的模拟,我们表明,在大多数性状产生的基因型 - 表型模型下,与基于单变量表型综合得分的基因分析(即GATES、多元回归)以及多变量方差分析(MANOVA)相比,MGAS在检测相关基因方面具有更高的统计功效。对代谢数据的重新分析揭示了32个经错误发现率控制的全基因组显著基因,以及12个包含多个基因的区域;在这44个区域中,30个在原始分析中未被报道。
MGAS使研究人员能够有效地进行基于多变量基因的分析,且不会因基因型 - 表型模型指定错误而导致功效损失。
补充数据可在《生物信息学》在线版获取。