Zhu Lirong, Zhang Shuanglin, Sha Qiuying
Department of Mathematical Sciences, Michigan Technological University, Houghton, MI, United States.
Front Genet. 2024 Sep 5;15:1359591. doi: 10.3389/fgene.2024.1359591. eCollection 2024.
Genome-wide association studies (GWAS) have emerged as popular tools for identifying genetic variants that are associated with complex diseases. Standard analysis of a GWAS involves assessing the association between each variant and a disease. However, this approach suffers from limited reproducibility and difficulties in detecting multi-variant and pleiotropic effects. Although joint analysis of multiple phenotypes for GWAS can identify and interpret pleiotropic loci which are essential to understand pleiotropy in diseases and complex traits, most of the multiple phenotype association tests are designed for a single variant, resulting in much lower power, especially when their effect sizes are small and only their cumulative effect is associated with multiple phenotypes. To overcome these limitations, set-based multiple phenotype association tests have been developed to enhance statistical power and facilitate the identification and interpretation of pleiotropic regions. In this research, we propose a new method, named Meta-TOW-S, which conducts joint association tests between multiple phenotypes and a set of variants (such as variants in a gene) utilizing GWAS summary statistics from different cohorts. Our approach applies the set-based method that Tests for the effect of an Optimal Weighted combination of variants in a gene (TOW) and accounts for sample size differences across GWAS cohorts by employing the Cauchy combination method. Meta-TOW-S combines the advantages of set-based tests and multi-phenotype association tests, exhibiting computational efficiency and enabling analysis across multiple phenotypes while accommodating overlapping samples from different GWAS cohorts. To assess the performance of Meta-TOW-S, we develop a phenotype simulator package that encompasses a comprehensive simulation scheme capable of modeling multiple phenotypes and multiple variants, including noise structures and diverse correlation patterns among phenotypes. Simulation studies validate that Meta-TOW-S maintains a desirable Type I error rate. Further simulation under different scenarios shows that Meta-TOW-S can improve power compared with other existing meta-analysis methods. When applied to four psychiatric disorders summary data, Meta-TOW-S detects a greater number of significant genes.
全基因组关联研究(GWAS)已成为识别与复杂疾病相关的基因变异的常用工具。GWAS的标准分析包括评估每个变异与一种疾病之间的关联。然而,这种方法存在可重复性有限以及检测多变异和多效性效应困难的问题。尽管对GWAS的多个表型进行联合分析可以识别和解释对理解疾病和复杂性状中的多效性至关重要的多效性位点,但大多数多表型关联测试是针对单个变异设计的,导致功效低得多,特别是当它们的效应大小较小时,并且只有它们的累积效应与多个表型相关。为了克服这些限制,已经开发了基于集合的多表型关联测试,以提高统计功效并促进多效性区域的识别和解释。在本研究中,我们提出了一种名为Meta-TOW-S的新方法,该方法利用来自不同队列的GWAS汇总统计数据,对多个表型和一组变异(例如基因中的变异)进行联合关联测试。我们的方法应用基于集合的方法,即测试基因中变异的最佳加权组合的效应(TOW),并通过采用柯西组合方法来考虑GWAS队列之间的样本量差异。Meta-TOW-S结合了基于集合的测试和多表型关联测试的优点,具有计算效率,能够跨多个表型进行分析,同时适应来自不同GWAS队列的重叠样本。为了评估Meta-TOW-S的性能,我们开发了一个表型模拟器包,该包包含一个全面的模拟方案,能够对多个表型和多个变异进行建模,包括噪声结构和表型之间的各种相关模式。模拟研究验证了Meta-TOW-S保持了理想的I型错误率。在不同场景下的进一步模拟表明,与其他现有的荟萃分析方法相比,Meta-TOW-S可以提高功效。当应用于四种精神疾病的汇总数据时,Meta-TOW-S检测到更多的显著基因。