Morrison Jean
Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.
Genet Epidemiol. 2025 Jan;49(1):e22594. doi: 10.1002/gepi.22594. Epub 2024 Oct 6.
Many statistical genetics analysis methods make use of GWAS summary statistics. Best statistical practice requires evaluating these methods in realistic simulation experiments. However, simulating summary statistics by first simulating individual genotype and phenotype data is extremely computationally demanding. This high cost may force researchers to conduct overly simplistic simulations that fail to accurately measure method performance. Alternatively, summary statistics can be simulated directly from their theoretical distribution. Although this is a common need among statistical genetics researchers, no software packages exist for comprehensive GWAS summary statistic simulation. We present GWASBrewer, an open source R package for direct simulation of GWAS summary statistics. We show that statistics simulated by GWASBrewer have the same distribution as statistics generated from individual level data, and can be produced at a fraction of the computational expense. Additionally, GWASBrewer can simulate standard error estimates, something that is typically not done when sampling summary statistics directly. GWASBrewer is highly flexible, allowing the user to simulate data for multiple traits connected by causal effects and with complex distributions of effect sizes. We demonstrate example uses of GWASBrewer for evaluating Mendelian randomization, polygenic risk score, and heritability estimation methods.
许多统计遗传学分析方法都利用全基因组关联研究(GWAS)汇总统计数据。最佳统计实践要求在现实的模拟实验中评估这些方法。然而,通过首先模拟个体基因型和表型数据来模拟汇总统计数据在计算上要求极高。这种高成本可能迫使研究人员进行过于简单化的模拟,从而无法准确衡量方法性能。或者,可以直接从其理论分布模拟汇总统计数据。尽管这是统计遗传学研究人员的普遍需求,但目前还没有用于全面GWAS汇总统计模拟的软件包。我们展示了GWASBrewer,这是一个用于直接模拟GWAS汇总统计数据的开源R包。我们表明,由GWASBrewer模拟的统计数据与从个体水平数据生成的统计数据具有相同的分布,并且可以以一小部分计算成本生成。此外,GWASBrewer可以模拟标准误差估计,而直接对汇总统计数据进行采样时通常不会这样做。GWASBrewer具有高度灵活性,允许用户模拟由因果效应连接且效应大小分布复杂的多个性状的数据。我们展示了GWASBrewer在评估孟德尔随机化、多基因风险评分和遗传力估计方法方面的示例用法。