Suppr超能文献

GWASBrewer:一个用于模拟逼真的全基因组关联研究汇总统计数据的R包。

GWASBrewer: An R Package for Simulating Realistic GWAS Summary Statistics.

作者信息

Morrison Jean

机构信息

Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.

出版信息

Genet Epidemiol. 2025 Jan;49(1):e22594. doi: 10.1002/gepi.22594. Epub 2024 Oct 6.

Abstract

Many statistical genetics analysis methods make use of GWAS summary statistics. Best statistical practice requires evaluating these methods in realistic simulation experiments. However, simulating summary statistics by first simulating individual genotype and phenotype data is extremely computationally demanding. This high cost may force researchers to conduct overly simplistic simulations that fail to accurately measure method performance. Alternatively, summary statistics can be simulated directly from their theoretical distribution. Although this is a common need among statistical genetics researchers, no software packages exist for comprehensive GWAS summary statistic simulation. We present GWASBrewer, an open source R package for direct simulation of GWAS summary statistics. We show that statistics simulated by GWASBrewer have the same distribution as statistics generated from individual level data, and can be produced at a fraction of the computational expense. Additionally, GWASBrewer can simulate standard error estimates, something that is typically not done when sampling summary statistics directly. GWASBrewer is highly flexible, allowing the user to simulate data for multiple traits connected by causal effects and with complex distributions of effect sizes. We demonstrate example uses of GWASBrewer for evaluating Mendelian randomization, polygenic risk score, and heritability estimation methods.

摘要

许多统计遗传学分析方法都利用全基因组关联研究(GWAS)汇总统计数据。最佳统计实践要求在现实的模拟实验中评估这些方法。然而,通过首先模拟个体基因型和表型数据来模拟汇总统计数据在计算上要求极高。这种高成本可能迫使研究人员进行过于简单化的模拟,从而无法准确衡量方法性能。或者,可以直接从其理论分布模拟汇总统计数据。尽管这是统计遗传学研究人员的普遍需求,但目前还没有用于全面GWAS汇总统计模拟的软件包。我们展示了GWASBrewer,这是一个用于直接模拟GWAS汇总统计数据的开源R包。我们表明,由GWASBrewer模拟的统计数据与从个体水平数据生成的统计数据具有相同的分布,并且可以以一小部分计算成本生成。此外,GWASBrewer可以模拟标准误差估计,而直接对汇总统计数据进行采样时通常不会这样做。GWASBrewer具有高度灵活性,允许用户模拟由因果效应连接且效应大小分布复杂的多个性状的数据。我们展示了GWASBrewer在评估孟德尔随机化、多基因风险评分和遗传力估计方法方面的示例用法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/44ab/11656154/18c49faf72db/GEPI-49-0-g002.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验