University of California, Berkeley - University of California, San Francisco Graduate Program in Bioengineering, San Francisco, California.
Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, California.
Genet Epidemiol. 2020 Jan;44(1):90-103. doi: 10.1002/gepi.22264. Epub 2019 Oct 6.
While it is well established that genetics can be a major contributor to population variation of complex traits, the relative contributions of rare and common variants to phenotypic variation remains a matter of considerable debate. Here, we simulate genetic and phenotypic data across different case/control panel sampling strategies, sequencing methods, and genetic architecture models based on evolutionary forces to determine the statistical performance of rare variant association tests (RVATs) widely in use. We find that the highest statistical power of RVATs is achieved by sampling case/control individuals from the extremes of an underlying quantitative trait distribution. We also demonstrate that the use of genotyping arrays, in conjunction with imputation from a whole-genome sequenced (WGS) reference panel, recovers the vast majority (90%) of the power that could be achieved by sequencing the case/control panel using current tools. Finally, we show that for dichotomous traits, the statistical performance of RVATs decreases as rare variants become more important in the trait architecture. Our results extend previous work to show that RVATs are insufficiently powered to make generalizable conclusions about the role of rare variants in dichotomous complex traits.
虽然遗传因素是导致复杂性状群体变异的主要因素已得到广泛证实,但稀有变异和常见变异对表型变异的相对贡献仍然存在相当大的争议。在这里,我们基于进化压力模拟了不同病例/对照小组采样策略、测序方法和遗传结构模型的遗传和表型数据,以确定广泛使用的稀有变异关联测试 (RVAT) 的统计性能。我们发现,通过从潜在数量性状分布的极端值中抽样病例/对照个体,RVAT 可实现最高的统计功效。我们还证明,使用基因分型阵列并结合全基因组测序 (WGS) 参考面板的推断,可以恢复当前工具用于测序病例/对照面板所能实现的绝大多数(90%)功效。最后,我们表明,对于二项性状,随着稀有变异在性状结构中变得更加重要,RVAT 的统计性能会降低。我们的结果扩展了先前的工作,表明 RVAT 不足以得出关于稀有变异在二项复杂性状中的作用的普遍结论。