Molecular and Cellular Biology Program, University of Washington, Seattle, WA 98195, USA.
Mamm Genome. 2010 Apr;21(3-4):143-52. doi: 10.1007/s00335-010-9249-7. Epub 2010 Feb 5.
The remarkable success in mapping genes linked to a number of disease traits using genome-wide association studies (GWAS) in human cohorts has renewed interest in applying this same technique in model organisms such as inbred laboratory mice. Unlike humans, however, the limited genetic diversity in the ancestry of laboratory mice combined with selection pressure over the past decades have yielded an intricate population genetic structure that can complicate the results obtained from association studies. This problem is further exacerbated by the small number of strains typically used in such studies where multiple spurious associations arise as a result of random chance. We sought to empirically assess the viability of GWAS in inbred mice using hundreds of expression traits for which the true location of the expression quantitative trait locus was known a priori. We then measured transcript abundance levels for these expression traits in 16 classical and 3 wild-derived inbred strains and carried out a genome-wide association scan, demonstrating the low statistical power of such studies and empirically estimating the large extent to which allelic association of transcripts gives rise to spurious associations. We provide evidence illustrating that in a large fraction of cases, the marker with the most significant p values fails to map to the location of the true eQTL. Finally, we provide experimental support for hundreds of traits, and that combining linkage analysis with association mapping provides significant increases in statistical power over a stand-alone GWAS as well as significantly higher mapping resolution than either study alone.
全基因组关联研究(GWAS)在人类群体中成功地绘制了与许多疾病特征相关的基因图谱,这重新激发了人们在模式生物(如近交系实验小鼠)中应用这一技术的兴趣。然而,与人类不同的是,实验小鼠的遗传多样性有限,加上过去几十年的选择压力,导致了复杂的群体遗传结构,这可能会使关联研究的结果变得复杂。这个问题在使用此类研究中常用的少数几个品系时进一步加剧,因为随机机会导致多个虚假关联出现。我们试图使用数百个表达性状来实证评估近交系小鼠中 GWAS 的可行性,这些性状的表达数量性状基因座的真实位置是事先知道的。然后,我们测量了这些表达性状在 16 个经典近交系和 3 个野生衍生近交系中的转录丰度水平,并进行了全基因组关联扫描,证明了这些研究的统计效力很低,并实证估计了等位基因关联转录本导致虚假关联的程度。我们提供的证据表明,在很大一部分情况下,具有最显著 p 值的标记未能映射到真实 eQTL 的位置。最后,我们为数百个性状提供了实验支持,并表明将连锁分析与关联作图相结合,相对于单独进行 GWAS 可以显著提高统计效力,并且比单独进行任何一种研究都具有更高的作图分辨率。