Gatti Daniel M, Svenson Karen L, Shabalin Andrey, Wu Long-Yang, Valdar William, Simecek Petr, Goodwin Neal, Cheng Riyan, Pomp Daniel, Palmer Abraham, Chesler Elissa J, Broman Karl W, Churchill Gary A
The Jackson Laboratory, Bar Harbor, Maine 04609.
Medical College of Virginia of Virginia Commonwealth University, Richmond, Virginia 23298.
G3 (Bethesda). 2014 Sep 18;4(9):1623-33. doi: 10.1534/g3.114.013748.
Genetic mapping studies in the mouse and other model organisms are used to search for genes underlying complex phenotypes. Traditional genetic mapping studies that employ single-generation crosses have poor mapping resolution and limit discovery to loci that are polymorphic between the two parental strains. Multiparent outbreeding populations address these shortcomings by increasing the density of recombination events and introducing allelic variants from multiple founder strains. However, multiparent crosses present new analytical challenges and require specialized software to take full advantage of these benefits. Each animal in an outbreeding population is genetically unique and must be genotyped using a high-density marker set; regression models for mapping must accommodate multiple founder alleles, and complex breeding designs give rise to polygenic covariance among related animals that must be accounted for in mapping analysis. The Diversity Outbred (DO) mice combine the genetic diversity of eight founder strains in a multigenerational breeding design that has been maintained for >16 generations. The large population size and randomized mating ensure the long-term genetic stability of this population. We present a complete analytical pipeline for genetic mapping in DO mice, including algorithms for probabilistic reconstruction of founder haplotypes from genotyping array intensity data, and mapping methods that accommodate multiple founder haplotypes and account for relatedness among animals. Power analysis suggests that studies with as few as 200 DO mice can detect loci with large effects, but loci that account for <5% of trait variance may require a sample size of up to 1000 animals. The methods described here are implemented in the freely available R package DOQTL.
在小鼠和其他模式生物中进行的遗传图谱研究用于寻找复杂表型背后的基因。采用单代杂交的传统遗传图谱研究具有较差的图谱分辨率,并且将发现局限于两个亲本品系之间具有多态性的位点。多亲本远交群体通过增加重组事件的密度并引入来自多个奠基者品系的等位基因变异来解决这些缺点。然而,多亲本杂交带来了新的分析挑战,需要专门的软件来充分利用这些优势。远交群体中的每只动物在遗传上都是独特的,必须使用高密度标记集进行基因分型;用于图谱绘制的回归模型必须适应多个奠基者等位基因,复杂的育种设计会导致相关动物之间产生多基因协方差,这在图谱分析中必须加以考虑。多样性远交(DO)小鼠在一个已经维持了超过16代的多代育种设计中结合了八个奠基者品系的遗传多样性。庞大的种群规模和随机交配确保了该群体的长期遗传稳定性。我们提出了一个用于DO小鼠遗传图谱绘制的完整分析流程,包括从基因分型阵列强度数据中概率重建奠基者单倍型的算法,以及适应多个奠基者单倍型并考虑动物之间亲缘关系的图谱绘制方法。功效分析表明,使用少至200只DO小鼠的研究可以检测到具有大效应的位点,但解释性状变异小于5%的位点可能需要多达1000只动物的样本量。这里描述的方法在免费可用的R包DOQTL中实现。