Zhang Jinghui, Hunter Kent W, Gandolph Michael, Rowe William L, Finney Richard P, Kelley Jenny M, Edmonson Michael, Buetow Kenneth H
Laboratory of Population Genetics, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892-8302, USA.
Genome Res. 2005 Feb;15(2):241-9. doi: 10.1101/gr.2901705.
Understanding of the structure and the origin of genetic variation patterns in the laboratory inbred mouse provides insight into the utility of the mouse model for studying human complex diseases and strategies for disease gene mapping. In order to address this issue, we have constructed a multistrain, high-resolution haplotype map for the 99-Mb mouse Chromosome 16 using approximately 70,000 single nucleotide polymorphism (SNP) markers derived from whole-genome shotgun sequencing of five laboratory inbred strains. We discovered that large polymorphic blocks (i.e., regions where only two haplotypes, thus one SNP conformation, are found in the five strains), large monomorphic blocks (i.e., regions where the five strains share the same haplotype), and fragmented blocks (i.e., regions of greater complexity not resembling at all the first two categories) span 50%, 18%, and 32% of the chromosome, respectively. The haplotype map has 98% accuracy in predicting mouse genotypes in two other studies. Its predictions are also confirmed by experimental results obtained from resequencing of 40-kb genomic sequences at 21 distinct genomic loci in 13 laboratory inbred strains and 12 wild-derived strains. We demonstrate that historic recombination, intra-subspecies variations and inter-subspecies variations have all contributed to the formation of the three distinctive genetic signatures. The results suggest that the controlled complexity of the laboratory inbred strains may provide a means for uncovering the biological factors that have shaped genetic variation patterns.
了解实验室近交系小鼠遗传变异模式的结构和起源,有助于深入理解小鼠模型在研究人类复杂疾病中的效用以及疾病基因定位策略。为了解决这一问题,我们利用从五个实验室近交系全基因组鸟枪法测序中获得的约70,000个单核苷酸多态性(SNP)标记,构建了99兆碱基的小鼠16号染色体的多品系、高分辨率单倍型图谱。我们发现,大的多态性区域(即五个品系中仅发现两种单倍型,也就是一种SNP构象的区域)、大的单态性区域(即五个品系共享相同单倍型的区域)和片段化区域(即复杂度更高、与前两类完全不同的区域)分别占该染色体的50%、18%和32%。在另外两项研究中,该单倍型图谱预测小鼠基因型的准确率达98%。对13个实验室近交系和12个野生来源品系中21个不同基因组位点的40千碱基基因组序列进行重测序所获得的实验结果,也证实了其预测。我们证明,历史重组、亚种内变异和亚种间变异都对这三种独特的遗传特征的形成有所贡献。结果表明,实验室近交系所具有的可控复杂性,可能为揭示塑造遗传变异模式的生物学因素提供一种手段。