Department of Biostatistics and Medical Informatics, University of Wisconsin, Madison, Wisconsin 53706, USA.
Genetics. 2012 Feb;190(2):403-12. doi: 10.1534/genetics.111.132647.
The mouse Collaborative Cross (CC) is a panel of eight-way recombinant inbred lines: eight diverse parental strains are intermated, followed by repeated sibling mating, many times in parallel, to create a new set of inbred lines whose genomes are random mosaics of the genomes of the original eight strains. Many generations are required to reach inbreeding, and so a number of investigators have sought to make use of phenotype and genotype data on mice from intermediate generations during the formation of the CC lines (so-called pre-CC mice). The development of a hidden Markov model for genotype reconstruction in such pre-CC mice, on the basis of incompletely informative genetic markers (such as single-nucleotide polymorphisms), formally requires the two-locus genotype probabilities at an arbitrary generation along the path to inbreeding. In this article, I describe my efforts to calculate such probabilities. While closed-form solutions for the two-locus genotype probabilities could not be derived, I provide a prescription for calculating such probabilities numerically. In addition, I present a number of useful quantities, including single-locus genotype probabilities, two-locus haplotype probabilities, and the fixation probability and map expansion at each generation along the course to inbreeding.
小鼠共杂交(CC)是一个由 8 种重组近交系组成的面板:8 种不同的亲本品系进行杂交,然后进行多次平行的同胞交配,以创建一组新的近交系,其基因组是原始 8 个品系基因组的随机镶嵌体。需要经过多代才能达到近交,因此许多研究人员试图利用 CC 系形成过程中的中间代(所谓的预 CC 小鼠)的表型和基因型数据。在这种预 CC 小鼠中,基于不完全信息遗传标记(如单核苷酸多态性),使用隐马尔可夫模型进行基因型重构,正式要求在近亲繁殖路径上的任意一代具有两个基因座的基因型概率。在本文中,我描述了我计算这些概率的努力。虽然无法推导出两基因座基因型概率的闭式解,但我提供了一种数值计算此类概率的方法。此外,我还提出了一些有用的数量,包括单基因座基因型概率、双基因座单倍型概率以及在近亲繁殖过程中每一代的固定概率和图谱扩展。