Hanein Sylvain, Perrault Isabelle, Gerber Sylvie, Delphin Nathalie, Benezra David, Shalev Stavit, Carmi Rivka, Feingold Josué, Dufier Jean-Louis, Munnich Arnold, Kaplan Josseline, Rozet Jean-Michel, Jeanpierre Marc
Unité de Recherches sur les Handicaps Génétiques de l'Enfant. Hôpital Necker - Enfants Malades, Paris, France.
Eur J Hum Genet. 2008 Jan;16(1):115-23. doi: 10.1038/sj.ejhg.5201905. Epub 2007 Aug 8.
The mosaic pattern of haplotypes observed around a single mutation results from one or several founder events. The difficulties involved in calculating the age of the variant are greatly reduced by assuming a single event, but this simplification may bias analysis of the genealogy of the mutation. However, if it is assumed that more than one founder event occurred, the number of genealogies is very large and the likelihood of every possible tree could not be realistically calculated. A multipoint approach is required, given the number of independent variables needed to describe a complex bifurcating genealogy. Starting from the observation that a limited number of parameters is needed for calculation of the simplest models of bifurcating genealogies, we show that the probability density of a two-ancestor model genealogy can be simply described as an algebraic function in a closed form, two coalescence times being calculated simultaneously without compromising accuracy. Implementation in a Bayesian framework is facilitated by the simplicity of the function, which describes the reciprocal relationship between the region of complete linkage disequilibrium and the branch length of the tree. We illustrate the use of haplotype information about allele-sharing decay around a mutation as a genetic clock, using data for two GUCY2D mutations in Mediterranean populations.
在单个突变周围观察到的单倍型镶嵌模式是由一次或几次奠基者事件导致的。假设为单一事件可大大减少计算该变异年龄所涉及的困难,但这种简化可能会使对该突变谱系的分析产生偏差。然而,如果假设发生了不止一次奠基者事件,那么谱系数量会非常庞大,几乎不可能实际计算出每一棵可能树的似然性。鉴于描述复杂分支谱系所需的独立变量数量,需要采用多点法。从观察到计算最简单的分支谱系模型所需参数数量有限这一点出发,我们表明双祖先模型谱系的概率密度可以简单地用一个封闭形式的代数函数来描述,两个合并时间可以同时计算且不影响准确性。该函数的简单性便于在贝叶斯框架中实现,它描述了完全连锁不平衡区域与树的分支长度之间的倒数关系。我们利用地中海人群中两个GUCY2D突变的数据,说明了将围绕突变的等位基因共享衰减的单倍型信息用作遗传时钟的用途。