Cheng En, Ozsoyoglu Z Meral
Computer Science Department, The University of Akron, Akron, OH 44325, USA.
Electrical Engineering and Computer Science Department, Case Western Reserve University, 10900 Euclid Avenue, Cleveland, OH 44106, USA.
Comput Math Methods Med. 2014;2014:898424. doi: 10.1155/2014/898424. Epub 2014 Jul 21.
An important computation on pedigree data is the calculation of condensed identity coefficients, which provide a complete description of the degree of relatedness of two individuals. The applications of condensed identity coefficients range from genetic counseling to disease tracking. Condensed identity coefficients can be computed using linear combinations of generalized kinship coefficients for two, three, four individuals, and two pairs of individuals and there are recursive formulas for computing those generalized kinship coefficients (Karigl, 1981). Path-counting formulas have been proposed for the (generalized) kinship coefficients for two (three) individuals but there have been no path-counting formulas for the other generalized kinship coefficients. It has also been shown that the computation of the (generalized) kinship coefficients for two (three) individuals using path-counting formulas is efficient for large pedigrees, together with path encoding schemes tailored for pedigree graphs. In this paper, we propose a framework for deriving path-counting formulas for generalized kinship coefficients. Then, we present the path-counting formulas for all generalized kinship coefficients for which there are recursive formulas and which are sufficient for computing condensed identity coefficients. We also perform experiments to compare the efficiency of our method with the recursive method for computing condensed identity coefficients on large pedigrees.
系谱数据的一项重要计算是凝聚性同宗系数的计算,它能完整描述两个个体的亲缘程度。凝聚性同宗系数的应用范围从遗传咨询到疾病追踪。凝聚性同宗系数可通过两个、三个、四个个体以及两对个体的广义亲缘系数的线性组合来计算,并且存在用于计算这些广义亲缘系数的递归公式(卡里格尔,1981年)。已经有人提出了针对两个(三个)个体的(广义)亲缘系数的路径计数公式,但对于其他广义亲缘系数却没有路径计数公式。研究还表明,使用路径计数公式计算两个(三个)个体的(广义)亲缘系数对于大型系谱来说是高效的,同时还有为系谱图量身定制的路径编码方案。在本文中,我们提出了一个推导广义亲缘系数路径计数公式的框架。然后,我们给出了所有具有递归公式且足以计算凝聚性同宗系数的广义亲缘系数的路径计数公式。我们还进行了实验,以比较我们的方法与在大型系谱上计算凝聚性同宗系数的递归方法的效率。