Albers Cornelis A, Leisink Martijn A R, Kappen Hilbert J
Department of Medical Physics and Biophysics, Radboud University, Nijmegen, The Netherlands.
BMC Bioinformatics. 2006 Mar 20;7 Suppl 1(Suppl 1):S1. doi: 10.1186/1471-2105-7-S1-S1.
Computing exact multipoint LOD scores for extended pedigrees rapidly becomes infeasible as the number of markers and untyped individuals increase. When markers are excluded from the computation, significant power may be lost. Therefore accurate approximate methods which take into account all markers are desirable.
We present a novel method for efficient estimation of LOD scores on extended pedigrees. Our approach is based on the Cluster Variation Method, which deterministically estimates likelihoods by performing exact computations on tractable subsets of variables (clusters) of a Bayesian network. First a distribution over inheritances on the marker loci is approximated with the Cluster Variation Method. Then this distribution is used to estimate the LOD score for each location of the trait locus.
First we demonstrate that significant power may be lost if markers are ignored in the multi-point analysis. On a set of pedigrees where exact computation is possible we compare the estimates of the LOD scores obtained with our method to the exact LOD scores. Secondly, we compare our method to a state of the art MCMC sampler. When both methods are given equal computation time, our method is more efficient. Finally, we show that CVM scales to large problem instances.
We conclude that the Cluster Variation Method is as accurate as MCMC and generally is more efficient. Our method is a promising alternative to approaches based on MCMC sampling.
随着标记数量和未分型个体数量的增加,对扩展家系计算精确的多点LOD分数很快变得不可行。当标记被排除在计算之外时,可能会损失大量功效。因此,需要考虑所有标记的精确近似方法。
我们提出了一种在扩展家系上有效估计LOD分数的新方法。我们的方法基于聚类变分法,该方法通过对贝叶斯网络的易处理变量子集(聚类)进行精确计算来确定性地估计似然性。首先,用聚类变分法近似标记位点上遗传的分布。然后,使用该分布来估计性状位点每个位置的LOD分数。
首先,我们证明了在多点分析中忽略标记可能会损失大量功效。在一组可以进行精确计算的家系上,我们将用我们的方法获得的LOD分数估计值与精确的LOD分数进行比较。其次,我们将我们的方法与一种先进的MCMC采样器进行比较。当两种方法给予相同的计算时间时,我们的方法更有效。最后,我们表明CVM可扩展到大型问题实例。
我们得出结论,聚类变分法与MCMC一样准确,并且通常更有效。我们的方法是基于MCMC采样方法的一个有前途的替代方法。