Sheikh Saad I, Berger-Wolf Tanya Y, Ashley Mary V, Caballero Isabel C, Chaovalitwongse Wanpracha, DasGupta Bhaskar
Department of Computer Science, University of Illinois at Chicago, 851 S. Morgan (M/C 152), Room 1120 SEO, Chicago, IL 60607, USA.
Comput Syst Bioinformatics Conf. 2008;7:273-84.
Kinship analysis using genetic data is important for many biological applications, including many in conservation biology. Wide availability of microsatellites has boosted studies in wild populations that rely on the knowledge of kinship, particularly sibling relationships (sibship). While there exist many methods for reconstructing sibling relationships, almost none account for errors and mutations in microsatellite data, which are prevalent and affect the quality of reconstruction. We present an error-tolerant method for reconstructing sibling relationships based on the concept of consensus methods. We test our approach on both real and simulated data, with both pre-existing and introduced errors. Our method is highly accurate on almost all simulations, giving over 90% accuracy in most cases. Ours is the first method designed to tolerate errors while making no assumptions about the population or the sampling.
利用遗传数据进行亲缘关系分析对许多生物学应用都很重要,包括保护生物学中的许多应用。微卫星的广泛可得性推动了对依赖亲缘关系知识的野生种群的研究,特别是同胞关系(同胞群)。虽然存在许多重建同胞关系的方法,但几乎没有一种方法考虑到微卫星数据中的错误和突变,而这些错误和突变很普遍并会影响重建质量。我们提出了一种基于共识方法概念的容错重建同胞关系的方法。我们在真实数据和模拟数据上测试了我们的方法,包括既有错误和引入错误的数据。我们的方法在几乎所有模拟中都高度准确,在大多数情况下准确率超过90%。我们的方法是第一种旨在容错且不对种群或抽样做任何假设的方法。