Wang Bowen, Sverdlov Serge, Thompson Elizabeth
Department of Statistics, University of Washington, Seattle, Washington 98195-4322.
Department of Statistics, University of Washington, Seattle, Washington 98195-4322
Genetics. 2017 Mar;205(3):1063-1078. doi: 10.1534/genetics.116.197004. Epub 2017 Jan 18.
Realized kinship is a key statistic in analyses of genetic data involving relatedness of individuals or structure of populations. There are several estimators of kinship that make use of dense SNP genotypes. We introduce a class of estimators, of which some existing estimators are special cases. Within this class, we derive properties of the estimators and determine an optimal estimator. Additionally, we introduce an alternative marker weighting that takes allelic associations [linkage disequilibrium (LD)] into account, and apply this weighting to several estimators. In a simulation study, we show that improved estimators are obtained (1) by optimal weighting of markers, (2) by taking physical contiguity of genome into account, and (3) by weighting on the basis of LD.
实现的亲缘关系是涉及个体亲缘关系或群体结构的遗传数据分析中的关键统计量。有几种利用密集单核苷酸多态性(SNP)基因型的亲缘关系估计方法。我们引入了一类估计方法,一些现有的估计方法是其特殊情况。在这类方法中,我们推导了估计方法的性质并确定了最优估计方法。此外,我们引入了一种考虑等位基因关联[连锁不平衡(LD)]的替代标记加权方法,并将此加权方法应用于几种估计方法。在一项模拟研究中,我们表明通过(1)对标记进行最优加权、(2)考虑基因组的物理连续性以及(3)基于连锁不平衡进行加权,可以获得改进的估计方法。