Ellis Thomas James, Field David Luke, Barton Nicholas H
Institute of Science and Technology Austria, Klosterneuburg, Austria.
Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden.
Mol Ecol Resour. 2018 Apr 6. doi: 10.1111/1755-0998.12782.
Pedigree and sibship reconstruction are important methods in quantifying relationships and fitness of individuals in natural populations. Current methods employ a Markov chain-based algorithm to explore plausible possible pedigrees iteratively. This provides accurate results, but is time-consuming. Here, we develop a method to infer sibship and paternity relationships from half-sibling arrays of known maternity using hierarchical clustering. Given 50 or more unlinked SNP markers and empirically derived error rates, the method performs as well as the widely used package Colony, but is faster by two orders of magnitude. Using simulations, we show that the method performs well across contrasting mating scenarios, even when samples are large. We then apply the method to open-pollinated arrays of the snapdragon Antirrhinum majus and find evidence for a high degree of multiple mating. Although we focus on diploid SNP data, the method does not depend on marker type and as such has broad applications in nonmodel systems.
谱系和同胞关系重建是量化自然种群中个体间关系和适合度的重要方法。当前的方法采用基于马尔可夫链的算法来迭代探索可能的谱系。这能提供准确的结果,但耗时较长。在此,我们开发了一种方法,利用层次聚类从已知母系的半同胞阵列中推断同胞关系和父系关系。给定50个或更多不连锁的单核苷酸多态性(SNP)标记以及根据经验得出的错误率,该方法的表现与广泛使用的Colony软件包相当,但速度快两个数量级。通过模拟,我们表明该方法在不同的交配场景下都能表现良好,即使样本量很大。然后我们将该方法应用于金鱼草的开放授粉阵列,发现了高度多重交配的证据。尽管我们专注于二倍体SNP数据,但该方法不依赖于标记类型,因此在非模式系统中有广泛的应用。