AgResearch, Invermay Agricultural Centre, Private Bag 50034, Mosgiel 9053, New Zealand
AgResearch, Invermay Agricultural Centre, Private Bag 50034, Mosgiel 9053, New Zealand.
G3 (Bethesda). 2019 Oct 7;9(10):3239-3247. doi: 10.1534/g3.119.400501.
Genotypes are often used to assign parentage in agricultural and ecological settings. Sequencing can be used to obtain genotypes but does not provide unambiguous genotype calls, especially when sequencing depth is low in order to reduce costs. In that case, standard parentage analysis methods no longer apply. A strategy for using low-depth sequencing data for parentage assignment is developed here. It entails the use of relatedness estimates along with a metric termed excess mismatch rate which, for parent-offspring pairs or trios, is the difference between the observed mismatch rate and the rate expected under a model of inheritance and allele reads without error. When more than one putative parent has similar statistics, bootstrapping can provide a measure of the relatedness similarity. Putative parent-offspring trios can be further checked for consistency by comparing the offspring's estimated inbreeding to half the parent relatedness. Suitable thresholds are required for each metric. These methods were applied to a deer breeding operation consisting of two herds of different breeds. Relatedness estimates were more in line with expectation when the herds were analyzed separately than when combined, although this did not alter which parents were the best matches with each offspring. Parentage results were largely consistent with those based on a microsatellite parentage panel with three discordant parent assignments out of 1561. Two models are investigated to allow the parentage metrics to be calculated with non-random selection of alleles. The tools and strategies given here allow parentage to be assigned from low-depth sequencing data.
基因型通常用于农业和生态环境中的亲子关系鉴定。测序可用于获取基因型,但不能提供明确的基因型呼叫,尤其是在为降低成本而降低测序深度的情况下。在这种情况下,标准的亲子关系分析方法不再适用。这里开发了一种使用低深度测序数据进行亲子关系鉴定的策略。它涉及使用亲缘关系估计值以及一种称为多余错配率的度量标准,对于亲子对或三人组,多余错配率是观察到的错配率与遗传模型和无错误等位基因读数下预期的错配率之间的差异。当有多个假定的父母具有相似的统计数据时,可以通过自举法提供亲缘关系相似性的度量标准。可以通过比较后代的估计近亲繁殖与父母亲缘关系的一半来进一步检查假定的亲子三人组是否一致。每个度量标准都需要合适的阈值。这些方法应用于一个由两个不同品种的鹿养殖场组成的繁殖操作中。当分别分析两个牛群时,亲缘关系估计值更符合预期,而当将它们组合在一起时则不然,尽管这并没有改变哪些父母与每个后代最匹配。亲子关系结果与基于微卫星亲子面板的结果基本一致,在 1561 个亲子关系中只有 3 个亲子关系不一致。研究了两种模型,允许使用非随机选择等位基因来计算亲子关系度量标准。此处提供的工具和策略允许从低深度测序数据中分配亲子关系。