Gene flow analysis method, the D-statistic, is robust in a wide parameter space.

Affiliation

Biodiversität und Klima Forschungszentrum, Senckenberg Gesellschaft für Naturforschung, 60325, Frankfurt, Germany.

Publication information

BMC Bioinformatics. 2018 Jan 8;19(1):10. doi: 10.1186/s12859-017-2002-4.

Abstract

BACKGROUND

We evaluated the sensitivity of the D-statistic, a parsimony-like method widely used to detect gene flow between closely related species. This method has been applied to a variety of taxa with a wide range of divergence times. However, its parameter space and thus its applicability to a wide taxonomic range has not been systematically studied. Divergence time, population size, time of gene flow, distance of outgroup and number of loci were examined in a sensitivity analysis.
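
As an illustration of what the D-statistic measures, the following minimal sketch counts ABBA and BABA site patterns in a four-taxon alignment with the assumed topology (((P1, P2), P3), O) and computes D = (ABBA - BABA) / (ABBA + BABA). The function name `d_statistic` and the toy sequences are hypothetical and are not taken from the study; the sketch assumes one haploid sequence per taxon and treats the outgroup allele as ancestral.

```python
def d_statistic(p1, p2, p3, outgroup):
    """Return (D, n_abba, n_baba) for four aligned sequences.

    Assumed topology: (((P1, P2), P3), O). The outgroup allele is
    treated as ancestral ("A"), the other allele as derived ("B").
    """
    if not (len(p1) == len(p2) == len(p3) == len(outgroup)):
        raise ValueError("sequences must be aligned to the same length")

    n_abba = 0
    n_baba = 0
    for a, b, c, o in zip(p1, p2, p3, outgroup):
        # Skip gaps, missing data and ambiguity codes.
        if any(base not in "ACGT" for base in (a, b, c, o)):
            continue
        if a == o and b == c and a != b:
            # P1 ancestral, P2 and P3 share the derived allele: ABBA
            n_abba += 1
        elif a == c and b == o and a != b:
            # P2 ancestral, P1 and P3 share the derived allele: BABA
            n_baba += 1

    informative = n_abba + n_baba
    d = (n_abba - n_baba) / informative if informative else float("nan")
    return d, n_abba, n_baba


if __name__ == "__main__":
    # Toy alignment with 3 ABBA sites and 1 BABA site.
    d, n_abba, n_baba = d_statistic("AAGTAA", "GAATCG", "GAGTCG", "AAATAA")
    print(d, n_abba, n_baba)  # 0.5 3 1
```

Without gene flow, incomplete lineage sorting is expected to produce ABBA and BABA sites in equal numbers, so D should be close to zero. A positive D in this sketch indicates excess allele sharing between P2 and P3, and a negative D excess sharing between P1 and P3. Assessing significance (for example with a block jackknife) is not shown here.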

RESULT

The sensitivity study shows that the primary determinant of the D-statistic is the relative population size, i.e. the population size scaled by the number of generations since divergence. This is consistent with incomplete lineage sorting being the main confounding factor in gene flow detection, because it dilutes the gene flow signal. The sensitivity of the D-statistic is also affected by the direction of gene flow and by the size and number of loci. In addition, we examined the ability of the f-statistics, [Formula: see text] and [Formula: see text], to estimate the fraction of a genome affected by gene flow; while these statistics are difficult to apply to practical questions in biology because the time at which gene flow happened is generally not known, they can be used to compare datasets with identical or similar demographic backgrounds.
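
To make the link between relative population size and incomplete lineage sorting concrete, the sketch below uses the standard three-taxon coalescent result that a gene tree disagrees with the species tree with probability (2/3) * exp(-T / (2N)), where T is the length of the internal branch in generations and N is the diploid effective population size. This is a textbook approximation for a constant-size population without gene flow, not the simulation model used in the study, and the function name `ils_discordance_probability` is hypothetical.

```python
import math


def ils_discordance_probability(pop_size, generations):
    """Probability that a three-taxon gene tree is discordant with the
    species tree because of incomplete lineage sorting alone.

    Assumes a constant diploid effective population size `pop_size`
    along an internal branch lasting `generations` generations.
    """
    if pop_size <= 0 or generations < 0:
        raise ValueError("pop_size must be positive and generations non-negative")
    # Internal branch length in coalescent units (2N generations).
    branch_coalescent_units = generations / (2.0 * pop_size)
    return (2.0 / 3.0) * math.exp(-branch_coalescent_units)


if __name__ == "__main__":
    # Same branch length, increasing population size.
    for n in (1_000, 10_000, 100_000):
        print(n, ils_discordance_probability(n, 20_000))
```

For a fixed branch length, the discordance probability rises toward 2/3 as the population size grows, which is in line with the abstract's point that a large population size relative to the number of generations leaves more incomplete lineage sorting to dilute the gene flow signal.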

CONCLUSIONS

The D-statistic, as a method to detect gene flow, is robust against a wide range of genetic distances (divergence times) but it is sensitive to population size. The D-statistic should only be applied with critical reservation to taxa where population sizes are large relative to branch lengths in generations.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47ef/5759368/cf19aabf711d/12859_2017_2002_Fig1_HTML.jpg
