Suppr超能文献

Dsuite-从 VCF 文件中快速计算 D 统计量和相关的混合证据。

Dsuite - Fast D-statistics and related admixture evidence from VCF files.

机构信息

Zoological Institute, University of Basel, Basel, Switzerland.

Department of Paleontology and Museum, University of Zurich, Zurich, Switzerland.

出版信息

Mol Ecol Resour. 2021 Feb;21(2):584-595. doi: 10.1111/1755-0998.13265. Epub 2020 Oct 24.

Abstract

Patterson's D, also known as the ABBA-BABA statistic, and related statistics such as the f -ratio, are commonly used to assess evidence of gene flow between populations or closely related species. Currently available implementations often require custom file formats, implement only small subsets of the available statistics, and are impractical to evaluate all gene flow hypotheses across data sets with many populations or species due to computational inefficiencies. Here, we present a new software package Dsuite, an efficient implementation allowing genome scale calculations of the D and f -ratio statistics across all combinations of tens or hundreds of populations or species directly from a variant call format (VCF) file. Our program also implements statistics suited for application to genomic windows, providing evidence of whether introgression is confined to specific loci, and it can also aid in interpretation of a system of f -ratio results with the use of the "f-branch" method. Dsuite is available at https://github.com/millanek/Dsuite, is straightforward to use, substantially more computationally efficient than comparable programs, and provides a convenient suite of tools and statistics, including some not previously available in any software package. Thus, Dsuite facilitates the assessment of evidence for gene flow, especially across larger genomic data sets.

摘要

帕特森 D,也称为 ABBA-BABA 统计量,以及相关的统计量,如 f-比,通常用于评估种群或密切相关物种之间基因流动的证据。目前可用的实现方法通常需要自定义文件格式,仅实现可用统计量的一小部分,并且由于计算效率低下,对于具有许多种群或物种的数据集,评估所有基因流动假设是不切实际的。在这里,我们提出了一个新的软件包 Dsuite,这是一种高效的实现方法,允许直接从变体调用格式(VCF)文件计算 D 和 f-比统计量在数十个或数百个种群或物种之间的所有组合。我们的程序还实现了适用于基因组窗口应用的统计量,提供了是否有基因渐渗仅限于特定基因座的证据,并且还可以通过使用“f-分支”方法帮助解释 f-比结果系统。Dsuite 可在 https://github.com/millanek/Dsuite 上获得,使用简单,计算效率比可比程序高得多,并且提供了一套方便的工具和统计量,包括以前在任何软件包中都不可用的一些统计量。因此,Dsuite 促进了对基因流动证据的评估,特别是在更大的基因组数据集上。

相似文献

3
Estimates of introgression as a function of pairwise distances.估计基因渐渗作为成对距离的函数。
BMC Bioinformatics. 2019 Apr 23;20(1):207. doi: 10.1186/s12859-019-2747-z.

引用本文的文献

本文引用的文献

4
Estimates of introgression as a function of pairwise distances.估计基因渐渗作为成对距离的函数。
BMC Bioinformatics. 2019 Apr 23;20(1):207. doi: 10.1186/s12859-019-2747-z.
10
The contribution of admixture to primate evolution.混血对灵长类进化的贡献。
Curr Opin Genet Dev. 2017 Dec;47:61-68. doi: 10.1016/j.gde.2017.08.010. Epub 2017 Sep 15.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验