Dsuite - Fast D-statistics and related admixture evidence from VCF files.

Affiliations

Zoological Institute, University of Basel, Basel, Switzerland.

Department of Paleontology and Museum, University of Zurich, Zurich, Switzerland.

Publication information

Mol Ecol Resour. 2021 Feb;21(2):584-595. doi: 10.1111/1755-0998.13265. Epub 2020 Oct 24.

DOI: 10.1111/1755-0998.13265
PMID: 33012121
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7116594/
Abstract

Patterson's D, also known as the ABBA-BABA statistic, and related statistics such as the f-ratio, are commonly used to assess evidence of gene flow between populations or closely related species. Currently available implementations often require custom file formats, implement only small subsets of the available statistics, and are impractical to evaluate all gene flow hypotheses across data sets with many populations or species due to computational inefficiencies. Here, we present a new software package Dsuite, an efficient implementation allowing genome scale calculations of the D and f-ratio statistics across all combinations of tens or hundreds of populations or species directly from a variant call format (VCF) file. Our program also implements statistics suited for application to genomic windows, providing evidence of whether introgression is confined to specific loci, and it can also aid in interpretation of a system of f-ratio results with the use of the "f-branch" method. Dsuite is available at https://github.com/millanek/Dsuite, is straightforward to use, substantially more computationally efficient than comparable programs, and provides a convenient suite of tools and statistics, including some not previously available in any software package. Thus, Dsuite facilitates the assessment of evidence for gene flow, especially across larger genomic data sets.
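
For readers unfamiliar with the ABBA-BABA statistic named in the abstract, the following is a minimal Python sketch of the commonly used allele-frequency form of Patterson's D for a trio ((P1, P2), P3) with an outgroup. It is an illustration under stated assumptions, not Dsuite's implementation: the function name `patterson_d` and its parameters are hypothetical, the inputs are assumed to be per-site derived-allele frequencies already extracted from a VCF, and the outgroup is assumed fixed for the ancestral allele unless frequencies are supplied. Significance testing is omitted.

```python
from typing import Optional, Sequence


def patterson_d(
    p1: Sequence[float],
    p2: Sequence[float],
    p3: Sequence[float],
    p_out: Optional[Sequence[float]] = None,
) -> float:
    """Patterson's D for the trio ((P1, P2), P3) with an outgroup.

    Each argument holds one derived-allele frequency per biallelic site.
    If p_out is omitted, the outgroup is assumed (for illustration) to be
    fixed for the ancestral allele at every site.
    """
    if p_out is None:
        p_out = [0.0] * len(p1)
    if not (len(p1) == len(p2) == len(p3) == len(p_out)):
        raise ValueError("all frequency vectors must have the same length")

    abba = 0.0  # pattern: P2 and P3 share the derived allele
    baba = 0.0  # pattern: P1 and P3 share the derived allele
    for f1, f2, f3, fo in zip(p1, p2, p3, p_out):
        abba += (1.0 - f1) * f2 * f3 * (1.0 - fo)
        baba += f1 * (1.0 - f2) * f3 * (1.0 - fo)

    total = abba + baba
    if total == 0.0:
        raise ValueError("no informative (ABBA or BABA) sites")
    return (abba - baba) / total


# Toy data: three sites, frequencies are made up for illustration only.
d = patterson_d([0.0, 1.0, 0.0], [1.0, 0.0, 1.0], [1.0, 1.0, 1.0])
print(round(d, 4))
```

In this form, a positive D indicates an excess of allele sharing between P2 and P3, a negative D indicates an excess between P1 and P3, and values near zero are consistent with no gene flow. Swapping P1 and P2 flips the sign.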
