• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ASTRAL-Pro:基于四重奏的系统发生树推断,即便存在基因重复。

ASTRAL-Pro: Quartet-Based Species-Tree Inference despite Paralogy.

机构信息

Bioinformatics and Systems Biology, University of California San Diego, San Diego, CA.

ISEM, Université de Montpellier, CNRS, IRD, EPHE, Montpellier, France.

出版信息

Mol Biol Evol. 2020 Nov 1;37(11):3292-3307. doi: 10.1093/molbev/msaa139.

DOI:10.1093/molbev/msaa139
PMID:32886770
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7751180/
Abstract

Phylogenetic inference from genome-wide data (phylogenomics) has revolutionized the study of evolution because it enables accounting for discordance among evolutionary histories across the genome. To this end, summary methods have been developed to allow accurate and scalable inference of species trees from gene trees. However, most of these methods, including the widely used ASTRAL, can only handle single-copy gene trees and do not attempt to model gene duplication and gene loss. As a result, most phylogenomic studies have focused on single-copy genes and have discarded large parts of the data. Here, we first propose a measure of quartet similarity between single-copy and multicopy trees that accounts for orthology and paralogy. We then introduce a method called ASTRAL-Pro (ASTRAL for PaRalogs and Orthologs) to find the species tree that optimizes our quartet similarity measure using dynamic programing. By studying its performance on an extensive collection of simulated data sets and on real data sets, we show that ASTRAL-Pro is more accurate than alternative methods.

摘要

基于全基因组数据的系统发育推断(系统基因组学)彻底改变了进化研究,因为它能够解释整个基因组中进化历史的不一致性。为此,已经开发了汇总方法,以允许从基因树准确且可扩展地推断物种树。然而,这些方法中的大多数,包括广泛使用的 ASTRAL,只能处理单拷贝基因树,并且不尝试对基因复制和基因丢失进行建模。结果,大多数系统基因组学研究都集中在单拷贝基因上,并丢弃了大部分数据。在这里,我们首先提出了一种在单拷贝和多拷贝树之间测量四联体相似性的方法,该方法考虑了直系同源物和旁系同源物。然后,我们引入了一种称为 ASTRAL-Pro(针对旁系同源物和直系同源物的 ASTRAL)的方法,该方法使用动态编程找到优化我们四联体相似性度量的物种树。通过在大量模拟数据集和真实数据集上研究其性能,我们表明 ASTRAL-Pro 比替代方法更准确。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e383/7751180/eb31ea94ec1b/msaa139f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e383/7751180/d11d0ca8e793/msaa139f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e383/7751180/f52499e55456/msaa139f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e383/7751180/d9a34003bb9c/msaa139f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e383/7751180/86ef5f86a50b/msaa139f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e383/7751180/1e5c9f6a480a/msaa139f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e383/7751180/cda4cf403793/msaa139f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e383/7751180/eb31ea94ec1b/msaa139f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e383/7751180/d11d0ca8e793/msaa139f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e383/7751180/f52499e55456/msaa139f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e383/7751180/d9a34003bb9c/msaa139f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e383/7751180/86ef5f86a50b/msaa139f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e383/7751180/1e5c9f6a480a/msaa139f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e383/7751180/cda4cf403793/msaa139f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e383/7751180/eb31ea94ec1b/msaa139f7.jpg

相似文献

1
ASTRAL-Pro: Quartet-Based Species-Tree Inference despite Paralogy.ASTRAL-Pro:基于四重奏的系统发生树推断,即便存在基因重复。
Mol Biol Evol. 2020 Nov 1;37(11):3292-3307. doi: 10.1093/molbev/msaa139.
2
ASTRAL-III: polynomial time species tree reconstruction from partially resolved gene trees.ASTRAL-III:从部分解析的基因树重建多项式时间种系发生树。
BMC Bioinformatics. 2018 May 8;19(Suppl 6):153. doi: 10.1186/s12859-018-2129-y.
3
DISCO: Species Tree Inference using Multicopy Gene Family Tree Decomposition.利用多拷贝基因家族树分解进行种系树推断。
Syst Biol. 2022 Apr 19;71(3):610-629. doi: 10.1093/sysbio/syab070.
4
Multi-allele species reconstruction using ASTRAL.使用 ASTRAL 进行多等位基因物种重建。
Mol Phylogenet Evol. 2019 Jan;130:286-296. doi: 10.1016/j.ympev.2018.10.033. Epub 2018 Oct 26.
5
Species Tree Inference Methods Intended to Deal with Incomplete Lineage Sorting Are Robust to the Presence of Paralogs.旨在处理不完全谱系分类的种系树推断方法对旁系同源基因的存在具有稳健性。
Syst Biol. 2022 Feb 10;71(2):367-381. doi: 10.1093/sysbio/syab056.
6
Species Tree Estimation and the Impact of Gene Loss Following Whole-Genome Duplication.种系树估计及全基因组复制后基因丢失的影响。
Syst Biol. 2022 Oct 12;71(6):1348-1361. doi: 10.1093/sysbio/syac040.
7
Quartet-based inference is statistically consistent under the unified duplication-loss-coalescence model.基于四重体的推断在统一的复制-丢失-合并模型下是统计一致的。
Bioinformatics. 2021 Nov 18;37(22):4064-4074. doi: 10.1093/bioinformatics/btab414.
8
Weighting by Gene Tree Uncertainty Improves Accuracy of Quartet-based Species Trees.基于基因树不确定性的加权可提高基于四元组的种系发生树的准确性。
Mol Biol Evol. 2022 Dec 5;39(12). doi: 10.1093/molbev/msac215.
9
Bayesian-Weighted Triplet and Quartet Methods for Species Tree Inference.贝叶斯加权三重态和四重态方法在种系发生树推断中的应用。
Bull Math Biol. 2021 Jul 23;83(9):93. doi: 10.1007/s11538-021-00918-z.
10
Forcing external constraints on tree inference using ASTRAL.使用 ASTRAL 强制对树推断施加外部约束。
BMC Genomics. 2020 Apr 16;21(Suppl 2):218. doi: 10.1186/s12864-020-6607-z.

引用本文的文献

1
Insect Phylogenomics: From Experiment Planning to Post-phylogenetic Analyses.昆虫系统发育基因组学:从实验规划到系统发育后分析
Methods Mol Biol. 2025;2935:211-235. doi: 10.1007/978-1-0716-4583-3_9.
2
Building a robust backbone for Astragalus using a clade-specific target enrichment bait set.使用特定分支的目标富集诱饵集构建黄芪属植物强大的主干。
Am J Bot. 2025 Aug;112(8):e70084. doi: 10.1002/ajb2.70084. Epub 2025 Aug 19.
3
ASTER: A Package for Large-Scale Phylogenomic Reconstructions.ASTER:一个用于大规模系统发育基因组重建的软件包。

本文引用的文献

1
Quartet-based inference is statistically consistent under the unified duplication-loss-coalescence model.基于四重体的推断在统一的复制-丢失-合并模型下是统计一致的。
Bioinformatics. 2021 Nov 18;37(22):4064-4074. doi: 10.1093/bioinformatics/btab414.
2
Polynomial-Time Statistical Estimation of Species Trees Under Gene Duplication and Loss.多项式时间下基因重复和缺失下种系树的统计估计
J Comput Biol. 2021 May;28(5):452-468. doi: 10.1089/cmb.2020.0424. Epub 2020 Dec 15.
3
The Multilocus Multispecies Coalescent: A Flexible New Model of Gene Family Evolution.
Mol Biol Evol. 2025 Jul 30;42(8). doi: 10.1093/molbev/msaf172.
4
Methane filtration and metabolic cooperation of microbial communities in cold seep water columns from South China Sea.南海冷泉水柱中微生物群落的甲烷过滤与代谢合作
Commun Biol. 2025 Jul 15;8(1):1052. doi: 10.1038/s42003-025-08471-4.
5
Phylogenomics reveals the slow-burning fuse of diatom evolution.系统发育基因组学揭示了硅藻进化的缓慢导火索。
Proc Natl Acad Sci U S A. 2025 Jun 3;122(22):e2500153122. doi: 10.1073/pnas.2500153122. Epub 2025 May 29.
6
Integrating phylogenies with chronology to assemble the tree of life.将系统发育与时间顺序相结合以构建生命之树。
Front Bioinform. 2025 Apr 30;5:1571568. doi: 10.3389/fbinf.2025.1571568. eCollection 2025.
7
Phylogenomic Inference Suggests Differential Deep Time Phylogenetic Signals from Nuclear and Organellar Genomes in Gymnosperms.系统发育基因组学推断表明裸子植物中核基因组和细胞器基因组存在不同的深层系统发育信号。
Plants (Basel). 2025 Apr 28;14(9):1335. doi: 10.3390/plants14091335.
8
AutoMLST2: a web server for phylogeny and microbial taxonomy.自动多位点序列分型工具2:一个用于系统发育和微生物分类的网络服务器
Nucleic Acids Res. 2025 Jul 7;53(W1):W45-W50. doi: 10.1093/nar/gkaf397.
9
An updated phylogeny of Boraginales based on the Angiosperms353 probe set: a roadmap for understanding morphological evolution.基于被子植物353探针集的紫草科更新系统发育树:理解形态演化的路线图。
Ann Bot. 2025 Sep 2;136(1):77-97. doi: 10.1093/aob/mcaf061.
10
Augmenting microbial phylogenomic signal with tailored marker gene sets.用定制的标记基因集增强微生物系统发育信号。
bioRxiv. 2025 Mar 15:2025.03.13.643052. doi: 10.1101/2025.03.13.643052.
多位点多物种合并模型:一种基因家族进化的灵活新模型。
Syst Biol. 2021 Jun 16;70(4):822-837. doi: 10.1093/sysbio/syaa084.
4
FastMulRFS: fast and accurate species tree estimation under generic gene duplication and loss models.FastMulRFS:在通用的基因复制和缺失模型下快速准确的物种树估计。
Bioinformatics. 2020 Jul 1;36(Suppl_1):i57-i65. doi: 10.1093/bioinformatics/btaa444.
5
One thousand plant transcriptomes and the phylogenomics of green plants.一万种植物转录组与绿色植物的系统发生基因组学
Nature. 2019 Oct;574(7780):679-685. doi: 10.1038/s41586-019-1693-2. Epub 2019 Oct 23.
6
A Critical Appraisal of the Placement of Xiphosura (Chelicerata) with Account of Known Sources of Phylogenetic Error.对剑尾目(螯肢动物)位置的批判性评价,以及已知系统发育错误来源的说明。
Syst Biol. 2019 Nov 1;68(6):896-917. doi: 10.1093/sysbio/syz011.
7
ASTRAL-MP: scaling ASTRAL to very large datasets using randomization and parallelization.ASTRAL-MP:通过随机化和并行化扩展 ASTRAL 以适应非常大的数据集。
Bioinformatics. 2019 Oct 15;35(20):3961-3969. doi: 10.1093/bioinformatics/btz211.
8
Multi-allele species reconstruction using ASTRAL.使用 ASTRAL 进行多等位基因物种重建。
Mol Phylogenet Evol. 2019 Jan;130:286-296. doi: 10.1016/j.ympev.2018.10.033. Epub 2018 Oct 26.
9
ASTRAL-III: polynomial time species tree reconstruction from partially resolved gene trees.ASTRAL-III:从部分解析的基因树重建多项式时间种系发生树。
BMC Bioinformatics. 2018 May 8;19(Suppl 6):153. doi: 10.1186/s12859-018-2129-y.
10
DiscoVista: Interpretable visualizations of gene tree discordance.DiscoVista:基因树分歧的可解释可视化
Mol Phylogenet Evol. 2018 May;122:110-115. doi: 10.1016/j.ympev.2018.01.019. Epub 2018 Feb 5.