• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

FastMulRFS:在通用的基因复制和缺失模型下快速准确的物种树估计。

FastMulRFS: fast and accurate species tree estimation under generic gene duplication and loss models.

机构信息

Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA.

出版信息

Bioinformatics. 2020 Jul 1;36(Suppl_1):i57-i65. doi: 10.1093/bioinformatics/btaa444.

DOI:10.1093/bioinformatics/btaa444
PMID:32657396
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7355287/
Abstract

MOTIVATION

Species tree estimation is a basic part of biological research but can be challenging because of gene duplication and loss (GDL), which results in genes that can appear more than once in a given genome. All common approaches in phylogenomic studies either reduce available data or are error-prone, and thus, scalable methods that do not discard data and have high accuracy on large heterogeneous datasets are needed.

RESULTS

We present FastMulRFS, a polynomial-time method for estimating species trees without knowledge of orthology. We prove that FastMulRFS is statistically consistent under a generic model of GDL when adversarial GDL does not occur. Our extensive simulation study shows that FastMulRFS matches the accuracy of MulRF (which tries to solve the same optimization problem) and has better accuracy than prior methods, including ASTRAL-multi (the only method to date that has been proven statistically consistent under GDL), while being much faster than both methods.

AVAILABILITY AND IMPEMENTATION

FastMulRFS is available on Github (https://github.com/ekmolloy/fastmulrfs).

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

物种树估计是生物研究的基本部分,但由于基因复制和丢失(GDL),这会导致给定基因组中的基因出现多次,因此具有挑战性。系统发育基因组学研究中的所有常见方法要么减少可用数据,要么容易出错,因此需要不丢弃数据且在大型异构数据集上具有高精度的可扩展方法。

结果

我们提出了 FastMulRFS,这是一种在不知道同源性的情况下估计物种树的多项式时间方法。我们证明了在对抗性 GDL 不发生的情况下,FastMulRFS 在通用 GDL 模型下具有统计一致性。我们广泛的模拟研究表明,FastMulRFS 与 MulRF 的准确性相匹配(MulRF 试图解决相同的优化问题),并且比包括 ASTRAL-multi(迄今为止唯一在 GDL 下证明具有统计一致性的方法)在内的先前方法具有更高的准确性,同时比这两种方法都快得多。

可用性和实现

FastMulRFS 可在 Github(https://github.com/ekmolloy/fastmulrfs)上获得。

补充信息

补充数据可在 Bioinformatics 在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/efb4/7355287/229fe9749618/btaa444f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/efb4/7355287/0cecf32a4f3d/btaa444f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/efb4/7355287/cd1cd5d4a485/btaa444f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/efb4/7355287/229fe9749618/btaa444f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/efb4/7355287/0cecf32a4f3d/btaa444f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/efb4/7355287/cd1cd5d4a485/btaa444f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/efb4/7355287/229fe9749618/btaa444f3.jpg

相似文献

1
FastMulRFS: fast and accurate species tree estimation under generic gene duplication and loss models.FastMulRFS:在通用的基因复制和缺失模型下快速准确的物种树估计。
Bioinformatics. 2020 Jul 1;36(Suppl_1):i57-i65. doi: 10.1093/bioinformatics/btaa444.
2
Polynomial-Time Statistical Estimation of Species Trees Under Gene Duplication and Loss.多项式时间下基因重复和缺失下种系树的统计估计
J Comput Biol. 2021 May;28(5):452-468. doi: 10.1089/cmb.2020.0424. Epub 2020 Dec 15.
3
DISCO: Species Tree Inference using Multicopy Gene Family Tree Decomposition.利用多拷贝基因家族树分解进行种系树推断。
Syst Biol. 2022 Apr 19;71(3):610-629. doi: 10.1093/sysbio/syab070.
4
Quartet-based inference is statistically consistent under the unified duplication-loss-coalescence model.基于四重体的推断在统一的复制-丢失-合并模型下是统计一致的。
Bioinformatics. 2021 Nov 18;37(22):4064-4074. doi: 10.1093/bioinformatics/btab414.
5
ASTRAL: genome-scale coalescent-based species tree estimation.ASTRAL:基于基因组规模合并的物种树估计。
Bioinformatics. 2014 Sep 1;30(17):i541-8. doi: 10.1093/bioinformatics/btu462.
6
wQFM: highly accurate genome-scale species tree estimation from weighted quartets.wQFM:基于加权四重奏的高精度基因组规模物种树估计
Bioinformatics. 2021 Nov 5;37(21):3734-3743. doi: 10.1093/bioinformatics/btab428.
7
Weighted ASTRID: fast and accurate species trees from weighted internode distances.加权ASTRID:基于加权节间距离的快速准确物种树构建方法
Algorithms Mol Biol. 2023 Jul 19;18(1):6. doi: 10.1186/s13015-023-00230-6.
8
ASTRAL-II: coalescent-based species tree estimation with many hundreds of taxa and thousands of genes.ASTRAL-II:基于合并的数百个分类群和数千个基因的种系发生树估计。
Bioinformatics. 2015 Jun 15;31(12):i44-52. doi: 10.1093/bioinformatics/btv234.
9
MulRF: a software package for phylogenetic analysis using multi-copy gene trees.MulRF:一个使用多拷贝基因树进行系统发育分析的软件包。
Bioinformatics. 2015 Feb 1;31(3):432-3. doi: 10.1093/bioinformatics/btu648. Epub 2014 Oct 1.
10
Assessing approaches for inferring species trees from multi-copy genes.评估从多拷贝基因推断物种树的方法。
Syst Biol. 2015 Mar;64(2):325-39. doi: 10.1093/sysbio/syu128. Epub 2014 Dec 23.

引用本文的文献

1
wQFM-DISCO: DISCO-enabled wQFM improves phylogenomic analyses despite the presence of paralogs.wQFM-DISCO:尽管存在旁系同源物,但启用DISCO的wQFM改善了系统发育基因组分析。
Bioinform Adv. 2024 Nov 27;4(1):vbae189. doi: 10.1093/bioadv/vbae189. eCollection 2024.
2
AleRax: a tool for gene and species tree co-estimation and reconciliation under a probabilistic model of gene duplication, transfer, and loss.AleRax:一种在基因复制、转移和丢失的概率模型下,进行基因和物种树共同估计和协调的工具。
Bioinformatics. 2024 Mar 29;40(4). doi: 10.1093/bioinformatics/btae162.
3
Unifying duplication episode clustering and gene-species mapping inference.

本文引用的文献

1
Polynomial-Time Statistical Estimation of Species Trees Under Gene Duplication and Loss.多项式时间下基因重复和缺失下种系树的统计估计
J Comput Biol. 2021 May;28(5):452-468. doi: 10.1089/cmb.2020.0424. Epub 2020 Dec 15.
2
ASTRAL-Pro: Quartet-Based Species-Tree Inference despite Paralogy.ASTRAL-Pro:基于四重奏的系统发生树推断,即便存在基因重复。
Mol Biol Evol. 2020 Nov 1;37(11):3292-3307. doi: 10.1093/molbev/msaa139.
3
Evolution through segmental duplications and losses: a Super-Reconciliation approach.通过片段重复和缺失实现的进化:一种超级比对方法。
统一重复事件聚类和基因-物种映射推断。
Algorithms Mol Biol. 2024 Feb 14;19(1):7. doi: 10.1186/s13015-024-00252-8.
4
Dollo-CDP: a polynomial-time algorithm for the clade-constrained large Dollo parsimony problem.多洛 - CDP:一种用于分支约束大型多洛简约问题的多项式时间算法。
Algorithms Mol Biol. 2024 Jan 8;19(1):2. doi: 10.1186/s13015-023-00249-9.
5
DISCO+QR: rooting species trees in the presence of GDL and ILS.DISCO+QR:在存在基因水平转移(GDL)和不完全谱系分选(ILS)的情况下确定物种树的根。
Bioinform Adv. 2023 Feb 7;3(1):vbad015. doi: 10.1093/bioadv/vbad015. eCollection 2023.
6
Phylogenetic reconciliation.系统发育和解
PLoS Comput Biol. 2022 Nov 3;18(11):e1010621. doi: 10.1371/journal.pcbi.1010621. eCollection 2022 Nov.
7
Recent progress on methods for estimating and updating large phylogenies.关于估计和更新大型系统发育树的方法的最新进展。
Philos Trans R Soc Lond B Biol Sci. 2022 Oct 10;377(1861):20210244. doi: 10.1098/rstb.2021.0244. Epub 2022 Aug 22.
8
Phylogenomic Analyses of 2,786 Genes in 158 Lineages Support a Root of the Eukaryotic Tree of Life between Opisthokonts and All Other Lineages.158 个谱系的 2786 个基因的系统基因组分析支持后生动物与其他所有谱系之间的真核生物树的根部。
Genome Biol Evol. 2022 Aug 3;14(8). doi: 10.1093/gbe/evac119.
9
Species Tree Estimation and the Impact of Gene Loss Following Whole-Genome Duplication.种系树估计及全基因组复制后基因丢失的影响。
Syst Biol. 2022 Oct 12;71(6):1348-1361. doi: 10.1093/sysbio/syac040.
10
Embedding gene trees into phylogenetic networks by conflict resolution algorithms.通过冲突解决算法将基因树嵌入系统发育网络。
Algorithms Mol Biol. 2022 May 19;17(1):11. doi: 10.1186/s13015-022-00218-8.
Algorithms Mol Biol. 2020 May 26;15:12. doi: 10.1186/s13015-020-00171-4. eCollection 2020.
4
One thousand plant transcriptomes and the phylogenomics of green plants.一万种植物转录组与绿色植物的系统发生基因组学
Nature. 2019 Oct;574(7780):679-685. doi: 10.1038/s41586-019-1693-2. Epub 2019 Oct 23.
5
Reconciling multiple genes trees via segmental duplications and losses.通过片段重复和缺失来协调多个基因树。
Algorithms Mol Biol. 2019 Mar 20;14:7. doi: 10.1186/s13015-019-0139-6. eCollection 2019.
6
Gene tree species tree reconciliation with gene conversion.基因树与物种树的基因转换校正。
J Math Biol. 2019 May;78(6):1981-2014. doi: 10.1007/s00285-019-01331-w. Epub 2019 Feb 15.
7
Multi-allele species reconstruction using ASTRAL.使用 ASTRAL 进行多等位基因物种重建。
Mol Phylogenet Evol. 2019 Jan;130:286-296. doi: 10.1016/j.ympev.2018.10.033. Epub 2018 Oct 26.
8
On the impact of uncertain gene tree rooting on duplication-transfer-loss reconciliation.关于基因树无根状态对重复-转移-丢失事件整合的影响。
BMC Bioinformatics. 2018 Aug 13;19(Suppl 9):290. doi: 10.1186/s12859-018-2269-0.
9
Accurate prediction of orthologs in the presence of divergence after duplication.在复制后发生分歧的情况下准确预测直系同源物。
Bioinformatics. 2018 Jul 1;34(13):i366-i375. doi: 10.1093/bioinformatics/bty242.
10
ASTRAL-III: polynomial time species tree reconstruction from partially resolved gene trees.ASTRAL-III:从部分解析的基因树重建多项式时间种系发生树。
BMC Bioinformatics. 2018 May 8;19(Suppl 6):153. doi: 10.1186/s12859-018-2129-y.