• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

西兰花:结合系统发育分析和网络分析进行直系同源物分配

Broccoli: Combining Phylogenetic and Network Analyses for Orthology Assignment.

作者信息

Derelle Romain, Philippe Hervé, Colbourne John K

机构信息

School of Biosciences, University of Birmingham, Birmingham, United Kingdom.

Station d'Ecologie Théorique et Expérimentale, UMR CNRS 5321, Moulis, France.

出版信息

Mol Biol Evol. 2020 Nov 1;37(11):3389-3396. doi: 10.1093/molbev/msaa159.

DOI:10.1093/molbev/msaa159
PMID:32602888
Abstract

Orthology assignment is a key step of comparative genomic studies, for which many bioinformatic tools have been developed. However, all gene clustering pipelines are based on the analysis of protein distances, which are subject to many artifacts. In this article, we introduce Broccoli, a user-friendly pipeline designed to infer, with high precision, orthologous groups, and pairs of proteins using a phylogeny-based approach. Briefly, Broccoli performs ultrafast phylogenetic analyses on most proteins and builds a network of orthologous relationships. Orthologous groups are then identified from the network using a parameter-free machine learning algorithm. Broccoli is also able to detect chimeric proteins resulting from gene-fusion events and to assign these proteins to the corresponding orthologous groups. Tested on two benchmark data sets, Broccoli outperforms current orthology pipelines. In addition, Broccoli is scalable, with runtimes similar to those of recent distance-based pipelines. Given its high level of performance and efficiency, this new pipeline represents a suitable choice for comparative genomic studies. Broccoli is freely available at https://github.com/rderelle/Broccoli.

摘要

直系同源物分配是比较基因组研究的关键步骤,为此已开发了许多生物信息学工具。然而,所有基因聚类流程都基于蛋白质距离分析,而蛋白质距离容易受到多种假象的影响。在本文中,我们介绍了Broccoli,这是一个用户友好的流程,旨在使用基于系统发育的方法高精度地推断直系同源组和蛋白质对。简而言之,Broccoli对大多数蛋白质进行超快速系统发育分析,并构建直系同源关系网络。然后使用无参数机器学习算法从网络中识别直系同源组。Broccoli还能够检测由基因融合事件产生的嵌合蛋白,并将这些蛋白分配到相应的直系同源组。在两个基准数据集上进行测试时,Broccoli的表现优于当前的直系同源流程。此外,Broccoli具有可扩展性,运行时间与最近基于距离的流程相似。鉴于其高性能和高效率,这个新流程是比较基因组研究的合适选择。可在https://github.com/rderelle/Broccoli上免费获取Broccoli。

相似文献

1
Broccoli: Combining Phylogenetic and Network Analyses for Orthology Assignment.西兰花:结合系统发育分析和网络分析进行直系同源物分配
Mol Biol Evol. 2020 Nov 1;37(11):3389-3396. doi: 10.1093/molbev/msaa159.
2
Benchmarking orthology methods using phylogenetic patterns defined at the base of Eukaryotes.使用在真核生物基部定义的系统发育模式对同源物方法进行基准测试。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa206.
3
OrthoSelect: a protocol for selecting orthologous groups in phylogenomics.OrthoSelect:一种在系统发育基因组学中选择直系同源组的方案。
BMC Bioinformatics. 2009 Jul 16;10:219. doi: 10.1186/1471-2105-10-219.
4
Orthograph: a versatile tool for mapping coding nucleotide sequences to clusters of orthologous genes.Orthograph:一种将编码核苷酸序列映射到直系同源基因簇的多功能工具。
BMC Bioinformatics. 2017 Feb 16;18(1):111. doi: 10.1186/s12859-017-1529-8.
5
Orthologous Matrix (OMA) algorithm 2.0: more robust to asymmetric evolutionary rates and more scalable hierarchical orthologous group inference.直系同源矩阵(OMA)算法2.0:对不对称进化速率更具鲁棒性,且在分层直系同源组推断方面更具扩展性。
Bioinformatics. 2017 Jul 15;33(14):i75-i82. doi: 10.1093/bioinformatics/btx229.
6
A New Orthology Assessment Method for Phylogenomic Data: Unrooted Phylogenetic Orthology.一种新的系统发育基因组数据的同源物评估方法:无根系统发育同源物。
Mol Biol Evol. 2016 Aug;33(8):2117-34. doi: 10.1093/molbev/msw069. Epub 2016 Apr 6.
7
SwiftOrtho: A fast, memory-efficient, multiple genome orthology classifier.SwiftOrtho:一种快速、内存高效、多基因组同源分类器。
Gigascience. 2019 Oct 1;8(10). doi: 10.1093/gigascience/giz118.
8
OrthoFinder: phylogenetic orthology inference for comparative genomics.OrthoFinder:用于比较基因组学的系统发育直系同源推断。
Genome Biol. 2019 Nov 14;20(1):238. doi: 10.1186/s13059-019-1832-y.
9
OrthoGNC: A Software for Accurate Identification of Orthologs Based on Gene Neighborhood Conservation.OrthoGNC:一款基于基因邻域保守性准确鉴定直系同源基因的软件。
Genomics Proteomics Bioinformatics. 2017 Dec;15(6):361-370. doi: 10.1016/j.gpb.2017.07.002. Epub 2017 Nov 11.
10
MetaPhOrs: orthology and paralogy predictions from multiple phylogenetic evidence using a consistency-based confidence score.MetaPhOrs:使用基于一致性的置信分数,从多种系统发育证据预测直系同源和旁系同源。
Nucleic Acids Res. 2011 Mar;39(5):e32. doi: 10.1093/nar/gkq953. Epub 2010 Dec 11.

引用本文的文献

1
Evolutionary diversification of ancestral genes across vertebrates and insects.脊椎动物和昆虫中祖先基因的进化多样化。
Genome Biol. 2025 Sep 4;26(1):268. doi: 10.1186/s13059-025-03699-z.
2
Insect Phylogenomics: From Experiment Planning to Post-phylogenetic Analyses.昆虫系统发育基因组学:从实验规划到系统发育后分析
Methods Mol Biol. 2025;2935:211-235. doi: 10.1007/978-1-0716-4583-3_9.
3
The genomic origin of the unique chaetognath body plan.独特箭虫身体结构的基因组起源。
Nature. 2025 Aug 13. doi: 10.1038/s41586-025-09403-2.
4
Cell type-specific immune regulation under symbiosis in a facultatively symbiotic coral.兼性共生珊瑚中共生状态下的细胞类型特异性免疫调节
ISME J. 2025 Jan 2;19(1). doi: 10.1093/ismejo/wraf132.
5
Chromatin loops are an ancestral hallmark of the animal regulatory genome.染色质环是动物调控基因组的一个古老特征。
Nature. 2025 May 7. doi: 10.1038/s41586-025-08960-w.
6
SOI: robust identification of orthologous synteny with the Orthology Index and broad applications in evolutionary genomics.SOI:利用直系同源索引对直系同源同线性进行可靠识别及其在进化基因组学中的广泛应用。
Nucleic Acids Res. 2025 Apr 10;53(7). doi: 10.1093/nar/gkaf320.
7
Chimeric origins and dynamic evolution of central carbon metabolism in eukaryotes.真核生物中心碳代谢的嵌合起源与动态进化
Nat Ecol Evol. 2025 Apr;9(4):613-627. doi: 10.1038/s41559-025-02648-0. Epub 2025 Mar 3.
8
Genomic signatures of SnRKs highlighted conserved evolution within orchids and stress responses through ABA signaling in the Cymbidium ensifolium.SnRKs的基因组特征突出了兰花内部的保守进化以及建兰通过脱落酸信号传导的应激反应。
BMC Plant Biol. 2025 Mar 3;25(1):277. doi: 10.1186/s12870-025-06280-9.
9
Different orthology inference algorithms generate similar predicted orthogroups among Brassicaceae species.不同的直系同源推断算法在十字花科物种中生成相似的预测直系同源组。
Appl Plant Sci. 2024 Dec 25;13(1):e11627. doi: 10.1002/aps3.11627. eCollection 2025 Jan-Feb.
10
A metric and its derived protein network for evaluation of ortholog database inconsistency.一种用于评估直系同源数据库不一致性的度量标准及其衍生的蛋白质网络。
BMC Bioinformatics. 2025 Jan 7;26(1):6. doi: 10.1186/s12859-024-06023-x.