• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于系统发育树的自动直系同源物推断及直系同源性可靠性计算。

Automated ortholog inference from phylogenetic trees and calculation of orthology reliability.

作者信息

Storm Christian E V, Sonnhammer Erik L L

机构信息

Center for Genomics and Bioinformatics, Karolinska Institutet, S-171 77 Stockholm, Sweden.

出版信息

Bioinformatics. 2002 Jan;18(1):92-9. doi: 10.1093/bioinformatics/18.1.92.

DOI:10.1093/bioinformatics/18.1.92
PMID:11836216
Abstract

MOTIVATION

Orthologous proteins in different species are likely to have similar biochemical function and biological role. When annotating a newly sequenced genome by sequence homology, the most precise and reliable functional information can thus be derived from orthologs in other species. A standard method of finding orthologs is to compare the sequence tree with the species tree. However, since the topology of phylogenetic tree is not always reliable one might get incorrect assignments.

RESULTS

Here we present a novel method that resolves this problem by analyzing a set of bootstrap trees instead of the optimal tree. The frequency of orthology assignments in the bootstrap trees can be interpreted as a support value for the possible orthology of the sequences. Our method is efficient enough to analyze data in the scale of whole genomes. It is implemented in Java and calculates orthology support levels for all pairwise combinations of homologous sequences of two species. The method was tested on simulated datasets and on real data of homologous proteins.

摘要

动机

不同物种中的直系同源蛋白可能具有相似的生化功能和生物学作用。因此,当通过序列同源性对新测序的基因组进行注释时,最精确和可靠的功能信息可从其他物种的直系同源物中获得。寻找直系同源物的标准方法是将序列树与物种树进行比较。然而,由于系统发育树的拓扑结构并不总是可靠的,可能会得到错误的分配结果。

结果

在此我们提出一种新方法,该方法通过分析一组自引导树而非最优树来解决此问题。自引导树中直系同源分配的频率可解释为序列可能的直系同源性的支持值。我们的方法效率足够高,能够分析全基因组规模的数据。它用Java实现,并计算两个物种同源序列所有成对组合的直系同源支持水平。该方法在模拟数据集和同源蛋白的真实数据上进行了测试。

相似文献

1
Automated ortholog inference from phylogenetic trees and calculation of orthology reliability.基于系统发育树的自动直系同源物推断及直系同源性可靠性计算。
Bioinformatics. 2002 Jan;18(1):92-9. doi: 10.1093/bioinformatics/18.1.92.
2
Automatic clustering of orthologs and in-paralogs from pairwise species comparisons.通过成对物种比较对直系同源基因和旁系同源基因进行自动聚类。
J Mol Biol. 2001 Dec 14;314(5):1041-52. doi: 10.1006/jmbi.2000.5197.
3
RIO: analyzing proteomes by automated phylogenomics using resampled inference of orthologs.RIO:使用直系同源物的重采样推断通过自动化系统发育组学分析蛋白质组。
BMC Bioinformatics. 2002 May 16;3:14. doi: 10.1186/1471-2105-3-14.
4
Integrating Sequence Evolution into Probabilistic Orthology Analysis.将序列进化纳入概率同源分析。
Syst Biol. 2015 Nov;64(6):969-82. doi: 10.1093/sysbio/syv044. Epub 2015 Jun 30.
5
Assessment of phylogenomic and orthology approaches for phylogenetic inference.用于系统发育推断的系统发育基因组学和直系同源方法评估。
Bioinformatics. 2007 Apr 1;23(7):815-24. doi: 10.1093/bioinformatics/btm015. Epub 2007 Jan 19.
6
On the quality of tree-based protein classification.论基于树的蛋白质分类的质量。
Bioinformatics. 2005 May 1;21(9):1876-90. doi: 10.1093/bioinformatics/bti244. Epub 2005 Jan 12.
7
Orthology prediction at scalable resolution by phylogenetic tree analysis.通过系统发育树分析实现可扩展分辨率下的直系同源预测。
BMC Bioinformatics. 2007 Mar 8;8:83. doi: 10.1186/1471-2105-8-83.
8
An approach of orthology detection from homologous sequences under minimum evolution.一种在最小进化条件下从同源序列中进行直系同源检测的方法。
Nucleic Acids Res. 2008 Oct;36(17):e110. doi: 10.1093/nar/gkn485. Epub 2008 Aug 1.
9
Inferring Orthology and Paralogy.推断直系同源和旁系同源关系。
Methods Mol Biol. 2019;1910:149-175. doi: 10.1007/978-1-4939-9074-0_5.
10
Comprehensive analysis of orthologous protein domains using the HOPS database.使用HOPS数据库对直系同源蛋白结构域进行综合分析。
Genome Res. 2003 Oct;13(10):2353-62. doi: 10.1101/gr1305203.

引用本文的文献

1
Inferring Interaction Networks from Transcriptomic Data: Methods and Applications.从转录组数据推断相互作用网络:方法与应用。
Methods Mol Biol. 2024;2812:11-37. doi: 10.1007/978-1-0716-3886-6_2.
2
OrthoPhy: A Program to Construct Ortholog Data Sets Using Taxonomic Information.OrthoPhy:使用分类信息构建直系同源数据的程序。
Genome Biol Evol. 2023 Mar 3;15(3). doi: 10.1093/gbe/evad026.
3
Inter-paralog amino acid inversion events in large phylogenies of duplicated proteins.在复制蛋白质的大型系统发育中发生的基因间旁系同源氨基酸倒位事件。
PLoS Comput Biol. 2022 Apr 4;18(4):e1010016. doi: 10.1371/journal.pcbi.1010016. eCollection 2022 Apr.
4
An Overview of Duplicated Gene Detection Methods: Why the Duplication Mechanism Has to Be Accounted for in Their Choice.重复基因检测方法概述:选择重复基因检测方法时为何必须考虑重复机制。
Genes (Basel). 2020 Sep 4;11(9):1046. doi: 10.3390/genes11091046.
5
Best match graphs and reconciliation of gene trees with species trees.最佳匹配图和基因树与物种树的协调。
J Math Biol. 2020 Apr;80(5):1459-1495. doi: 10.1007/s00285-020-01469-y. Epub 2020 Jan 30.
6
Gene tree species tree reconciliation with gene conversion.基因树与物种树的基因转换校正。
J Math Biol. 2019 May;78(6):1981-2014. doi: 10.1007/s00285-019-01331-w. Epub 2019 Feb 15.
7
OrthoGNC: A Software for Accurate Identification of Orthologs Based on Gene Neighborhood Conservation.OrthoGNC:一款基于基因邻域保守性准确鉴定直系同源基因的软件。
Genomics Proteomics Bioinformatics. 2017 Dec;15(6):361-370. doi: 10.1016/j.gpb.2017.07.002. Epub 2017 Nov 11.
8
Substrate specificity characterization for eight putative nudix hydrolases. Evaluation of criteria for substrate identification within the Nudix family.八种假定的Nudix水解酶的底物特异性表征。Nudix家族内底物识别标准的评估。
Proteins. 2016 Dec;84(12):1810-1822. doi: 10.1002/prot.25163. Epub 2016 Oct 1.
9
Inferring Orthologs: Open Questions and Perspectives.推断直系同源基因:未解决的问题与展望
Genomics Insights. 2016 Feb 25;9:17-28. doi: 10.4137/GEI.S37925. eCollection 2016.
10
The PFP and ESG protein function prediction methods in 2014: effect of database updates and ensemble approaches.2014年的PFP和ESG蛋白质功能预测方法:数据库更新和集成方法的影响。
Gigascience. 2015 Sep 14;4:43. doi: 10.1186/s13742-015-0083-4. eCollection 2015.