• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MSOAR:一种基于基因组重排的高通量直系同源物分配系统。

MSOAR: a high-throughput ortholog assignment system based on genome rearrangement.

作者信息

Fu Zheng, Chen Xin, Vacic Vladimir, Nan Peng, Zhong Yang, Jiang Tao

机构信息

Department of Computer Science and Engineering, University of California, Riverside, California 92521, USA.

出版信息

J Comput Biol. 2007 Nov;14(9):1160-75. doi: 10.1089/cmb.2007.0048.

DOI:10.1089/cmb.2007.0048
PMID:17990975
Abstract

The assignment of orthologous genes between a pair of genomes is a fundamental and challenging problem in comparative genomics, since many computational methods for solving various biological problems critically rely on bona fide orthologs as input. While it is usually done using sequence similarity search, we recently proposed a new combinatorial approach that combines sequence similarity and genome rearrangement. This paper continues the development of the approach and unites genome rearrangement events and (post-speciation) duplication events in a single framework under the parsimony principle. In this framework, orthologous genes are assumed to correspond to each other in the most parsimonious evolutionary scenario involving both genome rearrangement and (post-speciation) gene duplication. Besides several original algorithmic contributions, the enhanced method allows for the detection of inparalogs. Following this approach, we have implemented a high-throughput system for ortholog assignment on a genome scale, called MSOAR, and applied it to human and mouse genomes. As the result will show, MSOAR is able to find 99 more true orthologs than the INPARANOID program did. In comparison to the iterated exemplar algorithm on simulated data, MSOAR performed favorably in terms of assignment accuracy. We also validated our predicted main ortholog pairs between human and mouse using public ortholog assignment datasets, synteny information, and gene function classification. These test results indicate that our approach is very promising for genome-wide ortholog assignment. Supplemental material and MSOAR program are available at http://msoar.cs.ucr.edu.

摘要

在一对基因组之间确定直系同源基因是比较基因组学中的一个基本且具有挑战性的问题,因为许多用于解决各种生物学问题的计算方法都严重依赖真正的直系同源基因作为输入。虽然通常是通过序列相似性搜索来完成,但我们最近提出了一种新的组合方法,该方法结合了序列相似性和基因组重排。本文继续该方法的开发,并在简约原则下将基因组重排事件和(物种形成后的)复制事件统一在一个单一框架中。在这个框架中,直系同源基因被假定在涉及基因组重排和(物种形成后的)基因复制的最简约进化场景中相互对应。除了一些原创的算法贡献外,增强后的方法还能够检测到旁系同源基因。遵循这种方法,我们在基因组规模上实现了一个用于直系同源基因确定的高通量系统,称为MSOAR,并将其应用于人类和小鼠基因组。结果将表明,MSOAR比INPARANOID程序能够多找到99个真正的直系同源基因。与在模拟数据上的迭代范例算法相比,MSOAR在确定准确性方面表现良好。我们还使用公共直系同源基因确定数据集、共线性信息和基因功能分类对我们预测的人类和小鼠之间的主要直系同源基因对进行了验证。这些测试结果表明,我们的方法在全基因组直系同源基因确定方面非常有前景。补充材料和MSOAR程序可在http://msoar.cs.ucr.edu获取。

相似文献

1
MSOAR: a high-throughput ortholog assignment system based on genome rearrangement.MSOAR:一种基于基因组重排的高通量直系同源物分配系统。
J Comput Biol. 2007 Nov;14(9):1160-75. doi: 10.1089/cmb.2007.0048.
2
Assignment of orthologous genes via genome rearrangement.通过基因组重排进行直系同源基因的分配。
IEEE/ACM Trans Comput Biol Bioinform. 2005 Oct-Dec;2(4):302-15. doi: 10.1109/TCBB.2005.48.
3
MSOAR 2.0: Incorporating tandem duplications into ortholog assignment based on genome rearrangement.MSOAR 2.0:基于基因组重排的串联重复整合到直系同源物分配中。
BMC Bioinformatics. 2010 Jan 6;11:10. doi: 10.1186/1471-2105-11-10.
4
Clustering of main orthologs for multiple genomes.多个基因组主要直系同源基因的聚类
Comput Syst Bioinformatics Conf. 2007;6:195-201.
5
Clustering of main orthologs for multiple genomes.多个基因组主要直系同源基因的聚类
J Bioinform Comput Biol. 2008 Jun;6(3):573-84. doi: 10.1142/s0219720008003540.
6
MultiMSOAR 2.0: an accurate tool to identify ortholog groups among multiple genomes.MultiMSOAR 2.0:一种用于在多个基因组中识别直系同源物的精确工具。
PLoS One. 2011;6(6):e20892. doi: 10.1371/journal.pone.0020892. Epub 2011 Jun 21.
7
Automatic clustering of orthologs and in-paralogs from pairwise species comparisons.通过成对物种比较对直系同源基因和旁系同源基因进行自动聚类。
J Mol Biol. 2001 Dec 14;314(5):1041-52. doi: 10.1006/jmbi.2000.5197.
8
Accurate identification of orthologous segments among multiple genomes.准确识别多个基因组之间的直系同源片段。
Bioinformatics. 2009 Apr 1;25(7):853-60. doi: 10.1093/bioinformatics/btp070. Epub 2009 Feb 2.
9
Improving the specificity of high-throughput ortholog prediction.提高高通量直系同源物预测的特异性。
BMC Bioinformatics. 2006 May 28;7:270. doi: 10.1186/1471-2105-7-270.
10
Using shared genomic synteny and shared protein functions to enhance the identification of orthologous gene pairs.利用共享基因组同线性和共享蛋白质功能来加强直系同源基因对的识别。
Bioinformatics. 2005 Mar;21(6):703-10. doi: 10.1093/bioinformatics/bti045. Epub 2004 Sep 30.

引用本文的文献

1
On the parameterized complexity of the median and closest problems under some permutation metrics.关于某些排列度量下中位数和最接近问题的参数化复杂度
Algorithms Mol Biol. 2024 Dec 24;19(1):24. doi: 10.1186/s13015-024-00269-z.
2
An Exact and Fast SAT Formulation for the DCJ Distance.一种用于DCJ距离的精确且快速的SAT公式化方法。
bioRxiv. 2024 Nov 8:2024.11.05.622153. doi: 10.1101/2024.11.05.622153.
3
Genome Rearrangement Analysis : Cut and Join Genome Rearrangements and Gene Cluster Preserving Approaches.基因组重排分析:切割和连接基因组重排及基因簇保护方法。
Methods Mol Biol. 2024;2802:215-245. doi: 10.1007/978-1-0716-3838-5_9.
4
Primary orthologs from local sequence context.来自本地序列上下文的直系同源物。
BMC Bioinformatics. 2020 Feb 6;21(1):48. doi: 10.1186/s12859-020-3384-2.
5
Aequatus: an open-source homology browser.Aequatus:一个开源同源浏览器。
Gigascience. 2018 Nov 1;7(11):giy128. doi: 10.1093/gigascience/giy128.
6
Genome-Guided Phylo-Transcriptomic Methods and the Nuclear Phylogentic Tree of the Paniceae Grasses.基因组指导的系统发育转录组学方法及禾本科 Paniceae 族的核系统发育树。
Sci Rep. 2017 Oct 19;7(1):13528. doi: 10.1038/s41598-017-13236-z.
7
Orthonome - a new pipeline for predicting high quality orthologue gene sets applicable to complete and draft genomes.Orthonome——一种用于预测适用于完整基因组和草图基因组的高质量直系同源基因集的新流程。
BMC Genomics. 2017 Aug 31;18(1):673. doi: 10.1186/s12864-017-4079-6.
8
GenFamClust: an accurate, synteny-aware and reliable homology inference algorithm.GenFamClust:一种准确、具有共线性意识且可靠的同源性推断算法。
BMC Evol Biol. 2016 Jun 4;16(1):120. doi: 10.1186/s12862-016-0684-2.
9
An Effective Big Data Supervised Imbalanced Classification Approach for Ortholog Detection in Related Yeast Species.一种用于相关酵母物种直系同源物检测的有效大数据监督不平衡分类方法。
Biomed Res Int. 2015;2015:748681. doi: 10.1155/2015/748681. Epub 2015 Oct 29.
10
Comparing genomes with rearrangements and segmental duplications.比较带有重排和片段重复的基因组。
Bioinformatics. 2015 Jun 15;31(12):i329-38. doi: 10.1093/bioinformatics/btv229.