• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

序列相似性和功能关联信息的整合可以克服跨细菌基因组进行同源映射的固有问题。

Integration of sequence-similarity and functional association information can overcome intrinsic problems in orthology mapping across bacterial genomes.

机构信息

Department of Biochemistry and Molecular Biology and Institute of Bioinformatics, Computational Systems Biology Laboratory, University of Georgia, Athens, GA 30602, USA.

出版信息

Nucleic Acids Res. 2011 Dec;39(22):e150. doi: 10.1093/nar/gkr766. Epub 2011 Sep 29.

DOI:10.1093/nar/gkr766
PMID:21965536
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3239196/
Abstract

Existing methods for orthologous gene mapping suffer from two general problems: (i) they are computationally too slow and their results are difficult to interpret for automated large-scale applications when based on phylogenetic analyses; or (ii) they are too prone to making mistakes in dealing with complex situations involving horizontal gene transfers and gene fusion due to the lack of a sound basis when based on sequence similarity information. We present a novel algorithm, Global Optimization Strategy (GOST), for orthologous gene mapping through combining sequence similarity and contextual (working partners) information, using a combinatorial optimization framework. Genome-scale applications of GOST show substantial improvements over the predictions by three popular sequence similarity-based orthology mapping programs. Our analysis indicates that our algorithm overcomes the intrinsic issues faced by sequence similarity-based methods, when orthology mapping involves gene fusions and horizontal gene transfers. Our program runs as efficiently as the most efficient sequence similarity-based algorithm in the public domain. GOST is freely downloadable at http://csbl.bmb.uga.edu/~maqin/GOST.

摘要

现有的直系同源基因映射方法存在两个普遍问题

(i) 基于系统发生分析的计算速度太慢,并且结果难以解释,不适合自动化的大规模应用;或 (ii) 基于序列相似性信息时,由于缺乏合理的基础,在处理涉及水平基因转移和基因融合的复杂情况时,很容易出错。我们提出了一种新的算法,全局优化策略 (GOST),通过结合序列相似性和上下文(工作伙伴)信息,使用组合优化框架进行直系同源基因映射。GOST 的基因组规模应用程序在基于三个流行的序列相似性直系同源映射程序的预测方面取得了显著的改进。我们的分析表明,当直系同源映射涉及基因融合和水平基因转移时,我们的算法克服了基于序列相似性方法所面临的内在问题。我们的程序的运行效率与公共领域中最有效的基于序列相似性的算法一样高效。GOST 可在 http://csbl.bmb.uga.edu/~maqin/GOST 免费下载。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2bb/3239196/3034cac74e9f/gkr766f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2bb/3239196/1742f7d944bd/gkr766f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2bb/3239196/6087e4906e60/gkr766f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2bb/3239196/680baf61a02d/gkr766f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2bb/3239196/f5a8af316840/gkr766f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2bb/3239196/3034cac74e9f/gkr766f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2bb/3239196/1742f7d944bd/gkr766f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2bb/3239196/6087e4906e60/gkr766f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2bb/3239196/680baf61a02d/gkr766f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2bb/3239196/f5a8af316840/gkr766f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2bb/3239196/3034cac74e9f/gkr766f5.jpg

相似文献

1
Integration of sequence-similarity and functional association information can overcome intrinsic problems in orthology mapping across bacterial genomes.序列相似性和功能关联信息的整合可以克服跨细菌基因组进行同源映射的固有问题。
Nucleic Acids Res. 2011 Dec;39(22):e150. doi: 10.1093/nar/gkr766. Epub 2011 Sep 29.
2
Mapping of orthologous genes in the context of biological pathways: An application of integer programming.生物途径背景下直系同源基因的映射:整数规划的一个应用
Proc Natl Acad Sci U S A. 2006 Jan 3;103(1):129-34. doi: 10.1073/pnas.0509737102. Epub 2005 Dec 22.
3
OrthologID: automation of genome-scale ortholog identification within a parsimony framework.直系同源物ID:简约框架内全基因组规模直系同源物鉴定的自动化
Bioinformatics. 2006 Mar 15;22(6):699-707. doi: 10.1093/bioinformatics/btk040. Epub 2006 Jan 12.
4
SEAS: a system for SEED-based pathway enrichment analysis.SEAS:基于 SEED 的通路富集分析系统。
PLoS One. 2011;6(7):e22556. doi: 10.1371/journal.pone.0022556. Epub 2011 Jul 22.
5
Clustering of main orthologs for multiple genomes.多个基因组主要直系同源基因的聚类
Comput Syst Bioinformatics Conf. 2007;6:195-201.
6
A universal operon predictor for prokaryotic genomes.一种用于原核生物基因组的通用操纵子预测工具。
J Bioinform Comput Biol. 2009 Feb;7(1):19-38. doi: 10.1142/s0219720009003984.
7
Alignment-free detection of local similarity among viral and bacterial genomes.基于比对的病毒和细菌基因组之间局部相似性的检测。
Bioinformatics. 2011 Jun 1;27(11):1466-72. doi: 10.1093/bioinformatics/btr176. Epub 2011 Apr 6.
8
A database of phylogenetically atypical genes in archaeal and bacterial genomes, identified using the DarkHorse algorithm.一个使用黑马算法识别出的古菌和细菌基因组中系统发育非典型基因的数据库。
BMC Bioinformatics. 2008 Oct 7;9:419. doi: 10.1186/1471-2105-9-419.
9
OrthoFocus: program for identification of orthologs in multiple genomes in family-focused studies.OrthoFocus:用于在以家族为重点的研究中识别多个基因组直系同源基因的程序。
J Bioinform Comput Biol. 2008 Aug;6(4):811-24. doi: 10.1142/s0219720008003692.
10
Clustering of main orthologs for multiple genomes.多个基因组主要直系同源基因的聚类
J Bioinform Comput Biol. 2008 Jun;6(3):573-84. doi: 10.1142/s0219720008003540.

引用本文的文献

1
RECTA: Regulon Identification Based on Comparative Genomics and Transcriptomics Analysis.RECTA:基于比较基因组学和转录组学分析的调控子识别
Genes (Basel). 2018 May 30;9(6):278. doi: 10.3390/genes9060278.
2
An integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes.一种用于原核生物基因组中顺式调控基序识别的综合且适用的系统发育足迹分析框架。
BMC Genomics. 2016 Aug 9;17:578. doi: 10.1186/s12864-016-2982-x.
3
Bacterial regulon modeling and prediction based on systematic cis regulatory motif analyses.

本文引用的文献

1
QuartetS: a fast and accurate algorithm for large-scale orthology detection. QuartetS:一种用于大规模同源检测的快速准确算法。
Nucleic Acids Res. 2011 Jul;39(13):e88. doi: 10.1093/nar/gkr308. Epub 2011 May 13.
2
CDD: a Conserved Domain Database for the functional annotation of proteins.CDD:一个用于蛋白质功能注释的保守结构域数据库。
Nucleic Acids Res. 2011 Jan;39(Database issue):D225-9. doi: 10.1093/nar/gkq1189. Epub 2010 Nov 24.
3
EGM: encapsulated gene-by-gene matching to identify gene orthologs and homologous segments in genomes.
基于系统顺式调控基序分析的细菌调控子建模与预测
Sci Rep. 2016 Mar 15;6:23030. doi: 10.1038/srep23030.
4
Revisiting operons: an analysis of the landscape of transcriptional units in E. coli.重新审视操纵子:大肠杆菌转录单元格局分析
BMC Bioinformatics. 2015 Nov 4;16:356. doi: 10.1186/s12859-015-0805-8.
5
Elucidation of operon structures across closely related bacterial genomes.解析密切相关细菌基因组中的操纵子结构。
PLoS One. 2014 Jun 24;9(6):e100999. doi: 10.1371/journal.pone.0100999. eCollection 2014.
6
DOOR 2.0: presenting operons and their functions through dynamic and integrated views.DOOR 2.0:通过动态和集成的视图呈现操纵子及其功能。
Nucleic Acids Res. 2014 Jan;42(Database issue):D654-9. doi: 10.1093/nar/gkt1048. Epub 2013 Nov 7.
EGM:包裹基因对基因匹配,以鉴定基因组中的基因直系同源物和同源片段。
Bioinformatics. 2010 Sep 1;26(17):2076-84. doi: 10.1093/bioinformatics/btq339. Epub 2010 Jun 27.
4
Phylogenetic and functional assessment of orthologs inference projects and methods.直系同源物推断项目和方法的系统发育与功能评估。
PLoS Comput Biol. 2009 Jan;5(1):e1000262. doi: 10.1371/journal.pcbi.1000262. Epub 2009 Jan 16.
5
DOOR: a database for prokaryotic operons.DOOR:一个原核生物操纵子数据库。
Nucleic Acids Res. 2009 Jan;37(Database issue):D459-63. doi: 10.1093/nar/gkn757. Epub 2008 Nov 6.
6
An approach of orthology detection from homologous sequences under minimum evolution.一种在最小进化条件下从同源序列中进行直系同源检测的方法。
Nucleic Acids Res. 2008 Oct;36(17):e110. doi: 10.1093/nar/gkn485. Epub 2008 Aug 1.
7
Horizontal gene transfer in eukaryotic evolution.真核生物进化中的水平基因转移
Nat Rev Genet. 2008 Aug;9(8):605-18. doi: 10.1038/nrg2386.
8
Choosing BLAST options for better detection of orthologs as reciprocal best hits.选择BLAST选项以更好地将直系同源物检测为相互最佳匹配。
Bioinformatics. 2008 Feb 1;24(3):319-24. doi: 10.1093/bioinformatics/btm585. Epub 2007 Nov 26.
9
Ortholog detection using the reciprocal smallest distance algorithm.使用互反最小距离算法进行直系同源物检测。
Methods Mol Biol. 2007;396:95-110. doi: 10.1007/978-1-59745-515-2_7.
10
Assessing performance of orthology detection strategies applied to eukaryotic genomes.评估应用于真核生物基因组的直系同源检测策略的性能。
PLoS One. 2007 Apr 18;2(4):e383. doi: 10.1371/journal.pone.0000383.