• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

全局追踪图,一种搜索蛋白质序列数据库的新范式。

The global trace graph, a novel paradigm for searching protein sequence databases.

作者信息

Heger Andreas, Mallick Swapan, Wilton Christopher, Holm Liisa

机构信息

Institute of Biotechnology, P.O. Box 56 (Viikinkaari 5), FI-00014 University of Helsinki, Finland.

出版信息

Bioinformatics. 2007 Sep 15;23(18):2361-7. doi: 10.1093/bioinformatics/btm358. Epub 2007 Sep 6.

DOI:10.1093/bioinformatics/btm358
PMID:17823134
Abstract

MOTIVATION

Propagating functional annotations to sequence-similar, presumably homologous proteins lies at the heart of the bioinformatics industry. Correct propagation is crucially dependent on the accurate identification of subtle sequence motifs that are conserved in evolution. The evolutionary signal can be difficult to detect because functional sites may consist of non-contiguous residues while segments in-between may be mutated without affecting fold or function.

RESULTS

Here, we report a novel graph clustering algorithm in which all known protein sequences simultaneously self-organize into hypothetical multiple sequence alignments. This eliminates noise so that non-contiguous sequence motifs can be tracked down between extremely distant homologues. The novel data structure enables fast sequence database searching methods which are superior to profile-profile comparison at recognizing distant homologues. This study will boost the leverage of structural and functional genomics and opens up new avenues for data mining a complete set of functional signature motifs.

AVAILABILITY

http://www.bioinfo.biocenter.helsinki.fi/gtg.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

将功能注释传播到序列相似、可能同源的蛋白质是生物信息学行业的核心。正确的传播关键取决于对进化中保守的细微序列基序的准确识别。进化信号可能难以检测,因为功能位点可能由不连续的残基组成,而其间的片段可能发生突变而不影响折叠或功能。

结果

在此,我们报告了一种新颖的图聚类算法,其中所有已知蛋白质序列同时自组织成假设的多序列比对。这消除了噪声,从而可以在极其遥远的同源物之间追踪不连续的序列基序。这种新颖的数据结构实现了快速的序列数据库搜索方法,在识别遥远的同源物方面优于轮廓-轮廓比较。本研究将提高结构和功能基因组学的影响力,并为挖掘完整的功能特征基序集开辟新途径。

可用性

http://www.bioinfo.biocenter.helsinki.fi/gtg。

补充信息

补充数据可在《生物信息学》在线获取。

相似文献

1
The global trace graph, a novel paradigm for searching protein sequence databases.全局追踪图,一种搜索蛋白质序列数据库的新范式。
Bioinformatics. 2007 Sep 15;23(18):2361-7. doi: 10.1093/bioinformatics/btm358. Epub 2007 Sep 6.
2
Tracking repeats using significance and transitivity.利用显著性和传递性追踪重复序列。
Bioinformatics. 2004 Aug 4;20 Suppl 1:i311-7. doi: 10.1093/bioinformatics/bth911.
3
Blast sampling for structural and functional analyses.用于结构和功能分析的胚细胞采样。
BMC Bioinformatics. 2007 Feb 23;8:62. doi: 10.1186/1471-2105-8-62.
4
PROMALS: towards accurate multiple sequence alignments of distantly related proteins.PROMALS:用于实现远缘相关蛋白质准确多序列比对
Bioinformatics. 2007 Apr 1;23(7):802-8. doi: 10.1093/bioinformatics/btm017. Epub 2007 Jan 31.
5
Methods of remote homology detection can be combined to increase coverage by 10% in the midnight zone.远程同源性检测方法可以结合起来,使“午夜区”的覆盖率提高10%。
Bioinformatics. 2007 Sep 15;23(18):2353-60. doi: 10.1093/bioinformatics/btm355. Epub 2007 Aug 20.
6
Graph sharpening plus graph integration: a synergy that improves protein functional classification.图谱锐化加图谱整合:一种改善蛋白质功能分类的协同作用。
Bioinformatics. 2007 Dec 1;23(23):3217-24. doi: 10.1093/bioinformatics/btm511. Epub 2007 Oct 31.
7
Protein structural similarity search by Ramachandran codes.通过拉马钱德兰编码进行蛋白质结构相似性搜索。
BMC Bioinformatics. 2007 Aug 23;8:307. doi: 10.1186/1471-2105-8-307.
8
A new progressive-iterative algorithm for multiple structure alignment.一种用于多结构比对的新型渐进迭代算法。
Bioinformatics. 2005 Aug 1;21(15):3255-63. doi: 10.1093/bioinformatics/bti527. Epub 2005 Jun 7.
9
FORTE: a profile-profile comparison tool for protein fold recognition.FORTE:一种用于蛋白质折叠识别的轮廓-轮廓比较工具。
Bioinformatics. 2004 Mar 1;20(4):594-5. doi: 10.1093/bioinformatics/btg474. Epub 2004 Feb 5.
10
Bayesian search of functionally divergent protein subgroups and their function specific residues.功能趋异蛋白质亚组及其功能特异性残基的贝叶斯搜索
Bioinformatics. 2006 Oct 15;22(20):2466-74. doi: 10.1093/bioinformatics/btl411. Epub 2006 Jul 26.

引用本文的文献

1
Whole-genome metabolic model of built by comparative reconstruction.通过比较重建构建的全基因组代谢模型。 不过你提供的原文“Whole-genome metabolic model of built by comparative reconstruction.”似乎不完整,“of”后面缺少具体内容。
Biotechnol Biofuels. 2016 Nov 21;9:252. doi: 10.1186/s13068-016-0665-0. eCollection 2016.
2
Machine Learning of Protein Interactions in Fungal Secretory Pathways.真菌分泌途径中蛋白质相互作用的机器学习
PLoS One. 2016 Jul 21;11(7):e0159302. doi: 10.1371/journal.pone.0159302. eCollection 2016.
3
Protein contact prediction by integrating joint evolutionary coupling analysis and supervised learning.
基于联合进化耦合分析和监督学习的蛋白质接触预测。
Bioinformatics. 2015 Nov 1;31(21):3506-13. doi: 10.1093/bioinformatics/btv472. Epub 2015 Aug 14.
4
Structural determinants allowing transferase activity in SENSITIVE TO FREEZING 2, classified as a family I glycosyl hydrolase.允许在对冷冻敏感2中具有转移酶活性的结构决定因素,其被归类为I类糖基水解酶。
J Biol Chem. 2014 Sep 19;289(38):26089-26106. doi: 10.1074/jbc.M114.576694. Epub 2014 Aug 6.
5
Comparative genome-scale reconstruction of gapless metabolic networks for present and ancestral species.当前物种和祖先物种无间隙代谢网络的比较基因组规模重建。
PLoS Comput Biol. 2014 Feb 6;10(2):e1003465. doi: 10.1371/journal.pcbi.1003465. eCollection 2014 Feb.
6
Comprehensive comparison of graph based multiple protein sequence alignment strategies.基于图的多种蛋白质序列比对策略的综合比较。
BMC Bioinformatics. 2012 Apr 29;13:64. doi: 10.1186/1471-2105-13-64.
7
Detecting remote evolutionary relationships among proteins by large-scale semantic embedding.通过大规模语义嵌入检测蛋白质之间的远程进化关系。
PLoS Comput Biol. 2011 Jan 27;7(1):e1001047. doi: 10.1371/journal.pcbi.1001047.
8
Energetic profiling of protein folds.蛋白质折叠的能量分析
Methods Enzymol. 2009;455:299-327. doi: 10.1016/S0076-6879(08)04211-0.
9
Towards structured output prediction of enzyme function.迈向酶功能的结构化输出预测。
BMC Proc. 2008 Dec 17;2 Suppl 4(Suppl 4):S2. doi: 10.1186/1753-6561-2-s4-s2.
10
Searching protein structure databases with DaliLite v.3.使用DaliLite v.3搜索蛋白质结构数据库。
Bioinformatics. 2008 Dec 1;24(23):2780-1. doi: 10.1093/bioinformatics/btn507. Epub 2008 Sep 25.