• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

与连接序列空间的结构相似性:新的潜在超家族及其对结构基因组学的意义。

Structural similarity to link sequence space: new potential superfamilies and implications for structural genomics.

作者信息

Aloy Patrick, Oliva Baldomero, Querol Enrique, Aviles Francesc X, Russell Robert B

机构信息

EMBL, Biocomputing, Meyerhofstrasse 1, D-69117 Heidelberg, Germany.

出版信息

Protein Sci. 2002 May;11(5):1101-16. doi: 10.1110/ps.3950102.

DOI:10.1110/ps.3950102
PMID:11967367
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2373547/
Abstract

The current pace of structural biology now means that protein three-dimensional structure can be known before protein function, making methods for assigning homology via structure comparison of growing importance. Previous research has suggested that sequence similarity after structure-based alignment is one of the best discriminators of homology and often functional similarity. Here, we exploit this observation, together with a merger of protein structure and sequence databases, to predict distant homologous relationships. We use the Structural Classification of Proteins (SCOP) database to link sequence alignments from the SMART and Pfam databases. We thus provide new alignments that could not be constructed easily in the absence of known three-dimensional structures. We then extend the method of Murzin (1993b) to assign statistical significance to sequence identities found after structural alignment and thus suggest the best link between diverse sequence families. We find that several distantly related protein sequence families can be linked with confidence, showing the approach to be a means for inferring homologous relationships and thus possible functions when proteins are of known structure but of unknown function. The analysis also finds several new potential superfamilies, where inspection of the associated alignments and superimpositions reveals conservation of unusual structural features or co-location of conserved amino acids and bound substrates. We discuss implications for Structural Genomics initiatives and for improvements to sequence comparison methods.

摘要

当前结构生物学的发展速度意味着在了解蛋白质功能之前就能够知晓其三维结构,这使得通过结构比较来确定同源性的方法变得愈发重要。先前的研究表明,基于结构比对后的序列相似性是同源性以及通常功能相似性的最佳判别指标之一。在此,我们利用这一观察结果,并结合蛋白质结构与序列数据库的合并,来预测远缘同源关系。我们使用蛋白质结构分类(SCOP)数据库将来自SMART和Pfam数据库的序列比对进行关联。由此,我们提供了在缺乏已知三维结构的情况下难以轻易构建的新比对。然后,我们扩展了Murzin(1993b)的方法,为结构比对后发现的序列同一性赋予统计学意义,从而确定不同序列家族之间的最佳关联。我们发现几个远缘相关的蛋白质序列家族能够被可靠地关联起来,这表明该方法是在蛋白质结构已知但功能未知时推断同源关系以及可能功能的一种手段。分析还发现了几个新的潜在超家族,对相关比对和叠加的检查揭示了异常结构特征的保守性或保守氨基酸与结合底物的共定位。我们讨论了对结构基因组计划的影响以及对序列比较方法改进的意义。

相似文献

1
Structural similarity to link sequence space: new potential superfamilies and implications for structural genomics.与连接序列空间的结构相似性:新的潜在超家族及其对结构基因组学的意义。
Protein Sci. 2002 May;11(5):1101-16. doi: 10.1110/ps.3950102.
2
SUPFAM--a database of potential protein superfamily relationships derived by comparing sequence-based and structure-based families: implications for structural genomics and function annotation in genomes.SUPFAM——一个通过比较基于序列和基于结构的家族而得出的潜在蛋白质超家族关系数据库:对结构基因组学和基因组功能注释的意义。
Nucleic Acids Res. 2002 Jan 1;30(1):289-93. doi: 10.1093/nar/30.1.289.
3
PASS2: an automated database of protein alignments organised as structural superfamilies.PASS2:一个以结构超家族形式组织的蛋白质比对自动化数据库。
BMC Bioinformatics. 2004 Apr 2;5:35. doi: 10.1186/1471-2105-5-35.
4
SUPFAM: a database of sequence superfamilies of protein domains.SUPFAM:一个蛋白质结构域序列超家族数据库。
BMC Bioinformatics. 2004 Mar 15;5:28. doi: 10.1186/1471-2105-5-28.
5
PASS2 version 6: a database of structure-based sequence alignments of protein domain superfamilies in accordance with SCOPe.PASS2 版本 6:根据 SCOPe 构建的蛋白质结构域超家族结构比对数据库。
Database (Oxford). 2019 Jan 1;2019. doi: 10.1093/database/baz028.
6
PASS2: a semi-automated database of protein alignments organised as structural superfamilies.PASS2:一个半自动化的蛋白质比对数据库,按结构超家族组织。
Nucleic Acids Res. 2002 Jan 1;30(1):284-8. doi: 10.1093/nar/30.1.284.
7
GenDiS database update with improved approach and features to recognize homologous sequences of protein domain superfamilies.GenDiS 数据库更新,采用改进的方法和功能来识别蛋白质结构域超家族的同源序列。
Database (Oxford). 2019 Jan 1;2019. doi: 10.1093/database/baz042.
8
De-DUFing the DUFs: Deciphering distant evolutionary relationships of Domains of Unknown Function using sensitive homology detection methods.去除未知功能结构域中的冗余:使用灵敏的同源性检测方法解析未知功能结构域的远缘进化关系。
Biol Direct. 2015 Jul 31;10:38. doi: 10.1186/s13062-015-0069-2.
9
A comparison of sequence and structure protein domain families as a basis for structural genomics.作为结构基因组学基础的序列与结构蛋白质结构域家族比较。
Bioinformatics. 1999 Jun;15(6):480-500. doi: 10.1093/bioinformatics/15.6.480.
10
Use of a database of structural alignments and phylogenetic trees in investigating the relationship between sequence and structural variability among homologous proteins.利用结构比对和系统发育树数据库研究同源蛋白质序列与结构变异性之间的关系。
Protein Eng. 2001 Apr;14(4):219-26. doi: 10.1093/protein/14.4.219.

引用本文的文献

1
Prediction of a new class of RNA recognition motif.预测一类新的 RNA 识别基序。
J Mol Model. 2011 Aug;17(8):1863-75. doi: 10.1007/s00894-010-0888-0. Epub 2010 Nov 17.
2
Protein-protein interaction hotspots carved into sequences.蛋白质-蛋白质相互作用热点嵌入序列之中。
PLoS Comput Biol. 2007 Jul;3(7):e119. doi: 10.1371/journal.pcbi.0030119.
3
Structural similarity to bridge sequence space: finding new families on the bridges.与桥接序列空间的结构相似性:在桥梁上发现新家族。
Protein Sci. 2005 May;14(5):1305-14. doi: 10.1110/ps.041187405.
4
SUPFAM: a database of sequence superfamilies of protein domains.SUPFAM:一个蛋白质结构域序列超家族数据库。
BMC Bioinformatics. 2004 Mar 15;5:28. doi: 10.1186/1471-2105-5-28.
5
Profile-profile comparisons by COMPASS predict intricate homologies between protein families.COMPASS进行的轮廓-轮廓比较预测了蛋白质家族之间复杂的同源性。
Protein Sci. 2003 Oct;12(10):2262-72. doi: 10.1110/ps.03197403.

本文引用的文献

1
Identification of homology in protein structure classification.蛋白质结构分类中同源性的鉴定。
Nat Struct Biol. 2001 Nov;8(11):953-7. doi: 10.1038/nsb1101-953.
2
Automated structure-based prediction of functional sites in proteins: applications to assessing the validity of inheriting protein function from homology in genome annotation and to protein docking.基于结构的蛋白质功能位点自动预测:应用于评估基因组注释中同源蛋白功能继承的有效性及蛋白质对接。
J Mol Biol. 2001 Aug 10;311(2):395-408. doi: 10.1006/jmbi.2001.4870.
3
Three-dimensional cluster analysis identifies interfaces and functional residue clusters in proteins.三维聚类分析可识别蛋白质中的界面和功能残基簇。
J Mol Biol. 2001 Apr 13;307(5):1487-502. doi: 10.1006/jmbi.2001.4540.
4
Regulatory potential, phyletic distribution and evolution of ancient, intracellular small-molecule-binding domains.古老的细胞内小分子结合结构域的调控潜力、系统发育分布及进化
J Mol Biol. 2001 Apr 13;307(5):1271-92. doi: 10.1006/jmbi.2001.4508.
5
Evolution of function in protein superfamilies, from a structural perspective.从结构角度看蛋白质超家族中功能的演变。
J Mol Biol. 2001 Apr 6;307(4):1113-43. doi: 10.1006/jmbi.2001.4513.
6
Consistency analysis of similarity between multiple alignments: prediction of protein function and fold structure from analysis of local sequence motifs.多序列比对相似性的一致性分析:通过局部序列基序分析预测蛋白质功能和折叠结构。
J Mol Biol. 2001 Mar 30;307(3):939-49. doi: 10.1006/jmbi.2001.4466.
7
Identification of distant homologues of fibroblast growth factors suggests a common ancestor for all beta-trefoil proteins.成纤维细胞生长因子远源同源物的鉴定表明,所有β-三叶蛋白都有一个共同的祖先。
J Mol Biol. 2000 Oct 6;302(5):1041-7. doi: 10.1006/jmbi.2000.4087.
8
Homology among (betaalpha)(8) barrels: implications for the evolution of metabolic pathways.(β-α)8桶之间的同源性:对代谢途径进化的影响
J Mol Biol. 2000 Nov 3;303(4):627-41. doi: 10.1006/jmbi.2000.4152.
9
Structural proteomics of an archaeon.古生菌的结构蛋白质组学
Nat Struct Biol. 2000 Oct;7(10):903-9. doi: 10.1038/82823.
10
Protein function in the post-genomic era.后基因组时代的蛋白质功能。
Nature. 2000 Jun 15;405(6788):823-6. doi: 10.1038/35015694.