• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用蛋白质结构分类进行远距离同源性识别。

Distant homology recognition using structural classification of proteins.

作者信息

Murzin A G, Bateman A

机构信息

Center for Protein Engineering, MRC Center, Cambridge, United Kingdom.

出版信息

Proteins. 1997;Suppl 1:105-12. doi: 10.1002/(sici)1097-0134(1997)1+<105::aid-prot14>3.3.co;2-1.

DOI:10.1002/(sici)1097-0134(1997)1+<105::aid-prot14>3.3.co;2-1
PMID:9485501
Abstract

Protein structure prediction is arguably the biggest unsolved problem of structural biology. The notion of the number of naturally occurring different protein folds being limited allows partial solution of this problem by the use of fold recognition methods, which "thread" the sequence in question through a library of known protein folds. The fold recognition methods were thought to be superior to the distant homology recognition methods when there is no significant sequence similarity to known structures. We show here that the Structural Classification of Proteins (SCOP) database, organizing all known protein folds according their structural and evolutionary relationships, can be effectively used to enhance the sensitivity of the distant homology recognition methods to rival the "threading" methods. In the CASP2 experiment, our approach correctly assigned into existing SCOP superfamilies all of the six "fold recognition" targets we attempted. For each of the six targets, we correctly predicted the homologous protein with a very similar structure; often, it was the most similar structure. We correctly predicted local alignments of the sequence features that we found to be characteristic for the protein superfamily containing a given target. Our global alignments, extended manually from these local alignments, also appeared to be rather accurate.

摘要

蛋白质结构预测可以说是结构生物学中最大的未解决问题。天然存在的不同蛋白质折叠数量有限这一概念使得通过使用折叠识别方法来部分解决这个问题成为可能,这些方法将所讨论的序列“穿线”通过已知蛋白质折叠的库。当与已知结构没有显著的序列相似性时,折叠识别方法被认为优于远源同源识别方法。我们在此表明,蛋白质结构分类(SCOP)数据库根据其结构和进化关系组织所有已知蛋白质折叠,可以有效地用于提高远源同源识别方法的灵敏度,以与“穿线”方法相媲美。在CASP2实验中,我们的方法将我们尝试的六个“折叠识别”目标全部正确地归入现有的SCOP超家族。对于这六个目标中的每一个,我们都正确地预测了具有非常相似结构的同源蛋白质;通常,它是最相似的结构。我们正确地预测了我们发现对于包含给定目标的蛋白质超家族具有特征性的序列特征的局部比对。我们从这些局部比对手动扩展得到的全局比对似乎也相当准确。

相似文献

1
Distant homology recognition using structural classification of proteins.利用蛋白质结构分类进行远距离同源性识别。
Proteins. 1997;Suppl 1:105-12. doi: 10.1002/(sici)1097-0134(1997)1+<105::aid-prot14>3.3.co;2-1.
2
Protein folds from pair interactions: a blind test in fold recognition.基于成对相互作用的蛋白质折叠:折叠识别中的一项盲测。
Proteins. 1997;Suppl 1:129-33. doi: 10.1002/(sici)1097-0134(1997)1+<129::aid-prot17>3.3.co;2-f.
3
Fold recognition using predicted secondary structure sequences and hidden Markov models of protein folds.利用预测的二级结构序列和蛋白质折叠的隐马尔可夫模型进行折叠识别。
Proteins. 1997;Suppl 1:123-8. doi: 10.1002/(sici)1097-0134(1997)1+<123::aid-prot16>3.3.co;2-#.
4
CASP2 knowledge-based approach to distant homology recognition and fold prediction in CASP4.在CASP4中基于CASP2知识的远程同源性识别和折叠预测方法。
Proteins. 2001;Suppl 5:76-85. doi: 10.1002/prot.10037.
5
PASS2: an automated database of protein alignments organised as structural superfamilies.PASS2:一个以结构超家族形式组织的蛋白质比对自动化数据库。
BMC Bioinformatics. 2004 Apr 2;5:35. doi: 10.1186/1471-2105-5-35.
6
Protein structure prediction by threading methods: evaluation of current techniques.基于穿线法的蛋白质结构预测:当前技术评估
Proteins. 1995 Nov;23(3):337-55. doi: 10.1002/prot.340230308.
7
Recognition of analogous and homologous protein folds: analysis of sequence and structure conservation.相似和同源蛋白质折叠的识别:序列和结构保守性分析
J Mol Biol. 1997 Jun 13;269(3):423-39. doi: 10.1006/jmbi.1997.1019.
8
Automatic classification of protein structures using low-dimensional structure space mappings.利用低维结构空间映射对蛋白质结构进行自动分类。
BMC Bioinformatics. 2014;15 Suppl 2(Suppl 2):S1. doi: 10.1186/1471-2105-15-S2-S1. Epub 2014 Jan 24.
9
SVM-Fold: a tool for discriminative multi-class protein fold and superfamily recognition.支持向量机折叠法:一种用于判别式多类别蛋白质折叠和超家族识别的工具。
BMC Bioinformatics. 2007 May 22;8 Suppl 4(Suppl 4):S2. doi: 10.1186/1471-2105-8-S4-S2.
10
SCOP: a Structural Classification of Proteins database.SCOP:蛋白质结构分类数据库。
Nucleic Acids Res. 1999 Jan 1;27(1):254-6. doi: 10.1093/nar/27.1.254.

引用本文的文献

1
The Urfold: Structural similarity just above the superfold level?《展开:超级折叠水平之上的结构相似性?》
Protein Sci. 2019 Dec;28(12):2119-2126. doi: 10.1002/pro.3742. Epub 2019 Nov 6.
2
Genome-Level Analysis of Selective Constraint without Apparent Sequence Conservation.无明显序列保守性的选择约束的全基因组分析。
Genome Biol Evol. 2013;5(3):532-41. doi: 10.1093/gbe/evt023.
3
Bacterial pleckstrin homology domains: a prokaryotic origin for the PH domain.细菌 Pleckstrin homology 结构域:PH 结构域的原核起源。
J Mol Biol. 2010 Feb 12;396(1):31-46. doi: 10.1016/j.jmb.2009.11.006. Epub 2009 Nov 10.
4
Fold recognition by combining sequence profiles derived from evolution and from depth-dependent structural alignment of fragments.通过结合从进化中获得的序列概况和片段的深度依赖结构比对来进行折叠识别。
Proteins. 2005 Feb 1;58(2):321-8. doi: 10.1002/prot.20308.
5
Enhanced functional and structural domain assignments using remote similarity detection procedures for proteins encoded in the genome of Mycobacterium tuberculosis H37Rv.利用远程相似性检测程序对结核分枝杆菌H37Rv基因组中编码的蛋白质进行增强的功能和结构域分配。
J Biosci. 2004 Sep;29(3):245-59. doi: 10.1007/BF02702607.
6
An approach to large scale identification of non-obvious structural similarities between proteins.一种大规模识别蛋白质之间非明显结构相似性的方法。
BMC Bioinformatics. 2004 May 17;5:61. doi: 10.1186/1471-2105-5-61.
7
Structural characterization of genomes by large scale sequence-structure threading.通过大规模序列-结构穿线法对基因组进行结构表征。
BMC Bioinformatics. 2004 Apr 3;5:37. doi: 10.1186/1471-2105-5-37.
8
PROTINFO: Secondary and tertiary protein structure prediction.蛋白质信息:二级和三级蛋白质结构预测。
Nucleic Acids Res. 2003 Jul 1;31(13):3296-9. doi: 10.1093/nar/gkg541.
9
Molecular analysis of the multiple GroEL proteins of Chlamydiae.衣原体多种热休克蛋白60的分子分析。
J Bacteriol. 2003 Mar;185(6):1958-66. doi: 10.1128/JB.185.6.1958-1966.2003.
10
A comprehensive analysis of 40 blind protein structure predictions.40个盲法蛋白质结构预测的综合分析。
BMC Struct Biol. 2002 Aug 1;2:3. doi: 10.1186/1472-6807-2-3.