• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种探索蛋白质结构全域的快速方法。

A rapid method for exploring the protein structure universe.

作者信息

Young M M, Skillman A G, Kuntz I D

机构信息

Department of Pharmaceutical Chemistry, University of California, San Francisco 94143-0446, USA.

出版信息

Proteins. 1999 Feb 15;34(3):317-32.

PMID:10024019
Abstract

We have developed an automatic protein fingerprinting method for the evaluation of protein structural similarities based on secondary structure element compositions, spatial arrangements, lengths, and topologies. This method can rapidly identify proteins sharing structural homologies as we demonstrate with five test cases: the globins, the mammalian trypsinlike serine proteases, the immunoglobulins, the cupredoxins, and the actinlike ATPase domain-containing proteins. Principal components analysis of the similarity distance matrix calculated from an all-by-all comparison of 1,031 unique chains in the Protein Data Bank has produced a distribution of structures within a high-dimensional structural space. Fifty percent of the variance observed for this distribution is bounded by six axes, two of which encode structural variability within two large families, the immunoglobulins and the trypsinlike serine proteases. Many aspects of the spatial distribution remain stable upon reduction of the database to 140 proteins with minimal family overlap. The axes correlated with specific structural families are no longer observed. A clear hierarchy of organization is seen in the arrangement of protein structures in the universe. At the highest level, protein structures populate regions corresponding to the all-alpha, all-beta, and alpha/beta superfamilies. Large protein families are arranged along family-specific axes, forming local densely populated regions within the space. The lowest level of organization is intrafamilial; homologous structures are ordered by variations in peripheral secondary structure elements or by conformational shifts in the tertiary structure.

摘要

我们开发了一种自动蛋白质指纹识别方法,用于基于二级结构元件组成、空间排列、长度和拓扑结构评估蛋白质结构相似性。正如我们通过五个测试案例所展示的那样,该方法可以快速识别具有结构同源性的蛋白质:球蛋白、哺乳动物类胰蛋白酶丝氨酸蛋白酶、免疫球蛋白、铜氧化还原蛋白以及含肌动蛋白样ATP酶结构域的蛋白质。对从蛋白质数据库中1031条独特链的全对全比较计算得出的相似性距离矩阵进行主成分分析,得出了高维结构空间内的结构分布。此分布中观察到的50%的方差由六个轴界定,其中两个轴编码两个大家族(免疫球蛋白和类胰蛋白酶丝氨酸蛋白酶)内的结构变异性。当将数据库缩减为家族重叠最小的140种蛋白质时,空间分布的许多方面仍保持稳定。与特定结构家族相关的轴不再出现。在宇宙中蛋白质结构的排列中可以看到明显的组织层次结构。在最高层次上,蛋白质结构分布在对应于全α、全β和α/β超家族的区域。大的蛋白质家族沿着家族特异性轴排列,在空间内形成局部密集区域。最低层次的组织是家族内部的;同源结构通过外围二级结构元件的变化或三级结构的构象变化进行排序。

相似文献

1
A rapid method for exploring the protein structure universe.一种探索蛋白质结构全域的快速方法。
Proteins. 1999 Feb 15;34(3):317-32.
2
An integrated approach to the analysis and modeling of protein sequences and structures. III. A comparative study of sequence conservation in protein structural families using multiple structural alignments.一种蛋白质序列与结构分析及建模的综合方法。III. 使用多重结构比对对蛋白质结构家族中的序列保守性进行比较研究。
J Mol Biol. 2000 Aug 18;301(3):691-711. doi: 10.1006/jmbi.2000.3975.
3
Structural diversity of domain superfamilies in the CATH database.CATH数据库中结构域超家族的结构多样性。
J Mol Biol. 2006 Jul 14;360(3):725-41. doi: 10.1016/j.jmb.2006.05.035. Epub 2006 Jun 2.
4
Discovery of local packing motifs in protein structures.蛋白质结构中局部堆积基序的发现。
Proteins. 1999 Feb 1;34(2):206-19.
5
An automatic method involving cluster analysis of secondary structures for the identification of domains in proteins.一种涉及对二级结构进行聚类分析以识别蛋白质结构域的自动化方法。
Protein Sci. 1995 Mar;4(3):506-20. doi: 10.1002/pro.5560040317.
6
A new method for identification of protein (sub)families in a set of proteins based on hydropathy distribution in proteins.一种基于蛋白质亲水性分布在一组蛋白质中鉴定蛋白质(亚)家族的新方法。
Proteins. 2005 Mar 1;58(4):923-34. doi: 10.1002/prot.20356.
7
Protein loops on structurally similar scaffolds: database and conformational analysis.结构相似支架上的蛋白质环:数据库与构象分析
Biopolymers. 1999 May;49(6):481-95. doi: 10.1002/(SICI)1097-0282(199905)49:6<481::AID-BIP6>3.0.CO;2-V.
8
MUSTANG: a multiple structural alignment algorithm.MUSTANG:一种多重结构比对算法。
Proteins. 2006 Aug 15;64(3):559-74. doi: 10.1002/prot.20921.
9
Identification of tertiary structure resemblance in proteins using a maximal common subgraph isomorphism algorithm.使用最大公共子图同构算法识别蛋白质中的三级结构相似性。
J Mol Biol. 1993 Feb 5;229(3):707-21. doi: 10.1006/jmbi.1993.1074.
10
The linked conservation of structure and function in a family of high diversity: the monomeric cupredoxins.一个高度多样化家族中结构与功能的关联保守性:单体铜氧化还原蛋白。
Structure. 2004 Jun;12(6):917-25. doi: 10.1016/j.str.2004.03.029.

引用本文的文献

1
The Structure Superposition Database.结构叠加数据库。
Nucleic Acids Res. 2003 Jan 1;31(1):505-10. doi: 10.1093/nar/gkg127.