
CATH--a hierarchic classification of protein domain structures.

Authors

Orengo C A, Michie A D, Jones S, Jones D T, Swindells M B, Thornton J M

Affiliation

Department of Biochemistry and Molecular Biology, University College London, UK.

Publication

Structure. 1997 Aug 15;5(8):1093-108. doi: 10.1016/s0969-2126(97)00260-8.

DOI: 10.1016/s0969-2126(97)00260-8
PMID: 9309224

Abstract

BACKGROUND

Protein evolution gives rise to families of structurally related proteins, within which sequence identities can be extremely low. As a result, structure-based classifications can be effective at identifying unanticipated relationships in known structures and in optimal cases function can also be assigned. The ever increasing number of known protein structures is too large to classify all proteins manually, therefore, automatic methods are needed for fast evaluation of protein structures.

RESULTS

We present a semi-automatic procedure for deriving a novel hierarchical classification of protein domain structures (CATH). The four main levels of our classification are protein class (C), architecture (A), topology (T) and homologous superfamily (H). Class is the simplest level, and it essentially describes the secondary structure composition of each domain. In contrast, architecture summarises the shape revealed by the orientations of the secondary structure units, such as barrels and sandwiches. At the topology level, sequential connectivity is considered, such that members of the same architecture might have quite different topologies. When structures belonging to the same T-level have suitably high similarities combined with similar functions, the proteins are assumed to be evolutionarily related and put into the same homologous superfamily.
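The four levels described above nest inside one another, which can be illustrated with a small data structure. The sketch below is not the authors' procedure. It assumes the dotted numeric identifier convention (C.A.T.H, for example 3.40.50.720) used by the CATH database, which the abstract itself does not spell out. The names `CathCode`, `CLASS_NAMES`, and `prefix` are hypothetical and introduced only for illustration.

```python
from dataclasses import dataclass

# Assumed mapping of class numbers to names; the abstract only says that
# class describes secondary structure composition.
CLASS_NAMES = {
    1: "mainly alpha",
    2: "mainly beta",
    3: "alpha-beta",
    4: "few secondary structures",
}

LEVELS = ("C", "A", "T", "H")


@dataclass(frozen=True)
class CathCode:
    """One domain's position in the C/A/T/H hierarchy (hypothetical helper)."""
    class_: int
    architecture: int
    topology: int
    superfamily: int

    @classmethod
    def parse(cls, text: str) -> "CathCode":
        # Expect exactly four dot-separated integers, e.g. "3.40.50.720".
        parts = text.strip().split(".")
        if len(parts) != 4:
            raise ValueError(f"expected four dot-separated fields, got {text!r}")
        c, a, t, h = (int(p) for p in parts)
        return cls(c, a, t, h)

    def prefix(self, level: str) -> str:
        """Return the identifier truncated at the given level (C, A, T or H)."""
        level = level.upper()
        if level not in LEVELS:
            raise ValueError(f"unknown level {level!r}")
        depth = LEVELS.index(level) + 1
        values = (self.class_, self.architecture, self.topology, self.superfamily)
        return ".".join(str(v) for v in values[:depth])


code = CathCode.parse("3.40.50.720")  # illustrative identifier
print(CLASS_NAMES[code.class_], code.prefix("A"), code.prefix("T"))
```

Two domains share an architecture when their `prefix("A")` values match, and share a topology when their `prefix("T")` values match, which mirrors the statement that members of one architecture can have quite different topologies.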

CONCLUSIONS

Analysis of the structural families generated by CATH reveals the prominent features of protein structure space. We find that nearly a third of the homologous superfamilies (H-levels) belong to ten major T-levels, which we call superfolds, and furthermore that nearly two-thirds of these H-levels cluster into nine simple architectures. A database of well-characterised protein structure families, such as CATH, will facilitate the assignment of structure-function/evolution relationships to both known and newly determined protein structures.
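The superfold observation is a statement about how homologous superfamilies are distributed over topologies: a few T-levels hold a large share of the H-levels. The sketch below shows how such a share could be computed from a list of superfamily identifiers. It assumes dotted C.A.T.H identifiers, and the function name `top_topology_share` and the toy identifiers are hypothetical. It does not reproduce the paper's data or its reported fractions.

```python
from collections import Counter


def top_topology_share(superfamilies, k):
    """Fraction of distinct superfamilies found in the k most populated topologies.

    `superfamilies` is an iterable of dotted C.A.T.H strings; the topology of
    each one is taken to be its first three fields.
    """
    unique = set(superfamilies)
    if not unique:
        return 0.0
    # Count how many distinct superfamilies fall under each topology (C.A.T).
    per_topology = Counter(code.rsplit(".", 1)[0] for code in unique)
    covered = sum(count for _, count in per_topology.most_common(k))
    return covered / len(unique)


# Made-up identifiers, for illustration only.
toy = [
    "3.40.50.1", "3.40.50.2", "3.40.50.3",
    "2.60.40.1", "2.60.40.2",
    "1.10.8.1",
]
print(top_topology_share(toy, 1))  # share held by the single largest topology
```

Applied to a real set of superfamily identifiers with `k=10`, the same calculation would give the kind of figure the abstract reports for the ten superfolds.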
