Systematic comparison of SCOP and CATH: a new gold standard for protein structure analysis.

Author information

Gergely Csaba, Fabian Birzele, Ralf Zimmer

Affiliation

Department of Informatics, Ludwig-Maximilians-Universität München, Munich, Germany.

Publication information

BMC Struct Biol. 2009 Apr 17;9:23. doi: 10.1186/1472-6807-9-23.

DOI:10.1186/1472-6807-9-23

PMID:19374763

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC2678134/

Abstract

BACKGROUND

SCOP and CATH are widely used as gold standards for benchmarking novel protein structure comparison methods and for training machine learning approaches to protein structure classification and prediction. The two hierarchies are built with different protocols, which can lead to differing classifications of the same protein. Ignoring such differences causes problems when the hierarchies are used to train or benchmark automatic structure classification methods. Here, we propose a method to compare SCOP and CATH in detail and discuss possible applications of this analysis.

RESULTS

We create a new mapping between SCOP and CATH and define a consistent benchmark set, which is shown to substantially reduce the errors made by structure comparison methods such as TM-Align and has further applications, e.g. in training machine learning methods for protein structure classification. In addition, we extract further connections in the topology of the protein fold space from the orthogonal features contained in SCOP and CATH.
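The abstract does not spell out how the mapping or the consistent benchmark set is built, so the following is only a minimal sketch of one plausible reading: domains are matched by residue overlap, and a pair of matched domains counts as consistent when both hierarchies agree that the two domains belong together or agree that they do not. The function names, the 0.8 overlap threshold, the labels, and the toy data are all assumptions made for illustration, not the authors' protocol.

```python
from itertools import combinations

def overlap(a, b):
    """Fraction of the larger domain's residues that both domains share."""
    return len(a & b) / max(len(a), len(b))

def map_domains(scop, cath, min_overlap=0.8):
    """Pair each SCOP domain with its best-overlapping CATH domain.

    scop, cath: dict of domain id -> (class label, set of residue keys).
    min_overlap is a hypothetical threshold, not a value from the paper.
    """
    pairs = []
    for sid, (_, sres) in scop.items():
        best = max(cath, key=lambda cid: overlap(sres, cath[cid][1]), default=None)
        if best is not None and overlap(sres, cath[best][1]) >= min_overlap:
            pairs.append((sid, best))
    return pairs

def consistent_pairs(mapped, scop, cath):
    """Keep pairs of mapped domains on which both hierarchies agree."""
    out = []
    for (s1, c1), (s2, c2) in combinations(mapped, 2):
        same_scop = scop[s1][0] == scop[s2][0]
        same_cath = cath[c1][0] == cath[c2][0]
        if same_scop == same_cath:  # both say "same" or both say "different"
            out.append(((s1, c1), (s2, c2), same_scop))
    return out

def residues(protein, start, end):
    """Residue keys (protein id, residue number) for an inclusive range."""
    return {(protein, i) for i in range(start, end + 1)}

# Made-up toy data: labels and proteins are placeholders, not real entries.
scop = {
    "s1": ("foldA", residues("p1", 1, 100)),
    "s2": ("foldA", residues("p2", 1, 90)),
    "s3": ("foldB", residues("p3", 1, 120)),
}
cath = {
    "c1": ("topoX", residues("p1", 1, 95)),
    "c2": ("topoX", residues("p2", 1, 90)),
    "c3": ("topoX", residues("p3", 1, 120)),  # grouped differently than in SCOP
}

mapped = map_domains(scop, cath)
benchmark = consistent_pairs(mapped, scop, cath)
```

In this toy case all three domains are mapped, but only the pair (s1, s2) lands in the benchmark: the pairs involving s3 are dropped because the SCOP labels separate them while the CATH labels group them.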

CONCLUSION

Via an all-to-all comparison, we find large and unexpected differences between SCOP and CATH with respect to their domain definitions as well as their hierarchical partitioning of the fold space at every level of the two classifications. A consistent mapping of SCOP and CATH can be exploited for automated structure comparison and classification.

AVAILABILITY

Benchmark sets and an interactive SCOP-CATH browser are available at http://www.bio.ifi.lmu.de/SCOPCath.
