• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

球状蛋白质结构域数据库:将代表性家族成员聚类为相似折叠结构。

A database of globular protein structural domains: clustering of representative family members into similar folds.

作者信息

Sowdhamini R, Rufino S D, Blundell T L

机构信息

Department of Crystallography, Birkbeck College, London, UK.

出版信息

Fold Des. 1996;1(3):209-20. doi: 10.1016/S1359-0278(96)00032-6.

DOI:10.1016/S1359-0278(96)00032-6
PMID:9079382
Abstract

BACKGROUND

A database of globular domains, derived from a non-redundant set of proteins, is useful for the sequence analysis of aligned domains, for structural comparisons, for understanding domain stability and flexibility and for fold recognition procedures. Domains are defined by the program DIAL and classified structurally using the procedure SEA.

RESULTS

The DIAL-derived domain database (DDBASE) consists of 436 protein chains involving 695 protein domains. Of these, 206 are alpha-class, 191 are beta-class and 294 alpha and beta class. The domains, 63% from multidomain proteins and 73% less than 150 residues in length, were clustered automatically using both single-link cluster analysis and hierarchical clustering to give a quantitative estimate of similarity in the domain-fold space.

CONCLUSIONS

Highly populated and well described folds (doubly wound alpha/beta, singly wound alpha/beta barrels, globins alpha, large Greek-key beta and flavin-binding alpha/beta) are recognized at a SEA cut-off score of 0.55 in single-link clustering and at 0.65 in hierarchical clustering, although functionally related families are usually clearly distinguished at more stringent values.

摘要

背景

一个源自非冗余蛋白质集的球状结构域数据库,对于比对结构域的序列分析、结构比较、理解结构域稳定性和灵活性以及折叠识别程序都很有用。结构域由程序DIAL定义,并使用SEA程序进行结构分类。

结果

DIAL衍生的结构域数据库(DDBASE)由436条蛋白质链组成,涉及695个蛋白质结构域。其中,206个是α类,191个是β类,294个是α和β类。这些结构域,63%来自多结构域蛋白质,73%长度小于150个残基,使用单链聚类分析和层次聚类自动聚类,以定量估计结构域折叠空间中的相似性。

结论

在单链聚类中,SEA截止分数为0.55,在层次聚类中为0.65时,可识别出高度密集且描述良好的折叠(双绕α/β、单绕α/β桶、α类球蛋白、大希腊钥匙β和黄素结合α/β),尽管功能相关家族通常在更严格的值下能更清晰地区分。

相似文献

1
A database of globular protein structural domains: clustering of representative family members into similar folds.球状蛋白质结构域数据库:将代表性家族成员聚类为相似折叠结构。
Fold Des. 1996;1(3):209-20. doi: 10.1016/S1359-0278(96)00032-6.
2
Protein structural domains: analysis of the 3Dee domains database.蛋白质结构域:3Dee结构域数据库分析
Proteins. 2001 Feb 15;42(3):332-44.
3
A 3D sequence-independent representation of the protein data bank.蛋白质数据库的一种与序列无关的三维表示。
Protein Eng. 1995 Oct;8(10):981-97. doi: 10.1093/protein/8.10.981.
4
An automatic method involving cluster analysis of secondary structures for the identification of domains in proteins.一种涉及对二级结构进行聚类分析以识别蛋白质结构域的自动化方法。
Protein Sci. 1995 Mar;4(3):506-20. doi: 10.1002/pro.5560040317.
5
Protein topology recognition from secondary structure sequences: application of the hidden Markov models to the alpha class proteins.从二级结构序列识别蛋白质拓扑结构:隐马尔可夫模型在α类蛋白质中的应用。
J Mol Biol. 1997 Mar 28;267(2):446-63. doi: 10.1006/jmbi.1996.0874.
6
A database method for automated map interpretation in protein crystallography.一种用于蛋白质晶体学中自动图谱解读的数据库方法。
Proteins. 1999 Sep 1;36(4):526-41.
7
A database of protein structure families with common folding motifs.一个具有共同折叠基序的蛋白质结构家族数据库。
Protein Sci. 1992 Dec;1(12):1691-8. doi: 10.1002/pro.5560011217.
8
Modeling of folds and folding pathways for some protein families of (α + β)- and (α/β)-classes.一些(α+β)-和(α/β)-类蛋白质家族的折叠和折叠途径的建模。
J Biomol Struct Dyn. 2013;31(1):4-16. doi: 10.1080/07391102.2012.691341. Epub 2012 Jul 16.
9
An alternative view of protein fold space.蛋白质折叠空间的另一种观点。
Proteins. 2000 Feb 15;38(3):247-60.
10
Protein loops on structurally similar scaffolds: database and conformational analysis.结构相似支架上的蛋白质环:数据库与构象分析
Biopolymers. 1999 May;49(6):481-95. doi: 10.1002/(SICI)1097-0282(199905)49:6<481::AID-BIP6>3.0.CO;2-V.

引用本文的文献

1
AcconPred: Predicting Solvent Accessibility and Contact Number Simultaneously by a Multitask Learning Framework under the Conditional Neural Fields Model.AcconPred:在条件神经场模型下通过多任务学习框架同时预测溶剂可及性和接触数
Biomed Res Int. 2015;2015:678764. doi: 10.1155/2015/678764. Epub 2015 Aug 3.
2
Homology modeling a fast tool for drug discovery: current perspectives.同源建模:药物发现的快速工具——当前观点
Indian J Pharm Sci. 2012 Jan;74(1):1-17. doi: 10.4103/0250-474X.102537.
3
Sequence-based analysis of protein energy landscapes reveals nonuniform thermal adaptation within the proteome.
基于序列的蛋白质能量景观分析揭示了蛋白质组内不均匀的热适应性。
Mol Biol Evol. 2009 Oct;26(10):2217-27. doi: 10.1093/molbev/msp140. Epub 2009 Jul 10.
4
TCP: a tool for designing chimera proteins based on the tertiary structure information.TCP:一种基于三级结构信息设计嵌合蛋白的工具。
BMC Bioinformatics. 2009 Jan 7;10:9. doi: 10.1186/1471-2105-10-9.
5
Comparison of sequence-based and structure-based phylogenetic trees of homologous proteins: Inferences on protein evolution.同源蛋白质基于序列和基于结构的系统发育树比较:对蛋白质进化的推断
J Biosci. 2007 Jan;32(1):83-96. doi: 10.1007/s12038-007-0008-1.
6
Bayesian data mining of protein domains gives an efficient predictive algorithm and new insight.蛋白质结构域的贝叶斯数据挖掘提供了一种高效的预测算法和新见解。
J Mol Model. 2007 Jan;13(1):275-82. doi: 10.1007/s00894-006-0141-z. Epub 2006 Oct 7.
7
Fast prediction of protein domain boundaries using conserved local patterns.利用保守局部模式快速预测蛋白质结构域边界
J Mol Model. 2006 Sep;12(6):943-52. doi: 10.1007/s00894-006-0116-0. Epub 2006 Apr 29.
8
SCOP: a structural classification of proteins database.SCOP:蛋白质数据库的结构分类
Nucleic Acids Res. 2000 Jan 1;28(1):257-9. doi: 10.1093/nar/28.1.257.
9
The ASTRAL compendium for protein structure and sequence analysis.用于蛋白质结构和序列分析的ASTRAL汇编
Nucleic Acids Res. 2000 Jan 1;28(1):254-6. doi: 10.1093/nar/28.1.254.
10
CORA--topological fingerprints for protein structural families.CORA——蛋白质结构家族的拓扑指纹图谱。
Protein Sci. 1999 Apr;8(4):699-715. doi: 10.1110/ps.8.4.699.