• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从布鲁克海文蛋白质数据库中选择一组具有代表性的结构。

Selection of a representative set of structures from Brookhaven Protein Data Bank.

作者信息

Boberg J, Salakoski T, Vihinen M

机构信息

Department of Computer Science, University of Turku, Finland.

出版信息

Proteins. 1992 Oct;14(2):265-76. doi: 10.1002/prot.340140212.

DOI:10.1002/prot.340140212
PMID:1409573
Abstract

Reliable structural and statistical analyses of three dimensional protein structures should be based on unbiased data. The Protein Data Bank is highly redundant, containing several entries for identical or very similar sequences. A technique was developed for clustering the known structures based on their sequences and contents of alpha- and beta-structures. First, sequences were aligned pairwise. A representative sample of sequences was then obtained by grouping similar sequences together, and selecting a typical representative from each group. The similarity significance threshold needed in the clustering method was found by analyzing similarities of random sequences. Because three dimensional structures for proteins of same structural class are generally more conserved than their sequences, the proteins were clustered also according to their contents of secondary structural elements. The results of these clusterings indicate conservation of alpha- and beta-structures even when sequence similarity is relatively low. An unbiased sample of 103 high resolution structures, representing a wide variety of proteins, was chosen based on the suggestions made by the clustering algorithm. The proteins were divided into structural classes according to their contents and ratios of secondary structural elements. Previous classifications have suffered from subjective view of secondary structures, whereas here the classification was based on backbone geometry. The concise view lead to reclassification of some structures. The representative set of structures facilitates unbiased analyses of relationships between protein sequence, function, and structure as well as of structural characteristics.

摘要

对三维蛋白质结构进行可靠的结构和统计分析应基于无偏差的数据。蛋白质数据库高度冗余,包含相同或非常相似序列的多个条目。开发了一种基于已知结构的序列以及α-和β-结构含量进行聚类的技术。首先,将序列进行两两比对。然后通过将相似序列分组在一起,并从每组中选择一个典型代表来获得序列的代表性样本。通过分析随机序列的相似性来确定聚类方法所需的相似性显著性阈值。由于相同结构类别的蛋白质的三维结构通常比其序列更保守,因此还根据蛋白质二级结构元件的含量对其进行聚类。这些聚类结果表明,即使序列相似性相对较低,α-和β-结构也具有保守性。根据聚类算法的建议,选择了一个代表各种蛋白质的103个高分辨率结构的无偏差样本。根据蛋白质二级结构元件的含量和比例将其分为不同的结构类别。以前的分类受到二级结构主观观点的影响,而这里的分类是基于主链几何结构。这种简洁的观点导致了对一些结构的重新分类。该代表性结构集有助于对蛋白质序列、功能和结构之间的关系以及结构特征进行无偏差分析。

相似文献

1
Selection of a representative set of structures from Brookhaven Protein Data Bank.从布鲁克海文蛋白质数据库中选择一组具有代表性的结构。
Proteins. 1992 Oct;14(2):265-76. doi: 10.1002/prot.340140212.
2
An integrated approach to the analysis and modeling of protein sequences and structures. III. A comparative study of sequence conservation in protein structural families using multiple structural alignments.一种蛋白质序列与结构分析及建模的综合方法。III. 使用多重结构比对对蛋白质结构家族中的序列保守性进行比较研究。
J Mol Biol. 2000 Aug 18;301(3):691-711. doi: 10.1006/jmbi.2000.3975.
3
Alignment and searching for common protein folds using a data bank of structural templates.利用结构模板数据库进行比对并寻找常见蛋白质折叠。
J Mol Biol. 1993 Jun 5;231(3):735-52. doi: 10.1006/jmbi.1993.1323.
4
A 3D-1D substitution matrix for protein fold recognition that includes predicted secondary structure of the sequence.一种用于蛋白质折叠识别的3D-1D替换矩阵,其包含序列的预测二级结构。
J Mol Biol. 1997 Apr 11;267(4):1026-38. doi: 10.1006/jmbi.1997.0924.
5
Assessing annotation transfer for genomics: quantifying the relations between protein sequence, structure and function through traditional and probabilistic scores.评估基因组学中的注释转移:通过传统分数和概率分数量化蛋白质序列、结构与功能之间的关系。
J Mol Biol. 2000 Mar 17;297(1):233-49. doi: 10.1006/jmbi.2000.3550.
6
Automated search of natively folded protein fragments for high-throughput structure determination in structural genomics.在结构基因组学中自动搜索天然折叠的蛋白质片段以进行高通量结构测定。
Protein Sci. 2000 Dec;9(12):2313-21. doi: 10.1110/ps.9.12.2313.
7
Database of homology-derived protein structures and the structural meaning of sequence alignment.同源性衍生蛋白质结构数据库及序列比对的结构意义
Proteins. 1991;9(1):56-68. doi: 10.1002/prot.340090107.
8
Structural features can be unconserved in proteins with similar folds. An analysis of side-chain to side-chain contacts secondary structure and accessibility.在具有相似折叠结构的蛋白质中,结构特征可能是不保守的。对侧链与侧链接触、二级结构和可及性进行分析。
J Mol Biol. 1994 Dec 2;244(3):332-50. doi: 10.1006/jmbi.1994.1733.
9
An alternative view of protein fold space.蛋白质折叠空间的另一种观点。
Proteins. 2000 Feb 15;38(3):247-60.
10
Prediction of protein three-dimensional structures in insertion and deletion regions: a procedure for searching data bases of representative protein fragments using geometric scoring criteria.插入和缺失区域中蛋白质三维结构的预测:一种使用几何评分标准搜索代表性蛋白质片段数据库的方法。
J Mol Biol. 1995 Oct 13;253(1):114-31. doi: 10.1006/jmbi.1995.0540.

引用本文的文献

1
Never too late for endothelin.内皮素治疗永远不会太晚。
Acta Crystallogr F Struct Biol Commun. 2019 Jan 1;75(Pt 1):45-46. doi: 10.1107/S2053230X18018101.
2
Modelling of peptide and protein structures.肽和蛋白质结构的建模。
Amino Acids. 1994 Jun;7(2):175-202. doi: 10.1007/BF00814159.
3
Hierarchical classification of protein folds using a novel ensemble classifier.利用新型集成分类器对蛋白质折叠进行层次分类。
PLoS One. 2013;8(2):e56499. doi: 10.1371/journal.pone.0056499. Epub 2013 Feb 20.
4
Comparing models of evolution for ordered and disordered proteins.比较有序蛋白和无序蛋白进化模型。
Mol Biol Evol. 2010 Mar;27(3):609-21. doi: 10.1093/molbev/msp277. Epub 2009 Nov 18.
5
Predicting protein folding rates from geometric contact and amino acid sequence.从几何接触和氨基酸序列预测蛋白质折叠速率。
Protein Sci. 2008 Jul;17(7):1256-63. doi: 10.1110/ps.034660.108. Epub 2008 Apr 23.
6
SynDB: a Synapse protein DataBase based on synapse ontology.SynDB:基于突触本体论的突触蛋白数据库。
Nucleic Acids Res. 2007 Jan;35(Database issue):D737-41. doi: 10.1093/nar/gkl876. Epub 2006 Nov 10.
7
Global mapping of the protein structure space and application in structure-based inference of protein function.蛋白质结构空间的全球图谱及其在基于结构的蛋白质功能推断中的应用。
Proc Natl Acad Sci U S A. 2005 Mar 8;102(10):3651-6. doi: 10.1073/pnas.0409772102. Epub 2005 Feb 10.
8
Identification of csk tyrosine phosphorylation sites and a tyrosine residue important for kinase domain structure.识别csk酪氨酸磷酸化位点以及对激酶结构域结构重要的一个酪氨酸残基。
Biochem J. 1997 Mar 15;322 ( Pt 3)(Pt 3):927-35. doi: 10.1042/bj3220927.
9
Verification of protein structures: patterns of nonbonded atomic interactions.蛋白质结构的验证:非键合原子相互作用模式
Protein Sci. 1993 Sep;2(9):1511-9. doi: 10.1002/pro.5560020916.
10
Common prevalence of alanine and glycine in mobile reactive centre loops of serpins and viral fusion peptides: do prions possess a fusion peptide?丝氨酸蛋白酶抑制剂和病毒融合肽的移动反应中心环中丙氨酸和甘氨酸的常见普遍性:朊病毒是否具有融合肽?
J Comput Aided Mol Des. 1994 Apr;8(2):175-91. doi: 10.1007/BF00119866.