PPT-DB：蛋白质特性预测与测试数据库。

PPT-DB: the protein property prediction and testing database.

作者信息

Wishart David S, Arndt David, Berjanskii Mark, Guo An Chi, Shi Yi, Shrivastava Savita, Zhou Jianjun, Zhou You, Lin Guohui

机构信息

Department of Biological Sciences, Department of Computing Science, University of Alberta, Edmonton, Alberta, Canada.

出版信息

Nucleic Acids Res. 2008 Jan;36(Database issue):D222-9. doi: 10.1093/nar/gkm800. Epub 2007 Oct 4.

DOI:10.1093/nar/gkm800

PMID:17916570

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2238980/

Abstract

The protein property prediction and testing database (PPT-DB) is a database housing nearly 30 carefully curated databases, each of which contains commonly predicted protein property information. These properties include both structural (i.e. secondary structure, contact order, disulfide pairing) and dynamic (i.e. order parameters, B-factors, folding rates) features that have been measured, derived or tabulated from a variety of sources. PPT-DB is designed to serve two purposes. First it is intended to serve as a centralized, up-to-date, freely downloadable and easily queried repository of predictable or 'derived' protein property data. In this role, PPT-DB can serve as a one-stop, fully standardized repository for developers to obtain the required training, testing and validation data needed for almost any kind of protein property prediction program they may wish to create. The second role that PPT-DB can play is as a tool for homology-based protein property prediction. Users may query PPT-DB with a sequence of interest and have a specific property predicted using a sequence similarity search against PPT-DB's extensive collection of proteins with known properties. PPT-DB exploits the well-known fact that protein structure and dynamic properties are highly conserved between homologous proteins. Predictions derived from PPT-DB's similarity searches are typically 85-95% correct (for categorical predictions, such as secondary structure) or exhibit correlations of >0.80 (for numeric predictions, such as accessible surface area). This performance is 10-20% better than what is typically obtained from standard 'ab initio' predictions. PPT-DB, its prediction utilities and all of its contents are available at http://www.pptdb.ca.

摘要

蛋白质特性预测与测试数据库（PPT-DB）是一个容纳近30个精心策划数据库的数据库，每个数据库都包含常见的预测蛋白质特性信息。这些特性包括已从各种来源测量、推导或制表的结构特征（即二级结构、接触序、二硫键配对）和动态特征（即序参量、B因子、折叠速率）。PPT-DB旨在实现两个目的。首先，它旨在作为一个集中的、最新的、可免费下载且易于查询的可预测或“推导”蛋白质特性数据存储库。在这个角色中，PPT-DB可以作为一个一站式的、完全标准化的存储库，供开发人员获取他们可能希望创建的几乎任何类型蛋白质特性预测程序所需的训练、测试和验证数据。PPT-DB可以发挥的第二个作用是作为基于同源性的蛋白质特性预测工具。用户可以使用感兴趣的序列查询PPT-DB，并通过对PPT-DB中大量具有已知特性的蛋白质进行序列相似性搜索来预测特定特性。PPT-DB利用了同源蛋白质之间蛋白质结构和动态特性高度保守这一众所周知的事实。从PPT-DB的相似性搜索得出的预测通常有85%-95%是正确的（对于分类预测，如二级结构），或者相关性大于0.80（对于数值预测，如可及表面积）。这种性能比从标准的“从头开始”预测通常获得的性能要好10%-20%。PPT-DB、其预测实用程序及其所有内容可在http://www.pptdb.ca上获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0fdb/2238980/152bdd5650f1/gkm800f1.jpg

相似文献

PPT-DB: the protein property prediction and testing database.

Nucleic Acids Res. 2008 Jan;36(Database issue):D222-9. doi: 10.1093/nar/gkm800. Epub 2007 Oct 4.

MEGADOCK-Web: an integrated database of high-throughput structure-based protein-protein interaction predictions.

BMC Bioinformatics. 2018 May 8;19(Suppl 4):62. doi: 10.1186/s12859-018-2073-x.

Ab initio and template-based prediction of multi-class distance maps by two-dimensional recursive neural networks.

BMC Struct Biol. 2009 Jan 30;9:5. doi: 10.1186/1472-6807-9-5.

LOC3D: annotate sub-cellular localization for protein structures.

Nucleic Acids Res. 2003 Jul 1;31(13):3337-40. doi: 10.1093/nar/gkg514.

CKAAPs DB: a conserved key amino acid positions database.

Nucleic Acids Res. 2001 Jan 1;29(1):329-31. doi: 10.1093/nar/29.1.329.

ProSeg: a database of local structures of protein segments.

J Comput Aided Mol Des. 2009 Mar;23(3):163-9. doi: 10.1007/s10822-008-9248-x. Epub 2008 Oct 16.

CKAAPs DB: a Conserved Key Amino Acid Positions DataBase.

Nucleic Acids Res. 2002 Jan 1;30(1):409-11. doi: 10.1093/nar/30.1.409.

Fully automated ab initio protein structure prediction using I-SITES, HMMSTR and ROSETTA.

Bioinformatics. 2002;18 Suppl 1:S54-61. doi: 10.1093/bioinformatics/18.suppl_1.s54.

SAM-T08, HMM-based protein structure prediction.

Nucleic Acids Res. 2009 Jul;37(Web Server issue):W492-7. doi: 10.1093/nar/gkp403. Epub 2009 May 29.

PSPP: a protein structure prediction pipeline for computing clusters.

PLoS One. 2009 Jul 16;4(7):e6254. doi: 10.1371/journal.pone.0006254.

引用本文的文献

Variant Impact Predictor database (VIPdb), version 2: trends from three decades of genetic variant impact predictors.

Hum Genomics. 2024 Aug 28;18(1):90. doi: 10.1186/s40246-024-00663-z.

Variant Impact Predictor database (VIPdb), version 2: Trends from 25 years of genetic variant impact predictors.

bioRxiv. 2024 Jun 28:2024.06.25.600283. doi: 10.1101/2024.06.25.600283.

TSignal: a transformer model for signal peptide prediction.

Bioinformatics. 2023 Jun 30;39(39 Suppl 1):i347-i356. doi: 10.1093/bioinformatics/btad228.

VIPdb, a genetic Variant Impact Predictor Database.

Hum Mutat. 2019 Sep;40(9):1202-1214. doi: 10.1002/humu.23858. Epub 2019 Aug 17.

Conserved prosegment residues stabilize a late-stage folding transition state of pepsin independently of ground states.

PLoS One. 2014 Jul 1;9(7):e101339. doi: 10.1371/journal.pone.0101339. eCollection 2014.

Computational and experimental approaches to reveal the effects of single nucleotide polymorphisms with respect to disease diagnostics.

Int J Mol Sci. 2014 May 30;15(6):9670-717. doi: 10.3390/ijms15069670.

CyanoPhyChe: a database for physico-chemical properties, structure and biochemical pathway information of cyanobacterial proteins.

PLoS One. 2012;7(11):e49425. doi: 10.1371/journal.pone.0049425. Epub 2012 Nov 21.

A unified multitask architecture for predicting local protein properties.

PLoS One. 2012;7(3):e32235. doi: 10.1371/journal.pone.0032235. Epub 2012 Mar 26.

SeqRate: sequence-based protein folding type classification and rates prediction.

BMC Bioinformatics. 2010 Apr 29;11 Suppl 3(Suppl 3):S1. doi: 10.1186/1471-2105-11-S3-S1.

GeNMR: a web server for rapid NMR-based protein structure determination.

Nucleic Acids Res. 2009 Jul;37(Web Server issue):W670-7. doi: 10.1093/nar/gkp280. Epub 2009 Apr 30.

本文引用的文献

Real-SPINE: an integrated system of neural networks for real-value prediction of protein structural properties.

Proteins. 2007 Jul 1;68(1):76-81. doi: 10.1002/prot.21408.

Protein Folding Database (PFD 2.0): an online environment for the International Foldeomics Consortium.

Nucleic Acids Res. 2007 Jan;35(Database issue):D304-7. doi: 10.1093/nar/gkl1007. Epub 2006 Dec 14.

The worldwide Protein Data Bank (wwPDB): ensuring a single, uniform archive of PDB data.

Nucleic Acids Res. 2007 Jan;35(Database issue):D301-3. doi: 10.1093/nar/gkl971. Epub 2006 Nov 16.

PREDITOR: a web server for predicting protein torsion angle restraints.

Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W63-9. doi: 10.1093/nar/gkl341.

Improving the accuracy of protein secondary structure prediction using structural alignment.

BMC Bioinformatics. 2006 Jun 14;7:301. doi: 10.1186/1471-2105-7-301.

Paircoil2: improved prediction of coiled coils from sequence.

Bioinformatics. 2006 Feb 1;22(3):356-8. doi: 10.1093/bioinformatics/bti797. Epub 2005 Nov 29.

SPdb--a signal peptide database.

BMC Bioinformatics. 2005 Oct 13;6:249. doi: 10.1186/1471-2105-6-249.

Protein flexibility and rigidity predicted from sequence.

Proteins. 2005 Oct 1;61(1):115-26. doi: 10.1002/prot.20587.

BhairPred: prediction of beta-hairpins in a protein from multiple alignment information using ANN and SVM techniques.

Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W154-9. doi: 10.1093/nar/gki588.

SCRATCH: a protein structure and structural feature prediction server.

Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W72-6. doi: 10.1093/nar/gki396.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

PPT-DB：蛋白质特性预测与测试数据库。

PPT-DB: the protein property prediction and testing database.

作者信息

Wishart David S, Arndt David, Berjanskii Mark, Guo An Chi, Shi Yi, Shrivastava Savita, Zhou Jianjun, Zhou You, Lin Guohui

机构信息

Department of Biological Sciences, Department of Computing Science, University of Alberta, Edmonton, Alberta, Canada.

出版信息

Nucleic Acids Res. 2008 Jan;36(Database issue):D222-9. doi: 10.1093/nar/gkm800. Epub 2007 Oct 4.

DOI:10.1093/nar/gkm800

PMID:17916570

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2238980/

Abstract

摘要

PPT-DB：蛋白质特性预测与测试数据库。

PPT-DB: the protein property prediction and testing database.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

PPT-DB：蛋白质特性预测与测试数据库。

PPT-DB: the protein property prediction and testing database.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献