PROFEAT: a web server for computing structural and physicochemical features of proteins and peptides from amino acid sequence.

Author information

Li Z R, Lin H H, Han L Y, Jiang L, Chen X, Chen Y Z

Affiliation

Bioinformatics and Drug Design Group, Department of Computational Science, National University of Singapore, Blk SOC1, Level 7, 3 Science Drive 2, Singapore 117543.

Publication information

Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W32-7. doi: 10.1093/nar/gkl305.

DOI:10.1093/nar/gkl305

PMID:16845018

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC1538821/

Abstract

Sequence-derived structural and physicochemical features have frequently been used in the development of statistical learning models for predicting proteins and peptides of different structural, functional and interaction profiles. PROFEAT (Protein Features) is a web server for computing commonly used structural and physicochemical features of proteins and peptides from amino acid sequence. It computes six feature groups composed of ten features that include 51 descriptors and 1447 descriptor values. The computed features include amino acid composition, dipeptide composition, normalized Moreau-Broto autocorrelation, Moran autocorrelation, Geary autocorrelation, sequence-order-coupling number, quasi-sequence-order descriptors and the composition, transition and distribution of various structural and physicochemical properties. In addition, it can compute the autocorrelation descriptors listed above from user-defined properties. Our computational algorithms were extensively tested, and the computed protein features have been used in a number of published works for predicting protein functional classes, protein-protein interactions and MHC-binding peptides. PROFEAT is accessible at http://jing.cz3.nus.edu.sg/cgi-bin/prof/prof.cgi.
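
To make the two simplest feature groups named in the abstract concrete, the following is a minimal Python sketch of amino acid composition and dipeptide composition. It is not PROFEAT's code: the function names are hypothetical, and it assumes the common definitions in which each count is divided by the sequence length N (composition) or by the N-1 overlapping pairs (dipeptides), with only the 20 standard residues counted. The server may scale or handle non-standard residues differently.

```python
from itertools import product

# Assumption: only the 20 standard amino acids are counted.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def amino_acid_composition(seq):
    """Return the fraction of each standard amino acid (20 values)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    n = len(seq)
    return {aa: seq.count(aa) / n for aa in AMINO_ACIDS}


def dipeptide_composition(seq):
    """Return the fraction of each possible dipeptide (400 values)."""
    seq = seq.upper()
    if len(seq) < 2:
        raise ValueError("sequence must contain at least two residues")
    # Overlapping residue pairs: positions (0,1), (1,2), ..., (N-2, N-1).
    pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
    total = len(pairs)
    return {
        a + b: pairs.count(a + b) / total
        for a, b in product(AMINO_ACIDS, repeat=2)
    }
```

Together these two functions yield 20 + 400 = 420 values per sequence. The autocorrelation, sequence-order and composition/transition/distribution descriptors additionally require per-residue property scales or residue groupings, which this sketch does not attempt to reproduce.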

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bed2/1538821/a25215edd1ab/gkl305f1.jpg

Similar articles

PROFEAT: a web server for computing structural and physicochemical features of proteins and peptides from amino acid sequence.

Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W32-7. doi: 10.1093/nar/gkl305.

Update of PROFEAT: a web server for computing structural and physicochemical features of proteins and peptides from amino acid sequence.

Nucleic Acids Res. 2011 Jul;39(Web Server issue):W385-90. doi: 10.1093/nar/gkr284. Epub 2011 May 23.

MODEL-molecular descriptor lab: a web-based server for computing structural and physicochemical features of compounds.

Biotechnol Bioeng. 2007 Jun 1;97(2):389-96. doi: 10.1002/bit.21214.

propy: a tool to generate various modes of Chou's PseAAC.

Bioinformatics. 2013 Apr 1;29(7):960-2. doi: 10.1093/bioinformatics/btt072. Epub 2013 Feb 19.

PROFEAT Update: A Protein Features Web Server with Added Facility to Compute Network Descriptors for Studying Omics-Derived Networks.

J Mol Biol. 2017 Feb 3;429(3):416-425. doi: 10.1016/j.jmb.2016.10.013. Epub 2016 Oct 12.

protr/ProtrWeb: R package and web server for generating various numerical representation schemes of protein sequences.

Bioinformatics. 2015 Jun 1;31(11):1857-9. doi: 10.1093/bioinformatics/btv042. Epub 2015 Jan 24.

Predicting protein structural class based on multi-features fusion.

J Theor Biol. 2008 Jul 21;253(2):388-92. doi: 10.1016/j.jtbi.2008.03.009. Epub 2008 Mar 14.

Correlation and prediction of gene expression level from amino acid and dipeptide composition of its protein.

BMC Bioinformatics. 2005 Mar 17;6:59. doi: 10.1186/1471-2105-6-59.

A protein network descriptor server and its use in studying protein, disease, metabolic and drug targeted networks.

Brief Bioinform. 2017 Nov 1;18(6):1057-1070. doi: 10.1093/bib/bbw071.

COPid: composition based protein identification.

In Silico Biol. 2008;8(2):121-8.

Cited by

AOPxSVM: A Support Vector Machine for Identifying Antioxidant Peptides Using a Block Substitution Matrix and Amino Acid Composition, Transformation, and Distribution Embeddings.

Foods. 2025 Jun 6;14(12):2014. doi: 10.3390/foods14122014.

The genome of the polyextremophilic yeast, Naganishia friedmannii, reveals adaptations involved in stress response pathways, carbohydrate metabolism expansion, and a limited DNA repair repertoire.

FEMS Yeast Res. 2025 Jan 30;25. doi: 10.1093/femsyr/foaf028.

Predicting amyloid proteins using attention-based long short-term memory.

PeerJ Comput Sci. 2025 Feb 7;11:e2660. doi: 10.7717/peerj-cs.2660. eCollection 2025.

APBIO: bioactive profiling of air pollutants through inferred bioactivity signatures and prediction of novel target interactions.

J Cheminform. 2025 Jan 31;17(1):13. doi: 10.1186/s13321-025-00961-1.

Identifying nucleotide-binding leucine-rich repeat receptor and pathogen effector pairing using transfer-learning and bilinear attention network.

Bioinformatics. 2024 Oct 1;40(10). doi: 10.1093/bioinformatics/btae581.

In silico protein function prediction: the rise of machine learning-based approaches.

Med Rev (2021). 2023 Nov 29;3(6):487-510. doi: 10.1515/mr-2023-0038. eCollection 2023 Dec.

PPSNO: A Feature-Rich SNO Sites Predictor by Stacking Ensemble Strategy from Protein Sequence-Derived Information.

Interdiscip Sci. 2024 Mar;16(1):192-217. doi: 10.1007/s12539-023-00595-7. Epub 2024 Jan 11.

TROLLOPE: A novel sequence-based stacked approach for the accelerated discovery of linear T-cell epitopes of hepatitis C virus.

PLoS One. 2023 Aug 25;18(8):e0290538. doi: 10.1371/journal.pone.0290538. eCollection 2023.

Interactome-Based Machine Learning Predicts Potential Therapeutics for COVID-19.

ACS Omega. 2023 Apr 4;8(15):13840-13854. doi: 10.1021/acsomega.3c00030. eCollection 2023 Apr 18.

SCP4ssd: A Serverless Platform for Nucleotide Sequence Synthesis Difficulty Prediction Using an AutoML Model.

Genes (Basel). 2023 Feb 28;14(3):605. doi: 10.3390/genes14030605.

References

Prediction of the functional class of lipid binding proteins from sequence-derived properties irrespective of sequence similarity.

J Lipid Res. 2006 Apr;47(4):824-31. doi: 10.1194/jlr.M500530-JLR200. Epub 2006 Jan 27.

Prediction of functional class of novel bacterial proteins without the use of sequence similarity by a statistical learning method.

J Mol Microbiol Biotechnol. 2005;9(2):86-100. doi: 10.1159/000088839.

Prediction of transporter family from protein sequence by support vector machine approach.

Proteins. 2006 Jan 1;62(1):218-31. doi: 10.1002/prot.20605.

Population structure inferred by local spatial autocorrelation: an example from an Amerindian tribal population.

Am J Phys Anthropol. 2006 Jan;129(1):121-31. doi: 10.1002/ajpa.20250.

Effect of training datasets on support vector machine prediction of protein-protein interactions.

Proteomics. 2005 Mar;5(4):876-84. doi: 10.1002/pmic.200401118.

Predicting functional family of novel enzymes irrespective of sequence similarity: a statistical learning approach.

Nucleic Acids Res. 2004 Dec 7;32(21):6437-44. doi: 10.1093/nar/gkh984. Print 2004.

Prediction of functional class of novel viral proteins by a statistical learning method irrespective of sequence similarity.

Virology. 2005 Jan 5;331(1):136-43. doi: 10.1016/j.virol.2004.10.020.

Effect of molecular descriptor feature selection in support vector machine classification of pharmacokinetic and toxicological properties of chemical agents.

J Chem Inf Comput Sci. 2004 Sep-Oct;44(5):1630-8. doi: 10.1021/ci049869h.

Notes on continuous stochastic phenomena.

Biometrika. 1950 Jun;37(1-2):17-23.

Prediction of protein subcellular locations by GO-FunD-PseAA predictor.

Biochem Biophys Res Commun. 2004 Aug 6;320(4):1236-9. doi: 10.1016/j.bbrc.2004.06.073.
