一种利用傅里叶分析和神经网络从序列识别蛋白质结构的新方法。

A novel approach to the recognition of protein architecture from sequence using Fourier analysis and neural networks.

作者信息

Shepherd Adrian J, Gorse Denise, Thornton Janet M

机构信息

Department of Biochemistry and Molecular Biology, University College London, London, United Kingdom.

出版信息

Proteins. 2003 Feb 1;50(2):290-302. doi: 10.1002/prot.10290.

DOI:10.1002/prot.10290

PMID:12486723

Abstract

A novel method is presented for the prediction of protein architecture from sequence using neural networks. The method involves the preprocessing of protein sequence data by numerically encoding it and then applying a Fourier transform. The encoded and transformed data are then used to train a neural network to recognize a number of different protein architectures. The method proved significantly better than comparable alternative strategies such as percentage dipeptide frequency, but is still limited by the size of the data set and the input demands of a neural network. Its main potential is as a complement to existing fold recognition techniques, with its ability to identify global symmetries within protein structures its greatest strength.

摘要

提出了一种使用神经网络从序列预测蛋白质结构的新方法。该方法包括通过对蛋白质序列数据进行数字编码然后应用傅里叶变换来进行预处理。然后将编码和变换后的数据用于训练神经网络以识别多种不同的蛋白质结构。该方法被证明比诸如二肽频率百分比等可比的替代策略要好得多，但仍然受到数据集大小和神经网络输入要求的限制。其主要潜力在于作为现有折叠识别技术的补充，识别蛋白质结构内全局对称性的能力是其最大优势。

相似文献

A novel approach to the recognition of protein architecture from sequence using Fourier analysis and neural networks.

Proteins. 2003 Feb 1;50(2):290-302. doi: 10.1002/prot.10290.

Using artificially generated spectral data to improve protein secondary structure prediction from Fourier transform infrared spectra of proteins.

Anal Biochem. 2004 Sep 15;332(2):238-44. doi: 10.1016/j.ab.2004.06.030.

Correlation and prediction of gene expression level from amino acid and dipeptide composition of its protein.

BMC Bioinformatics. 2005 Mar 17;6:59. doi: 10.1186/1471-2105-6-59.

Hepatitis C virus contact map prediction based on binary encoding strategy.

Comput Biol Chem. 2007 Jun;31(3):233-8. doi: 10.1016/j.compbiolchem.2007.03.009. Epub 2007 Mar 30.

Predicting protein secondary structure by cascade-correlation neural networks.

Bioinformatics. 2004 Feb 12;20(3):419-20. doi: 10.1093/bioinformatics/btg423. Epub 2004 Jan 22.

Pcons5: combining consensus, structural evaluation and fold recognition scores.

Bioinformatics. 2005 Dec 1;21(23):4248-54. doi: 10.1093/bioinformatics/bti702. Epub 2005 Oct 4.

Neural networks for secondary structure and structural class predictions.

Protein Sci. 1995 Feb;4(2):275-85. doi: 10.1002/pro.5560040214.

GANNPhos: a new phosphorylation site predictor based on a genetic algorithm integrated neural network.

Protein Eng Des Sel. 2007 Aug;20(8):405-12. doi: 10.1093/protein/gzm035. Epub 2007 Jul 24.

Ab initio prediction of the three-dimensional structure of a de novo designed protein: a double-blind case study.

Proteins. 2005 Feb 15;58(3):560-70. doi: 10.1002/prot.20338.

A neural network method for prediction of beta-turn types in proteins using evolutionary information.

Bioinformatics. 2004 Nov 1;20(16):2751-8. doi: 10.1093/bioinformatics/bth322. Epub 2004 May 14.

引用本文的文献

Anti-symmetric framework for balanced learning of protein-protein interactions.

Bioinformatics. 2024 Oct 1;40(10). doi: 10.1093/bioinformatics/btae603.

The EMILIN/Multimerin family.

Front Immunol. 2012 Jan 6;2:93. doi: 10.3389/fimmu.2011.00093. eCollection 2011.

A procedure for identifying homologous alternative splicing events.

BMC Bioinformatics. 2007 Jul 19;8:260. doi: 10.1186/1471-2105-8-260.

PROFEAT: a web server for computing structural and physicochemical features of proteins and peptides from amino acid sequence.

Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W32-7. doi: 10.1093/nar/gkl305.

EHPred: an SVM-based method for epoxide hydrolases recognition and classification.

J Zhejiang Univ Sci B. 2006 Jan;7(1):1-6. doi: 10.1631/jzus.2006.B0001.

Detailed protein sequence alignment based on Spectral Similarity Score (SSS).

BMC Bioinformatics. 2005 Apr 23;6:105. doi: 10.1186/1471-2105-6-105.

ESLpred: SVM-based method for subcellular localization of eukaryotic proteins using dipeptide composition and PSI-BLAST.

Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W414-9. doi: 10.1093/nar/gkh350.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种利用傅里叶分析和神经网络从序列识别蛋白质结构的新方法。

A novel approach to the recognition of protein architecture from sequence using Fourier analysis and neural networks.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献