使用多样性测度的二次判别分析进行 β-发夹预测。

Beta-hairpin prediction with quadratic discriminant analysis using diversity measure.

机构信息

College of Computer Science, Chongqing University, Chongqing 400044, China.

出版信息

J Comput Chem. 2009 Nov 15;30(14):2277-84. doi: 10.1002/jcc.21229.

PMID:19263434

Abstract

On the basis of the features of protein sequential pattern, we used the method of increment of diversity combined with quadratic discriminant analysis (IDQD) to predict beta-hairpins motifs in protein sequences. Three rules are used to extract the raw beta-beta motifs sequential patterns for fixed-length. Amino acid basic compositions, dipeptide components, and amino acid composition distribution are combined to represent the compositional features. Eighteen feature variables on a sequential pattern to be predicted are defined in terms of ID. They are integrated in a single formal framework given by IDQD. The method is trained and tested on ArchDB40 dataset containing 3088 proteins. The overall accuracy of prediction and Matthew's correlation coefficient for the independent testing dataset are 81.7% and 0.60, respectively. In addition, a higher accuracy of 84.5% and Matthew's correlation coefficient of 0.68 for the independent testing dataset are obtained on a dataset previously used by Kumar et al. (Nucleic Acids Res 2005, 33, 154), which contains 2088 proteins. For a fair assessment of our method, the performance is also evaluated on all 63 proteins used in CASP6. The overall accuracy of prediction is 74.2% for the independent testing dataset.

摘要

基于蛋白质序列模式的特点，我们使用多样性增量结合二次判别分析（IDQD）的方法来预测蛋白质序列中的β发夹基序。使用三种规则从原始β-β基序序列中提取固定长度的序列模式。氨基酸组成、二肽组成和氨基酸组成分布相结合来表示组成特征。在 ID 方面，对要预测的序列模式定义了 18 个特征变量。它们集成在由 IDQD 给出的单个正式框架中。该方法在包含 3088 个蛋白质的 ArchDB40 数据集上进行了训练和测试。独立测试数据集的整体预测准确率和 Matthew 相关系数分别为 81.7%和 0.60。此外，在 Kumar 等人以前使用的数据集（Nucleic Acids Res 2005，33，154）上，对独立测试数据集的准确率更高，达到 84.5%，Matthew 相关系数为 0.68，该数据集包含 2088 个蛋白质。为了公平评估我们的方法，还在 CASP6 中使用的 63 个蛋白质上评估了该方法的性能。独立测试数据集的总体预测准确率为 74.2%。

相似文献

Beta-hairpin prediction with quadratic discriminant analysis using diversity measure.

J Comput Chem. 2009 Nov 15;30(14):2277-84. doi: 10.1002/jcc.21229.

Supersecondary structure prediction using Chou's pseudo amino acid composition.

J Comput Chem. 2011 Jan 30;32(2):271-8. doi: 10.1002/jcc.21616.

Recognition of beta-hairpin motifs in proteins by using the composite vector.

Amino Acids. 2010 Mar;38(3):915-21. doi: 10.1007/s00726-009-0299-7. Epub 2009 May 6.

Statistical geometry based prediction of nonsynonymous SNP functional effects using random forest and neuro-fuzzy classifiers.

Proteins. 2008 Jun;71(4):1930-9. doi: 10.1002/prot.21838.

Prediction of the beta-hairpins in proteins using support vector machine.

Protein J. 2008 Feb;27(2):115-22. doi: 10.1007/s10930-007-9114-z.

DPROT: prediction of disordered proteins using evolutionary information.

Amino Acids. 2008 Oct;35(3):599-605. doi: 10.1007/s00726-008-0085-y. Epub 2008 Apr 19.

Using pseudo amino acid composition to predict protein subnuclear location with improved hybrid approach.

Amino Acids. 2008 Jan;34(1):119-25. doi: 10.1007/s00726-007-0545-9. Epub 2007 May 21.

Prediction of the parallel/antiparallel orientation of beta-strands using amino acid pairing preferences and support vector machines.

J Theor Biol. 2010 Apr 7;263(3):360-8. doi: 10.1016/j.jtbi.2009.12.019. Epub 2009 Dec 24.

The modified Mahalanobis Discriminant for predicting outer membrane proteins by using Chou's pseudo amino acid composition.

J Theor Biol. 2008 May 21;252(2):350-6. doi: 10.1016/j.jtbi.2008.02.004. Epub 2008 Feb 12.

Classification of G-protein coupled receptors at four levels.

Protein Eng Des Sel. 2006 Nov;19(11):511-6. doi: 10.1093/protein/gzl038. Epub 2006 Oct 10.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

使用多样性测度的二次判别分析进行 β-发夹预测。

Beta-hairpin prediction with quadratic discriminant analysis using diversity measure.

机构信息

College of Computer Science, Chongqing University, Chongqing 400044, China.

出版信息

J Comput Chem. 2009 Nov 15;30(14):2277-84. doi: 10.1002/jcc.21229.

DOI:10.1002/jcc.21229

PMID:19263434

Abstract

摘要

使用多样性测度的二次判别分析进行 β-发夹预测。

Beta-hairpin prediction with quadratic discriminant analysis using diversity measure.

机构信息

出版信息

相似文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

使用多样性测度的二次判别分析进行 β-发夹预测。

Beta-hairpin prediction with quadratic discriminant analysis using diversity measure.

机构信息

出版信息

相似文献