Suppr超能文献

使用特征选择和支持向量机预测顺式/反式异构化

Prediction of cis/trans isomerization using feature selection and support vector machines.

作者信息

Exarchos Konstantinos P, Papaloukas Costas, Exarchos Themis P, Troganis Anastassios N, Fotiadis Dimitrios I

机构信息

Unit of Medical Technology and Intelligent Information Systems, Department of Computer Science, University of Ioannina, P.O. Box 1186, GR 45110 Ioannina, Greece.

出版信息

J Biomed Inform. 2009 Feb;42(1):140-9. doi: 10.1016/j.jbi.2008.05.006. Epub 2008 May 23.

Abstract

In protein structures the peptide bond is found to be in trans conformation in the majority of the cases. Only a small fraction of peptide bonds in proteins is reported to be in cis conformation. Most of these instances (>90%) occur when the peptide bond is an imide (X-Pro) rather than an amide bond (X-nonPro). Due to the implication of cis/trans isomerization in many biologically significant processes, the accurate prediction of the peptide bond conformation is of high interest. In this study, we evaluate the effect of a wide range of features, towards the reliable prediction of both proline and non-proline cis/trans isomerization. We use evolutionary profiles, secondary structure information, real-valued solvent accessibility predictions for each amino acid and the physicochemical properties of the surrounding residues. We also explore the predictive impact of a modified feature vector, which consists of condensed position-specific scoring matrices (PSSMX), secondary structure and solvent accessibility. The best discriminating ability is achieved using the first feature vector combined with a wrapper feature selection algorithm and a support vector machine (SVM). The proposed method results in 70% accuracy, 75% sensitivity and 71% positive predictive value (PPV) in the prediction of the peptide bond conformation between any two amino acids. The output of the feature selection stage is investigated in order to identify discriminatory features as well as the contribution of each neighboring residue in the formation of the peptide bond, thus, advancing our knowledge towards cis/trans isomerization.

摘要

在蛋白质结构中,大多数情况下肽键呈反式构象。据报道,蛋白质中只有一小部分肽键呈顺式构象。这些情况中的大多数(>90%)发生在肽键为亚胺键(X-Pro)而非酰胺键(X-非Pro)时。由于顺/反异构化在许多生物学重要过程中的影响,肽键构象的准确预测备受关注。在本研究中,我们评估了广泛的特征对脯氨酸和非脯氨酸顺/反异构化可靠预测的影响。我们使用进化谱、二级结构信息、每个氨基酸的实值溶剂可及性预测以及周围残基的物理化学性质。我们还探索了一种修改后的特征向量的预测影响,该特征向量由浓缩的位置特异性评分矩阵(PSSMX)、二级结构和溶剂可及性组成。使用第一个特征向量结合包装器特征选择算法和支持向量机(SVM)可实现最佳区分能力。所提出的方法在预测任意两个氨基酸之间的肽键构象时,准确率为70%,灵敏度为75%,阳性预测值(PPV)为71%。对特征选择阶段的输出进行了研究,以识别区分性特征以及每个相邻残基在肽键形成中的贡献,从而增进我们对顺/反异构化的了解。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验