Suppr超能文献

用于改进预测α-螺旋膜蛋白中扭结螺旋的字符串核和高质量数据集。

String kernels and high-quality data set for improved prediction of kinked helices in α-helical membrane proteins.

机构信息

Johannes Gutenberg-University of Mainz , 55128 Mainz, Germany.

出版信息

J Chem Inf Model. 2011 Nov 28;51(11):3017-25. doi: 10.1021/ci200278w. Epub 2011 Oct 26.

Abstract

The reasons for distortions from optimal α-helical geometry are widely unknown, but their influences on structural changes of proteins are significant. Hence, their prediction is a crucial problem in structural bioinformatics. For the particular case of kink prediction, we generated a data set of 132 membrane proteins containing 1014 manually labeled helices and examined the environment of kinks. Our sequence analysis confirms the great relevance of proline and reveals disproportionately high occurrences of glycine and serine at kink positions. The structural analysis shows significantly different solvent accessible surface area mean values for kinked and nonkinked helices. More important, we used this data set to validate string kernels for support vector machines as a new kink prediction method. Applying the new predictor, about 80% of all helices could be correctly predicted as kinked or nonkinked even when focusing on small helical fragments. The results exceed recently reported accuracies of alternative approaches and are a consequence of both the method and the data set.

摘要

扭曲偏离最佳α-螺旋几何形状的原因尚不清楚,但它们对蛋白质结构变化的影响是显著的。因此,它们的预测是结构生物信息学中的一个关键问题。对于扭结预测这一特殊情况,我们生成了一个包含 1014 个手动标记螺旋的 132 种膜蛋白数据集,并检查了扭结的环境。我们的序列分析证实了脯氨酸的重要相关性,并揭示了扭结位置甘氨酸和丝氨酸的异常高出现率。结构分析表明,扭结和非扭结螺旋的溶剂可及表面积平均值有显著差异。更重要的是,我们使用这个数据集来验证支持向量机的字符串核作为一种新的扭结预测方法。应用新的预测器,即使只关注小的螺旋片段,也可以正确预测大约 80%的所有螺旋是否扭结或非扭结。这些结果超过了最近报道的替代方法的准确性,是方法和数据集共同作用的结果。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验