Suppr超能文献

通过将真空紫外圆二色光谱与神经网络相结合,改进基于序列的蛋白质二级结构预测。

Improved sequence-based prediction of protein secondary structures by combining vacuum-ultraviolet circular dichroism spectroscopy with neural network.

作者信息

Matsuo Koichi, Watanabe Hidenori, Gekko Kunihiko

机构信息

Hiroshima Synchrotron Radiation Center, Hiroshima University, Higashi-Hiroshima, Japan.

出版信息

Proteins. 2008 Oct;73(1):104-12. doi: 10.1002/prot.22055.

Abstract

Synchrotron-radiation vacuum-ultraviolet circular dichroism (VUVCD) spectroscopy can significantly improve the predictive accuracy of the contents and segment numbers of protein secondary structures by extending the short-wavelength limit of the spectra. In the present study, we combined VUVCD spectra down to 160 nm with neural-network (NN) method to improve the sequence-based prediction of protein secondary structures. The secondary structures of 30 target proteins (test set) were assigned into alpha-helices, beta-strands, and others by the DSSP program based on their X-ray crystal structures. Combining the alpha-helix and beta-strand contents estimated from the VUVCD spectra of the target proteins improved the overall sequence-based predictive accuracy Q(3) for three secondary-structure components from 59.5 to 60.7%. Incorporating the position-specific scoring matrix in the NN method improved the predictive accuracy from 70.9 to 72.1% when combining the secondary-structure contents, to 72.5% when combining the numbers of segments, and finally to 74.9% when filtering the VUVCD data. Improvement in the sequence-based prediction of secondary structures was also apparent in two other indices of the overall performance: the correlation coefficient (C) and the segment overlap value (SOV). These results suggest that VUVCD data could enhance the predictive accuracy to over 80% when combined with the currently best sequence-prediction algorithms, greatly expanding the applicability of VUVCD spectroscopy to protein structural biology.

摘要

同步辐射真空紫外圆二色光谱(VUVCD)通过扩展光谱的短波长极限,可显著提高蛋白质二级结构含量和片段数预测的准确性。在本研究中,我们将低至160 nm的VUVCD光谱与神经网络(NN)方法相结合,以改进基于序列的蛋白质二级结构预测。根据30个目标蛋白质(测试集)的X射线晶体结构,通过DSSP程序将其二级结构分为α螺旋、β链和其他结构。结合从目标蛋白质的VUVCD光谱估计的α螺旋和β链含量,将基于序列的三种二级结构成分的整体预测准确性Q(3)从59.5%提高到60.7%。在NN方法中纳入位置特异性评分矩阵,在结合二级结构含量时,预测准确性从70.9%提高到72.1%;在结合片段数时,提高到72.5%;在过滤VUVCD数据时,最终提高到74.9%。基于序列的二级结构预测在整体性能的另外两个指标:相关系数(C)和片段重叠值(SOV)上也有明显改善。这些结果表明,当VUVCD数据与目前最好的序列预测算法相结合时,可将预测准确性提高到80%以上,极大地扩展了VUVCD光谱在蛋白质结构生物学中的适用性。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验