Suppr超能文献

基于单序列的深度学习全序列预测蛋白质二级结构和溶剂可及性。

Single-sequence-based prediction of protein secondary structures and solvent accessibility by deep whole-sequence learning.

机构信息

Signal Processing Laboratory, Griffith University, Brisbane, QLD, 4111, Australia.

School of Data and Computer Science, Sun Yet-Sen University, Guangzhou, China.

出版信息

J Comput Chem. 2018 Oct 5;39(26):2210-2216. doi: 10.1002/jcc.25534. Epub 2018 Oct 14.

Abstract

Predicting protein structure from sequence alone is challenging. Thus, the majority of methods for protein structure prediction rely on evolutionary information from multiple sequence alignments. In previous work we showed that Long Short-Term Bidirectional Recurrent Neural Networks (LSTM-BRNNs) improved over regular neural networks by better capturing intra-sequence dependencies. Here we show a single-sequence-based prediction method employing LSTM-BRNNs (SPIDER3-Single), that consistently achieves Q3 accuracy of 72.5%, and correlation coefficient of 0.67 between predicted and actual solvent accessible surface area. Moreover, it yields reasonably accurate prediction of eight-state secondary structure, main-chain angles (backbone ϕ and ψ torsion angles and C α-atom-based θ and τ angles), half-sphere exposure, and contact number. The method is more accurate than the corresponding evolutionary-based method for proteins with few sequence homologs, and computationally efficient for large-scale screening of protein-structural properties. It is available as an option in the SPIDER3 server, and a standalone version for download, at http://sparks-lab.org. © 2018 Wiley Periodicals, Inc.

摘要

仅从序列预测蛋白质结构具有挑战性。因此,大多数蛋白质结构预测方法都依赖于来自多重序列比对的进化信息。在之前的工作中,我们表明长短期双向递归神经网络 (LSTM-BRNN) 通过更好地捕获序列内相关性,优于常规神经网络。在这里,我们展示了一种基于单序列的预测方法,该方法采用 LSTM-BRNN (SPIDER3-Single),始终实现 Q3 准确率为 72.5%,预测溶剂可及表面积与实际溶剂可及表面积之间的相关系数为 0.67。此外,它还可以合理准确地预测八元状态二级结构、主链角度(骨架 ϕ 和 ψ 扭转角以及基于 Cα 原子的 θ 和 τ 角)、半球暴露度和接触数。对于序列同源物较少的蛋白质,该方法比相应的基于进化的方法更准确,并且对于大规模筛选蛋白质结构特性具有计算效率。它可作为 SPIDER3 服务器中的一个选项使用,也可在 http://sparks-lab.org 下载独立版本。©2018 威利父子公司

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验