• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CNNcon:使用级联神经网络改进蛋白质接触图预测。

CNNcon: improved protein contact maps prediction using cascaded neural networks.

机构信息

School of Computer Engineering and Science, Shanghai University, Shanghai, People's Republic of China.

出版信息

PLoS One. 2013 Apr 23;8(4):e61533. doi: 10.1371/journal.pone.0061533. Print 2013.

DOI:10.1371/journal.pone.0061533
PMID:23626696
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3634008/
Abstract

BACKGROUNDS

Despite continuing progress in X-ray crystallography and high-field NMR spectroscopy for determination of three-dimensional protein structures, the number of unsolved and newly discovered sequences grows much faster than that of determined structures. Protein modeling methods can possibly bridge this huge sequence-structure gap with the development of computational science. A grand challenging problem is to predict three-dimensional protein structure from its primary structure (residues sequence) alone. However, predicting residue contact maps is a crucial and promising intermediate step towards final three-dimensional structure prediction. Better predictions of local and non-local contacts between residues can transform protein sequence alignment to structure alignment, which can finally improve template based three-dimensional protein structure predictors greatly.

METHODS

CNNcon, an improved multiple neural networks based contact map predictor using six sub-networks and one final cascade-network, was developed in this paper. Both the sub-networks and the final cascade-network were trained and tested with their corresponding data sets. While for testing, the target protein was first coded and then input to its corresponding sub-networks for prediction. After that, the intermediate results were input to the cascade-network to finish the final prediction.

RESULTS

The CNNcon can accurately predict 58.86% in average of contacts at a distance cutoff of 8 Å for proteins with lengths ranging from 51 to 450. The comparison results show that the present method performs better than the compared state-of-the-art predictors. Particularly, the prediction accuracy keeps steady with the increase of protein sequence length. It indicates that the CNNcon overcomes the thin density problem, with which other current predictors have trouble. This advantage makes the method valuable to the prediction of long length proteins. As a result, the effective prediction of long length proteins could be possible by the CNNcon.

摘要

背景

尽管 X 射线晶体学和高场 NMR 光谱学在确定三维蛋白质结构方面不断取得进展,但未解决和新发现的序列数量的增长速度远远快于确定结构的数量。随着计算科学的发展,蛋白质建模方法有可能弥补这一巨大的序列-结构差距。一个巨大的挑战性问题是仅从其一级结构(残基序列)预测三维蛋白质结构。然而,预测残基接触图是朝着最终三维结构预测迈出的关键而有前途的中间步骤。更好地预测残基之间的局部和非局部接触可以将蛋白质序列比对转化为结构比对,从而最终大大改进基于模板的三维蛋白质结构预测器。

方法

本文开发了一种使用六个子网络和一个最终级联网络的基于改进的多个神经网络的接触图预测器 CNNcon。子网络和最终级联网络都使用相应的数据集进行训练和测试。在测试时,首先对目标蛋白质进行编码,然后将其输入到相应的子网络中进行预测。之后,将中间结果输入级联网络以完成最终预测。

结果

CNNcon 可以准确预测长度在 51 到 450 之间的蛋白质在距离截止值为 8 Å 时的接触点,平均准确率为 58.86%。比较结果表明,本方法的性能优于比较的最先进的预测器。特别是,随着蛋白质序列长度的增加,预测精度保持稳定。这表明 CNNcon 克服了其他当前预测器难以解决的密度稀疏问题。该优势使该方法对长链蛋白质的预测具有价值。因此,通过 CNNcon 可以实现对长链蛋白质的有效预测。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f9/3634008/8227ddade034/pone.0061533.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f9/3634008/65b0e058c835/pone.0061533.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f9/3634008/a1c2d207e6f1/pone.0061533.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f9/3634008/8227ddade034/pone.0061533.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f9/3634008/65b0e058c835/pone.0061533.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f9/3634008/a1c2d207e6f1/pone.0061533.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f9/3634008/8227ddade034/pone.0061533.g003.jpg

相似文献

1
CNNcon: improved protein contact maps prediction using cascaded neural networks.CNNcon:使用级联神经网络改进蛋白质接触图预测。
PLoS One. 2013 Apr 23;8(4):e61533. doi: 10.1371/journal.pone.0061533. Print 2013.
2
A two-stage approach for improved prediction of residue contact maps.一种用于改进残基接触图预测的两阶段方法。
BMC Bioinformatics. 2006 Mar 30;7:180. doi: 10.1186/1471-2105-7-180.
3
Ab initio and template-based prediction of multi-class distance maps by two-dimensional recursive neural networks.基于二维递归神经网络的多类别距离图的从头预测和基于模板的预测。
BMC Struct Biol. 2009 Jan 30;9:5. doi: 10.1186/1472-6807-9-5.
4
Toward an accurate prediction of inter-residue distances in proteins using 2D recursive neural networks.使用二维递归神经网络准确预测蛋白质中残基间的距离。
BMC Bioinformatics. 2014 Jan 10;15:6. doi: 10.1186/1471-2105-15-6.
5
Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13.基于深度学习的蛋白质三级结构建模和 CASP13 中的接触距离预测。
Proteins. 2019 Dec;87(12):1165-1178. doi: 10.1002/prot.25697. Epub 2019 Apr 25.
6
PCP-GC-LM: single-sequence-based protein contact prediction using dual graph convolutional neural network and convolutional neural network.PCP-GC-LM:基于双图卷积神经网络和卷积神经网络的单序列蛋白质接触预测。
BMC Bioinformatics. 2024 Sep 2;25(1):287. doi: 10.1186/s12859-024-05914-3.
7
Prediction of contact maps by GIOHMMs and recurrent neural networks using lateral propagation from all four cardinal corners.使用从四个基本角进行横向传播的广义隐马尔可夫模型(GIOHMM)和递归神经网络预测接触图。
Bioinformatics. 2002;18 Suppl 1:S62-70. doi: 10.1093/bioinformatics/18.suppl_1.s62.
8
Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model.基于超深度学习模型的蛋白质接触图从头精确预测
PLoS Comput Biol. 2017 Jan 5;13(1):e1005324. doi: 10.1371/journal.pcbi.1005324. eCollection 2017 Jan.
9
Hybridized distance- and contact-based hierarchical structure modeling for folding soluble and membrane proteins.用于可溶性和膜蛋白折叠的杂交距离和接触基层级结构建模。
PLoS Comput Biol. 2021 Feb 23;17(2):e1008753. doi: 10.1371/journal.pcbi.1008753. eCollection 2021 Feb.
10
Ab initio and homology based prediction of protein domains by recursive neural networks.利用递归神经网络对蛋白质结构域进行从头预测和基于同源性的预测。
BMC Bioinformatics. 2009 Jun 26;10:195. doi: 10.1186/1471-2105-10-195.

引用本文的文献

1
Ordering Protein Contact Matrices.订购蛋白质接触矩阵。
Comput Struct Biotechnol J. 2018 Mar 16;16:140-156. doi: 10.1016/j.csbj.2018.03.001. eCollection 2018.
2
Forecasting residue-residue contact prediction accuracy.预测残差-残基接触预测精度。
Bioinformatics. 2017 Nov 1;33(21):3405-3414. doi: 10.1093/bioinformatics/btx416.
3
The evolution of logic circuits for the purpose of protein contact map prediction.用于蛋白质接触图预测的逻辑电路的演变。

本文引用的文献

1
A network clustering algorithm for detection of protein families.一种用于检测蛋白质家族的网络聚类算法。
Annu Int Conf IEEE Eng Med Biol Soc. 2012;2012:6329-32. doi: 10.1109/EMBC.2012.6347441.
2
Deep architectures for protein contact map prediction.用于蛋白质接触图预测的深度架构。
Bioinformatics. 2012 Oct 1;28(19):2449-57. doi: 10.1093/bioinformatics/bts475. Epub 2012 Jul 30.
3
CMWeb: an interactive on-line tool for analysing residue-residue contacts and contact prediction methods.CMWeb:一种用于分析残基-残基接触和接触预测方法的交互式在线工具。
PeerJ. 2017 Apr 18;5:e3139. doi: 10.7717/peerj.3139. eCollection 2017.
4
Multi-Dimensional Scaling and MODELLER-Based Evolutionary Algorithms for Protein Model Refinement.用于蛋白质模型优化的基于多维缩放和MODELLER的进化算法。
Proc Congr Evol Comput. 2014 Jul;2014:1038-1045. doi: 10.1109/CEC.2014.6900443.
5
Characteristics of protein residue-residue contacts and their application in contact prediction.蛋白质残基-残基接触的特征及其在接触预测中的应用。
J Mol Model. 2014 Nov;20(11):2497. doi: 10.1007/s00894-014-2497-9. Epub 2014 Nov 6.
6
Parallel clustering algorithm for large-scale biological data sets.用于大规模生物数据集的并行聚类算法。
PLoS One. 2014 Apr 4;9(4):e91315. doi: 10.1371/journal.pone.0091315. eCollection 2014.
7
Sequence-based Gaussian network model for protein dynamics.基于序列的蛋白质动力学高斯网络模型。
Bioinformatics. 2014 Feb 15;30(4):497-505. doi: 10.1093/bioinformatics/btt716. Epub 2013 Dec 12.
Nucleic Acids Res. 2012 Jul;40(Web Server issue):W329-33. doi: 10.1093/nar/gks488. Epub 2012 Jun 4.
4
Critical assessment of methods of protein structure prediction (CASP)--round IX.蛋白质结构预测方法的关键评估(CASP)——第九轮。
Proteins. 2011;79 Suppl 10(0 10):1-5. doi: 10.1002/prot.23200. Epub 2011 Oct 14.
5
Evaluation of residue-residue contact predictions in CASP9.评估 CASP9 中残基-残基接触预测的结果。
Proteins. 2011;79 Suppl 10(Suppl 10):119-25. doi: 10.1002/prot.23160. Epub 2011 Sep 17.
6
Blurring contact maps of thousands of proteins: what we can learn by reconstructing 3D structure.模糊数千种蛋白质的接触图谱:通过重建 3D 结构我们可以学到什么。
BioData Min. 2011 Jan 13;4(1):1. doi: 10.1186/1756-0381-4-1.
7
I-TASSER: a unified platform for automated protein structure and function prediction.I-TASSER:一个用于自动化蛋白质结构和功能预测的统一平台。
Nat Protoc. 2010 Apr;5(4):725-38. doi: 10.1038/nprot.2010.5. Epub 2010 Mar 25.
8
PDBselect 1992-2009 and PDBfilter-select.PDBselect 1992-2009 和 PDBfilter-select。
Nucleic Acids Res. 2010 Jan;38(Database issue):D318-9. doi: 10.1093/nar/gkp786. Epub 2009 Sep 25.
9
NNcon: improved protein contact map prediction using 2D-recursive neural networks.NNcon:使用二维递归神经网络改进蛋白质接触图预测。
Nucleic Acids Res. 2009 Jul;37(Web Server issue):W515-8. doi: 10.1093/nar/gkp305. Epub 2009 May 6.
10
Comparative protein structure modeling using MODELLER.使用MODELLER进行比较蛋白质结构建模。
Curr Protoc Protein Sci. 2007 Nov;Chapter 2:Unit 2.9. doi: 10.1002/0471140864.ps0209s50.