• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于深度残差集成学习从序列中检测蛋白质结构中的脯氨酸和顺式异构体和非脯氨酸异构体。

Detecting Proline and Non-Proline Cis Isomers in Protein Structures from Sequences Using Deep Residual Ensemble Learning.

机构信息

Signal Processing Laboratory , Griffith University , Brisbane , QLD 4122 , Australia.

Institute for Glycomics and School of Information and Communication Technology , Griffith University , Southport , QLD 4222 , Australia.

出版信息

J Chem Inf Model. 2018 Sep 24;58(9):2033-2042. doi: 10.1021/acs.jcim.8b00442. Epub 2018 Aug 29.

DOI:10.1021/acs.jcim.8b00442
PMID:30118602
Abstract

It has been long established that cis conformations of amino acid residues play many biologically important roles despite their rare occurrence in protein structure. Because of this rarity, few methods have been developed for predicting cis isomers from protein sequences, most of which are based on outdated datasets and lack the means for independent testing. In this work, using a database of >10000 high-resolution protein structures, we update the statistics of cis isomers and develop a sequence-based prediction technique using an ensemble of residual convolutional and long short-term memory bidirectional recurrent neural networks that allow learning from the whole protein sequence. We show that ensembling eight neural network models yields maximum Matthews correlation coefficient values of approximately 0.35 for cis-Pro isomers and 0.1 for cis-nonPro residues. The method should be useful for prioritizing functionally important residues in cis isomers for experimental validations and improving the sampling of rare protein conformations for ab initio protein structure prediction.

摘要

已经证实,尽管氨基酸残基的顺式构象在蛋白质结构中很少出现,但它们在许多生物中发挥着重要作用。由于这种稀有性,很少有方法可以从蛋白质序列中预测顺式异构体,其中大多数方法都是基于过时的数据集,并且缺乏独立测试的手段。在这项工作中,我们使用了一个包含>10000 个高分辨率蛋白质结构的数据库,更新了顺式异构体的统计数据,并开发了一种基于残差卷积和长短期记忆双向递归神经网络的序列预测技术,该技术可以从整个蛋白质序列中进行学习。我们表明,集成八个神经网络模型可以为顺式-Pro 异构体产生大约 0.35 的马修斯相关系数值,为顺式-nonPro 残基产生 0.1 的马修斯相关系数值。该方法对于优先考虑顺式异构体中功能重要的残基进行实验验证,以及提高从头预测蛋白质结构中稀有蛋白质构象的采样率都将非常有用。

相似文献

1
Detecting Proline and Non-Proline Cis Isomers in Protein Structures from Sequences Using Deep Residual Ensemble Learning.基于深度残差集成学习从序列中检测蛋白质结构中的脯氨酸和顺式异构体和非脯氨酸异构体。
J Chem Inf Model. 2018 Sep 24;58(9):2033-2042. doi: 10.1021/acs.jcim.8b00442. Epub 2018 Aug 29.
2
Predicting protein-ligand binding residues with deep convolutional neural networks.使用深度卷积神经网络预测蛋白质-配体结合残基。
BMC Bioinformatics. 2019 Feb 26;20(1):93. doi: 10.1186/s12859-019-2672-1.
3
Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model.基于超深度学习模型的蛋白质接触图从头精确预测
PLoS Comput Biol. 2017 Jan 5;13(1):e1005324. doi: 10.1371/journal.pcbi.1005324. eCollection 2017 Jan.
4
DeepACLSTM: deep asymmetric convolutional long short-term memory neural models for protein secondary structure prediction.DeepACLSTM:用于蛋白质二级结构预测的深度非对称卷积长短时记忆神经模型。
BMC Bioinformatics. 2019 Jun 17;20(1):341. doi: 10.1186/s12859-019-2940-0.
5
Protein secondary structure prediction improved by recurrent neural networks integrated with two-dimensional convolutional neural networks.通过与二维卷积神经网络集成的循环神经网络改进蛋白质二级结构预测。
J Bioinform Comput Biol. 2018 Oct;16(5):1850021. doi: 10.1142/S021972001850021X.
6
Randomized Subspace Learning for Proline Cis-Trans Isomerization Prediction.用于脯氨酸顺反异构化预测的随机子空间学习
IEEE/ACM Trans Comput Biol Bioinform. 2015 Jul-Aug;12(4):763-9. doi: 10.1109/TCBB.2014.2369040.
7
Extraction of consensus protein patterns in regions containing non-proline cis peptide bonds and their functional assessment.提取含有非脯氨酸顺式肽键的区域中的共识蛋白模式及其功能评估。
BMC Bioinformatics. 2011 May 10;12:142. doi: 10.1186/1471-2105-12-142.
8
Thermodynamic origin of cis/trans isomers of a proline-containing beta-turn model dipeptide in aqueous solution: a combined variable temperature 1H-NMR, two-dimensional 1H,1H gradient enhanced nuclear Overhauser effect spectroscopy (NOESY), one-dimensional steady-state intermolecular 13C,1H NOE, and molecular dynamics study.含脯氨酸的β-转角模型二肽在水溶液中顺式/反式异构体的热力学起源:变温1H-NMR、二维1H,1H梯度增强核Overhauser效应光谱(NOESY)、一维稳态分子间13C,1H NOE及分子动力学联合研究
Biopolymers. 2000 Jan;53(1):72-83. doi: 10.1002/(SICI)1097-0282(200001)53:1<72::AID-BIP7>3.0.CO;2-5.
9
Beta VI turns in peptides and proteins: a model peptide mimicry.βVI 转化肽和蛋白质:一种模拟肽模型。
Proteins. 1993 Mar;15(3):235-51. doi: 10.1002/prot.340150303.
10
Accurate prediction of protein contact maps by coupling residual two-dimensional bidirectional long short-term memory with convolutional neural networks.通过将残差二维双向长短期记忆与卷积神经网络相结合,准确预测蛋白质接触图。
Bioinformatics. 2018 Dec 1;34(23):4039-4045. doi: 10.1093/bioinformatics/bty481.

引用本文的文献

1
Proline Isomerization: From the Chemistry and Biology to Therapeutic Opportunities.脯氨酸异构化:从化学与生物学至治疗机遇
Biology (Basel). 2023 Jul 14;12(7):1008. doi: 10.3390/biology12071008.
2
Deep Ensemble Learning with Atrous Spatial Pyramid Networks for Protein Secondary Structure Prediction.基于空洞空间金字塔网络的深度集成学习用于蛋白质二级结构预测
Biomolecules. 2022 Jun 2;12(6):774. doi: 10.3390/biom12060774.
3
Secondary Structures of Proteins: A Comparison of Models and Experimental Results.蛋白质的二级结构:模型与实验结果比较。
J Proteome Res. 2021 Mar 5;20(3):1802-1808. doi: 10.1021/acs.jproteome.0c00986. Epub 2021 Feb 23.
4
SPOT-Disorder2: Improved Protein Intrinsic Disorder Prediction by Ensembled Deep Learning.SPOT-Disorder2:通过集成深度学习提高蛋白质固有无序预测。
Genomics Proteomics Bioinformatics. 2019 Dec;17(6):645-656. doi: 10.1016/j.gpb.2019.01.004. Epub 2020 Mar 13.
5
Functional identification of a FKBP-like protein involved in the folding of overproduced secreted proteins.鉴定 FKBP 样蛋白在折叠过量分泌蛋白中的功能。
Open Biol. 2019 Oct 31;9(10):190201. doi: 10.1098/rsob.190201. Epub 2019 Oct 30.