• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用平均化学位移预测蛋白质结构类别。

The prediction of protein structural class using averaged chemical shifts.

机构信息

Key Laboratory for NeuroInformation of Ministry of Education, Center of Bioinformatics, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China.

出版信息

J Biomol Struct Dyn. 2012;29(6):643-9. doi: 10.1080/07391102.2011.672628.

DOI:10.1080/07391102.2011.672628
PMID:22545995
Abstract

Knowledge of protein structural class can provide important information about its folding patterns. Many approaches have been developed for the prediction of protein structural classes. However, the information used by these approaches is primarily based on amino acid sequences. In this study, a novel method is presented to predict protein structural classes by use of chemical shift (CS) information derived from nuclear magnetic resonance spectra. Firstly, 399 non-homologue (about 15% identity) proteins were constructed to investigate the distribution of averaged CS values of six nuclei ((13)CO, (13)Cα, (13)Cβ, (1)HN, (1)Hα and (15)N) in three protein structural classes. Subsequently, support vector machine was proposed to predict three protein structural classes by using averaged CS information of six nuclei. Overall accuracy of jackknife cross-validation achieves 87.0%. Finally, the feature selection technique is applied to exclude redundant information and find out an optimized feature set. Results show that the overall accuracy increased to 88.0% by using the averaged CSs of (13)CO, (1)Hα and (15)N. The proposed approach outperformed other state-of-the-art methods in terms of predictive accuracy in particular for low-similarity protein data. We expect that our proposed approach will be an excellent alternative to traditional methods for protein structural class prediction.

摘要

蛋白质结构类别的知识可以提供关于其折叠模式的重要信息。已经开发了许多方法来预测蛋白质结构类别。然而,这些方法所使用的信息主要基于氨基酸序列。在这项研究中,提出了一种新的方法,通过使用从核磁共振谱中得出的化学位移(CS)信息来预测蛋白质结构类别。首先,构建了 399 个非同源(约 15%的同一性)蛋白质,以研究三种蛋白质结构类别中六个核((13)CO、(13)Cα、(13)Cβ、(1)HN、(1)Hα 和 (15)N)的平均 CS 值的分布。随后,提出了支持向量机来使用六个核的平均 CS 信息来预测三种蛋白质结构类别。Jackknife 交叉验证的总体准确率达到 87.0%。最后,应用特征选择技术排除冗余信息并找到优化的特征集。结果表明,通过使用 (13)CO、(1)Hα 和 (15)N 的平均 CS,整体准确率提高到 88.0%。与其他最先进的方法相比,该方法在预测准确性方面表现更好,特别是对于低相似度的蛋白质数据。我们期望我们提出的方法将成为蛋白质结构类别预测的传统方法的极好替代方法。

相似文献

1
The prediction of protein structural class using averaged chemical shifts.利用平均化学位移预测蛋白质结构类别。
J Biomol Struct Dyn. 2012;29(6):643-9. doi: 10.1080/07391102.2011.672628.
2
Using principal component analysis and support vector machine to predict protein structural class for low-similarity sequences via PSSM.基于 PSSM 利用主成分分析和支持向量机预测低相似度序列的蛋白质结构类别
J Biomol Struct Dyn. 2012;29(6):634-42. doi: 10.1080/07391102.2011.672627.
3
Prediction of protein structural class using novel evolutionary collocation-based sequence representation.使用基于新型进化搭配的序列表示法预测蛋白质结构类别。
J Comput Chem. 2008 Jul 30;29(10):1596-604. doi: 10.1002/jcc.20918.
4
High-accuracy prediction of protein structural class for low-similarity sequences based on predicted secondary structure.基于预测的二级结构对低相似度序列进行蛋白质结构类别高精度预测。
Biochimie. 2011 Apr;93(4):710-4. doi: 10.1016/j.biochi.2011.01.001. Epub 2011 Jan 13.
5
Prediction of protein structural classes by Chou's pseudo amino acid composition: approached using continuous wavelet transform and principal component analysis.基于周式伪氨基酸组成预测蛋白质结构类别:采用连续小波变换和主成分分析方法
Amino Acids. 2009 Jul;37(2):415-25. doi: 10.1007/s00726-008-0170-2. Epub 2008 Aug 23.
6
Accurate prediction of protein structural classes using functional domains and predicted secondary structure sequences.使用功能域和预测的二级结构序列准确预测蛋白质结构类别。
J Biomol Struct Dyn. 2012;29(6):623-33. doi: 10.1080/07391102.2011.672626.
7
Accurate prediction of protein structural classes by incorporating predicted secondary structure information into the general form of Chou's pseudo amino acid composition.通过将预测的二级结构信息纳入周的伪氨基酸组成的通用形式,准确预测蛋白质结构类别。
J Theor Biol. 2014 Mar 7;344:12-8. doi: 10.1016/j.jtbi.2013.11.021. Epub 2013 Dec 6.
8
Identify five kinds of simple super-secondary structures with quadratic discriminant algorithm based on the chemical shifts.基于化学位移,用二次判别算法识别五种简单的超二级结构。
J Theor Biol. 2015 Sep 7;380:392-8. doi: 10.1016/j.jtbi.2015.06.006. Epub 2015 Jun 16.
9
Prediction of protein structural classes using support vector machines.使用支持向量机预测蛋白质结构类别。
Amino Acids. 2006 Jun;30(4):469-75. doi: 10.1007/s00726-005-0239-0. Epub 2006 Apr 20.
10
Prediction of protein structural class for low-similarity sequences using support vector machine and PSI-BLAST profile.使用支持向量机和 PSI-BLAST 轮廓预测低相似度序列的蛋白质结构类别。
Biochimie. 2010 Oct;92(10):1330-4. doi: 10.1016/j.biochi.2010.06.013. Epub 2010 Jun 23.

引用本文的文献

1
iSUMOK-PseAAC: prediction of lysine sumoylation sites using statistical moments and Chou's PseAAC.iSUMOK-PseAAC:利用统计矩和周氏伪氨基酸组成预测赖氨酸的类泛素化位点
PeerJ. 2021 Aug 4;9:e11581. doi: 10.7717/peerj.11581. eCollection 2021.
2
Prediction of secondary structure population and intrinsic disorder of proteins using multitask deep learning.利用多任务深度学习预测蛋白质的二级结构和固有无序性。
AMIA Annu Symp Proc. 2021 Jan 25;2020:1325-1334. eCollection 2020.
3
Discrimination of Thermophilic Proteins and Non-thermophilic Proteins Using Feature Dimension Reduction.
基于特征降维的嗜热蛋白与非嗜热蛋白鉴别
Front Bioeng Biotechnol. 2020 Oct 22;8:584807. doi: 10.3389/fbioe.2020.584807. eCollection 2020.
4
AtbPpred: A Robust Sequence-Based Prediction of Anti-Tubercular Peptides Using Extremely Randomized Trees.AtbPpred:使用极端随机树对抗结核肽进行基于序列的稳健预测。
Comput Struct Biotechnol J. 2019 Jul 3;17:972-981. doi: 10.1016/j.csbj.2019.06.024. eCollection 2019.
5
Meta-4mCpred: A Sequence-Based Meta-Predictor for Accurate DNA 4mC Site Prediction Using Effective Feature Representation.Meta-4mCpred:一种基于序列的元预测器,用于通过有效特征表示准确预测DNA 4mC位点。
Mol Ther Nucleic Acids. 2019 Jun 7;16:733-744. doi: 10.1016/j.omtn.2019.04.019. Epub 2019 Apr 30.
6
Predicting Ion Channels Genes and Their Types With Machine Learning Techniques.运用机器学习技术预测离子通道基因及其类型。
Front Genet. 2019 May 3;10:399. doi: 10.3389/fgene.2019.00399. eCollection 2019.
7
Identification of S-nitrosylation sites based on multiple features combination.基于多种特征组合的 S-亚硝酰化位点鉴定。
Sci Rep. 2019 Feb 28;9(1):3098. doi: 10.1038/s41598-019-39743-9.
8
iGHBP: Computational identification of growth hormone binding proteins from sequences using extremely randomised tree.iGHBP:使用极端随机树从序列中对生长激素结合蛋白进行计算识别。
Comput Struct Biotechnol J. 2018 Oct 24;16:412-420. doi: 10.1016/j.csbj.2018.10.007. eCollection 2018.
9
Improving Protein Gamma-Turn Prediction Using Inception Capsule Networks.利用 inception 胶囊网络提高蛋白质 γ-转角预测。
Sci Rep. 2018 Oct 24;8(1):15741. doi: 10.1038/s41598-018-34114-2.
10
PIP-EL: A New Ensemble Learning Method for Improved Proinflammatory Peptide Predictions.PIP-EL:一种用于改进促炎肽预测的新集成学习方法。
Front Immunol. 2018 Jul 31;9:1783. doi: 10.3389/fimmu.2018.01783. eCollection 2018.