• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

将二级结构特征纳入序列信息以预测蛋白质结构类别。

Incorporating secondary structural features into sequence information for predicting protein structural class.

作者信息

Liao Bo, Peng Ting, Chen Haowen, Lin Yaping

机构信息

College of Information science and Engineering, Hunan University, Changsha, Hunan, 410082, China.

出版信息

Protein Pept Lett. 2013 Oct;20(10):1079-87. doi: 10.2174/09298665113209990002.

DOI:10.2174/09298665113209990002
PMID:23688152
Abstract

Knowledge of structural classes is applied in numerous important predictive tasks that address structural and functional features of proteins, although the prediction accuracy of the protein structural classes is not high. In this study, 45 different features were rationally designed to model the differences between protein structural classes, among which, 30 of them reflect the combined protein sequence information. In terms of correlation function, the protein sequence can be converted to a digital signal sequence, from which we can generate 20 discrete Fourier spectrum numbers. According to the segments of amino with different characteristics occurring in protein sequences, the frequencies of the 10 kinds of segments of amino acid (motifs) in protein are calculated. Other features include the secondary structural information :10 features were proposed to model the strong adjacent correlations in the secondary structural elements and capture the long-range spatial interactions between secondary structures, other 5 features were designed to differentiate α/β from α+β classes , which is a major problem of the existing algorithm. The methods were proposed based on a large set of low-identity sequences for which secondary structure is predicted from their sequence (based on PSI-PRED). By means of this method, the overall prediction accuracy of four benchmark datasets were all improved. Especially for the dataset FC699, 25PDB and D1189 which are 1.26%, 1% and 0.85% higher than the best previous method respectively.

摘要

尽管蛋白质结构类别的预测准确性不高,但结构类别的知识仍应用于许多重要的预测任务中,这些任务涉及蛋白质的结构和功能特征。在本研究中,合理设计了45种不同的特征来模拟蛋白质结构类别之间的差异,其中30种反映了蛋白质序列的组合信息。就相关函数而言,蛋白质序列可以转换为数字信号序列,从中我们可以生成20个离散傅里叶频谱数。根据蛋白质序列中出现的具有不同特征的氨基酸片段,计算蛋白质中10种氨基酸片段(基序)的频率。其他特征包括二级结构信息:提出了10个特征来模拟二级结构元件中的强相邻相关性,并捕捉二级结构之间的远程空间相互作用,另外5个特征旨在区分α/β类和α+β类,这是现有算法的一个主要问题。这些方法是基于大量低同源性序列提出的,其二级结构是根据它们的序列预测的(基于PSI-PRED)。通过这种方法,四个基准数据集的总体预测准确性均得到了提高。特别是对于数据集FC699、25PDB和D1189,分别比之前最好的方法高出1.26%、1%和0.85%。

相似文献

1
Incorporating secondary structural features into sequence information for predicting protein structural class.将二级结构特征纳入序列信息以预测蛋白质结构类别。
Protein Pept Lett. 2013 Oct;20(10):1079-87. doi: 10.2174/09298665113209990002.
2
A high-accuracy protein structural class prediction algorithm using predicted secondary structural information.利用预测的二级结构信息进行高精度蛋白质结构类预测算法。
J Theor Biol. 2010 Dec 7;267(3):272-5. doi: 10.1016/j.jtbi.2010.09.007. Epub 2010 Sep 8.
3
High-accuracy prediction of protein structural class for low-similarity sequences based on predicted secondary structure.基于预测的二级结构对低相似度序列进行蛋白质结构类别高精度预测。
Biochimie. 2011 Apr;93(4):710-4. doi: 10.1016/j.biochi.2011.01.001. Epub 2011 Jan 13.
4
Predicting structural class for protein sequences of 40% identity based on features of primary and secondary structure using Random Forest algorithm.基于一级和二级结构特征,使用随机森林算法预测相似度为 40%的蛋白质序列的结构类别。
Comput Biol Chem. 2020 Feb;84:107164. doi: 10.1016/j.compbiolchem.2019.107164. Epub 2019 Nov 15.
5
Prediction of protein structural classes for low-homology sequences based on predicted secondary structure.基于预测的二级结构预测低同源序列的蛋白质结构类别。
BMC Bioinformatics. 2010 Jan 18;11 Suppl 1(Suppl 1):S9. doi: 10.1186/1471-2105-11-S1-S9.
6
Accurate prediction of protein structural classes by incorporating predicted secondary structure information into the general form of Chou's pseudo amino acid composition.通过将预测的二级结构信息纳入周的伪氨基酸组成的通用形式,准确预测蛋白质结构类别。
J Theor Biol. 2014 Mar 7;344:12-8. doi: 10.1016/j.jtbi.2013.11.021. Epub 2013 Dec 6.
7
A protein structural classes prediction method based on predicted secondary structure and PSI-BLAST profile.基于预测二级结构和 PSI-BLAST -profile 的蛋白质结构类预测方法。
Biochimie. 2014 Feb;97:60-5. doi: 10.1016/j.biochi.2013.09.013. Epub 2013 Sep 22.
8
Modular prediction of protein structural classes from sequences of twilight-zone identity with predicting sequences.从与预测序列具有 twilight-zone 身份的序列中预测蛋白质结构类别
BMC Bioinformatics. 2009 Dec 13;10:414. doi: 10.1186/1471-2105-10-414.
9
Accurate prediction of protein structural classes using functional domains and predicted secondary structure sequences.使用功能域和预测的二级结构序列准确预测蛋白质结构类别。
J Biomol Struct Dyn. 2012;29(6):623-33. doi: 10.1080/07391102.2011.672626.
10
SVM-based method for protein structural class prediction using secondary structural content and structural information of amino acids.基于支持向量机的蛋白质结构类预测方法,该方法利用二级结构含量和氨基酸的结构信息。
J Bioinform Comput Biol. 2011 Aug;9(4):489-502. doi: 10.1142/s0219720011005422.

引用本文的文献

1
Application of data mining tools for classification of protein structural class from residue based averaged NMR chemical shifts.基于残基平均核磁共振化学位移,应用数据挖掘工具对蛋白质结构类别进行分类。
Biochim Biophys Acta. 2015 Oct;1854(10 Pt A):1545-52. doi: 10.1016/j.bbapap.2015.02.016. Epub 2015 Mar 7.