Prediction of Protein Secondary Structure with two-stage multi-class SVMs.

Authors

Nguyen Minh N, Rajapakse Jagath C

Affiliation

BioInformatics Research Centre, School of Computer Engineering, Nanyang Technological University, Singapore.

Publication information

Int J Data Min Bioinform. 2007;1(3):248-69. doi: 10.1504/ijdmb.2007.011612.

DOI:10.1504/ijdmb.2007.011612
PMID:18399074
Abstract

Bioinformatics techniques for Protein Secondary Structure (PSS) prediction mostly depend on the information available in amino acid sequences. In this paper, we propose a two-stage Multi-class Support Vector Machine (MSVM) approach, in which a second MSVM predictor is introduced at the output of the first-stage MSVM to capture the contextual relationship among secondary structure elements and thereby minimise the generalisation error of the prediction. Using position-specific scoring matrices generated by PSI-BLAST, the two-stage MSVM approach achieves Q3 accuracies of 78.0% and 76.3% on the RS126 dataset of 126 non-homologous globular proteins and the CB396 dataset of 396 non-homologous proteins, respectively, which are better than the scores reported on both datasets to date. By using MSVM, the present prediction scheme achieves significant improvements of 2-6% in Q3 accuracy and 3-15% in Sov accuracy on the two datasets. On larger blind-test datasets from PSIPRED, CASP4 and EVA, the two-stage MSVM approach achieves Q3 accuracies from 77.0% to 79.5%.
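
The two ideas the abstract leans on can be illustrated with a minimal Python sketch; it is not the authors' implementation. `q3_accuracy` computes the usual three-state per-residue accuracy (percentage of residues whose helix/strand/coil label is predicted correctly). `second_stage_inputs` shows one plausible way a second-stage predictor could see context: it concatenates the first-stage class scores of neighbouring residues in a sliding window. The function names, the `H`/`E`/`C` alphabet, the `half_window` parameter, and the zero padding at chain ends are all assumptions made for illustration; the abstract does not specify them.

```python
from typing import List, Sequence

# Assumed three-state alphabet: helix, strand, coil.
STATES = ("H", "E", "C")


def q3_accuracy(predicted: str, observed: str) -> float:
    """Percentage of residues whose three-state label is predicted correctly."""
    if len(predicted) != len(observed):
        raise ValueError("sequences must have equal length")
    if not observed:
        raise ValueError("sequences must be non-empty")
    correct = sum(p == o for p, o in zip(predicted, observed))
    return 100.0 * correct / len(observed)


def second_stage_inputs(
    scores: Sequence[Sequence[float]], half_window: int = 2
) -> List[List[float]]:
    """Build one feature vector per residue from first-stage scores.

    scores[i] holds the first-stage score for each state at residue i.
    Each output row concatenates the scores of residues
    i - half_window .. i + half_window; positions beyond the chain ends
    are zero-padded (an assumption, not taken from the paper).
    """
    pad = [0.0] * len(STATES)
    features = []
    for i in range(len(scores)):
        row: List[float] = []
        for j in range(i - half_window, i + half_window + 1):
            row.extend(scores[j] if 0 <= j < len(scores) else pad)
        features.append(row)
    return features
```

Each row returned by `second_stage_inputs` would then be passed to a second multi-class classifier, so that the final label of a residue depends on what the first stage predicted for its neighbours as well as for the residue itself.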

Similar articles

1
Prediction of Protein Secondary Structure with two-stage multi-class SVMs.
Int J Data Min Bioinform. 2007;1(3):248-69. doi: 10.1504/ijdmb.2007.011612.
2
Two-stage multi-class support vector machines to protein secondary structure prediction.
Pac Symp Biocomput. 2005:346-57. doi: 10.1142/9789812702456_0033.
3
Prediction of beta-turns at over 80% accuracy based on an ensemble of predicted secondary structures and multiple alignments.
BMC Bioinformatics. 2008 Oct 10;9:430. doi: 10.1186/1471-2105-9-430.
4
HYPROSP II--a knowledge-based hybrid method for protein secondary structure prediction based on local prediction confidence.
Bioinformatics. 2005 Aug 1;21(15):3227-33. doi: 10.1093/bioinformatics/bti524. Epub 2005 Jun 2.
5
YASSPP: better kernels and coding schemes lead to improvements in protein secondary structure prediction.
Proteins. 2006 Aug 15;64(3):575-86. doi: 10.1002/prot.21036.
6
Secondary structure prediction with support vector machines.
Bioinformatics. 2003 Sep 1;19(13):1650-5. doi: 10.1093/bioinformatics/btg223.
7
A neural network method for prediction of beta-turn types in proteins using evolutionary information.
Bioinformatics. 2004 Nov 1;20(16):2751-8. doi: 10.1093/bioinformatics/bth322. Epub 2004 May 14.
8
SVM-Fold: a tool for discriminative multi-class protein fold and superfamily recognition.
BMC Bioinformatics. 2007 May 22;8 Suppl 4(Suppl 4):S2. doi: 10.1186/1471-2105-8-S4-S2.
9
Subcellular location prediction of proteins using support vector machines with alignment of block sequences utilizing amino acid composition.
BMC Bioinformatics. 2007 Nov 30;8:466. doi: 10.1186/1471-2105-8-466.
10
Evaluation of methods for predicting the topology of beta-barrel outer membrane proteins and a consensus prediction method.
BMC Bioinformatics. 2005 Jan 12;6:7. doi: 10.1186/1471-2105-6-7.

Cited by

1
Why Is There a Glass Ceiling for Threading Based Protein Structure Prediction Methods?
J Phys Chem B. 2017 Apr 20;121(15):3546-3554. doi: 10.1021/acs.jpcb.6b09517. Epub 2016 Oct 26.
2
From local structure to a global framework: recognition of protein folds.
J R Soc Interface. 2014 Apr 16;11(95):20131147. doi: 10.1098/rsif.2013.1147. Print 2014 Jun 6.
3
An ensemble method for predicting subnuclear localizations from primary protein structures.
PLoS One. 2013;8(2):e57225. doi: 10.1371/journal.pone.0057225. Epub 2013 Feb 27.
4
Improving the performance of β-turn prediction using predicted shape strings and a two-layer support vector machine model.
BMC Bioinformatics. 2011 Jul 13;12:283. doi: 10.1186/1471-2105-12-283.
5
Template-based protein modeling: recent methodological advances.
Curr Top Med Chem. 2010;10(1):84-94. doi: 10.2174/156802610790232314.