• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于预测氨基酸可及表面积的两阶段支持向量回归方法。

Two-stage support vector regression approach for predicting accessible surface areas of amino acids.

作者信息

Nguyen Minh N, Rajapakse Jagath C

机构信息

BioInformatics Research Centre, School of Computer Engineering, Nanyang Technological University, Singapore.

出版信息

Proteins. 2006 May 15;63(3):542-50. doi: 10.1002/prot.20883.

DOI:10.1002/prot.20883
PMID:16456847
Abstract

We address the problem of predicting solvent accessible surface area (ASA) of amino acid residues in protein sequences, without classifying them into buried and exposed types. A two-stage support vector regression (SVR) approach is proposed to predict real values of ASA from the position-specific scoring matrices generated from PSI-BLAST profiles. By adding SVR as the second stage to capture the influences on the ASA value of a residue by those of its neighbors, the two-stage SVR approach achieves improvements of mean absolute errors up to 3.3%, and correlation coefficients of 0.66, 0.68, and 0.67 on the Manesh dataset of 215 proteins, the Barton dataset of 502 nonhomologous proteins, and the Carugo dataset of 338 proteins, respectively, which are better than the scores published earlier on these datasets. A Web server for protein ASA prediction by using a two-stage SVR method has been developed and is available (http://birc.ntu.edu.sg/~ pas0186457/asa.html).

摘要

我们研究了预测蛋白质序列中氨基酸残基溶剂可及表面积(ASA)的问题,而无需将它们分类为埋藏型和暴露型。提出了一种两阶段支持向量回归(SVR)方法,以从PSI-BLAST图谱生成的位置特异性评分矩阵预测ASA的实际值。通过将SVR作为第二阶段添加,以捕获相邻残基对一个残基ASA值的影响,两阶段SVR方法在215个蛋白质的Manesh数据集、502个非同源蛋白质的Barton数据集和338个蛋白质的Carugo数据集上分别实现了平均绝对误差提高3.3%,相关系数分别为0.66、0.68和0.67,这些结果优于这些数据集上先前公布的分数。已经开发了一个使用两阶段SVR方法进行蛋白质ASA预测的网络服务器,可通过(http://birc.ntu.edu.sg/~ pas0186457/asa.html)访问。

相似文献

1
Two-stage support vector regression approach for predicting accessible surface areas of amino acids.用于预测氨基酸可及表面积的两阶段支持向量回归方法。
Proteins. 2006 May 15;63(3):542-50. doi: 10.1002/prot.20883.
2
Prediction of protein relative solvent accessibility with a two-stage SVM approach.采用两阶段支持向量机方法预测蛋白质相对溶剂可及性。
Proteins. 2005 Apr 1;59(1):30-7. doi: 10.1002/prot.20404.
3
Prediction of protein accessible surface areas by support vector regression.通过支持向量回归预测蛋白质可及表面积
Proteins. 2004 Nov 15;57(3):558-64. doi: 10.1002/prot.20234.
4
SVM-Cabins: prediction of solvent accessibility using accumulation cutoff set and support vector machine.支持向量机-舱室:使用累积截止集和支持向量机预测溶剂可及性
Proteins. 2007 Jul 1;68(1):82-91. doi: 10.1002/prot.21422.
5
Real value prediction of solvent accessibility from amino acid sequence.从氨基酸序列对溶剂可及性进行实际值预测。
Proteins. 2003 Mar 1;50(4):629-35. doi: 10.1002/prot.10328.
6
Real value prediction of solvent accessibility in proteins using multiple sequence alignment and secondary structure.利用多序列比对和二级结构对蛋白质中溶剂可及性进行实际值预测。
Proteins. 2005 Nov 1;61(2):318-24. doi: 10.1002/prot.20630.
7
A machine learning based method for the prediction of secretory proteins using amino acid composition, their order and similarity-search.一种基于机器学习的方法,利用氨基酸组成、顺序和相似性搜索来预测分泌蛋白。
In Silico Biol. 2008;8(2):129-40.
8
Real-SPINE: an integrated system of neural networks for real-value prediction of protein structural properties.Real-SPINE:用于蛋白质结构特性实值预测的神经网络集成系统。
Proteins. 2007 Jul 1;68(1):76-81. doi: 10.1002/prot.21408.
9
HSEpred: predict half-sphere exposure from protein sequences.HSEpred:从蛋白质序列预测半球暴露情况。
Bioinformatics. 2008 Jul 1;24(13):1489-97. doi: 10.1093/bioinformatics/btn222. Epub 2008 May 8.
10
BTXpred: prediction of bacterial toxins.BTXpred:细菌毒素预测
In Silico Biol. 2007;7(4-5):405-12.

引用本文的文献

1
A sparse autoencoder-based deep neural network for protein solvent accessibility and contact number prediction.基于稀疏自动编码器的深度神经网络用于预测蛋白质溶剂可及性和接触数。
BMC Bioinformatics. 2017 Dec 28;18(Suppl 16):569. doi: 10.1186/s12859-017-1971-7.
2
Improving prediction of burial state of residues by exploiting correlation among residues.通过利用残基间的相关性改进对残基埋藏状态的预测。
BMC Bioinformatics. 2017 Mar 14;18(Suppl 3):70. doi: 10.1186/s12859-017-1475-5.
3
A hydrophobic spine stabilizes a surface-exposed α-helix according to analysis of the solvent-accessible surface area.
根据溶剂可及表面积分析,疏水主链稳定了表面暴露的α螺旋。
BMC Bioinformatics. 2016 Dec 22;17(Suppl 19):503. doi: 10.1186/s12859-016-1368-z.
4
FastRNABindR: Fast and Accurate Prediction of Protein-RNA Interface Residues.FastRNABindR:蛋白质-RNA 界面残基的快速准确预测
PLoS One. 2016 Jul 6;11(7):e0158445. doi: 10.1371/journal.pone.0158445. eCollection 2016.
5
PredRSA: a gradient boosted regression trees approach for predicting protein solvent accessibility.PredRSA:一种用于预测蛋白质溶剂可及性的梯度提升回归树方法。
BMC Bioinformatics. 2016 Jan 11;17 Suppl 1(Suppl 1):8. doi: 10.1186/s12859-015-0851-2.
6
Prediction of protein solvent accessibility using PSO-SVR with multiple sequence-derived features and weighted sliding window scheme.使用具有多序列衍生特征和加权滑动窗口方案的粒子群优化支持向量回归(PSO-SVR)预测蛋白质溶剂可及性
BioData Min. 2015 Jan 31;8:3. doi: 10.1186/s13040-014-0031-3. eCollection 2015.
7
Predicting secretory proteins of malaria parasite by incorporating sequence evolution information into pseudo amino acid composition via grey system model.通过将序列进化信息纳入伪氨基酸组成,利用灰色系统模型预测疟原虫分泌蛋白。
PLoS One. 2012;7(11):e49040. doi: 10.1371/journal.pone.0049040. Epub 2012 Nov 26.
8
Protein-RNA interface residue prediction using machine learning: an assessment of the state of the art.基于机器学习的蛋白质-RNA 界面残基预测:现状评估。
BMC Bioinformatics. 2012 May 10;13:89. doi: 10.1186/1471-2105-13-89.
9
TANGLE: two-level support vector regression approach for protein backbone torsion angle prediction from primary sequences.TANGLE:一种两级支持向量回归方法,用于从蛋白质一级序列预测蛋白质主链扭转角。
PLoS One. 2012;7(2):e30361. doi: 10.1371/journal.pone.0030361. Epub 2012 Feb 2.
10
A comparative study of the second-order hydrophobic moments for globular proteins: the consensus scale of hydrophobicity and the CHARMM partial atomic charges.球状蛋白质二级疏水矩的比较研究:疏水性的共识尺度与CHARMM部分原子电荷
Int J Mol Sci. 2011;12(12):8449-65. doi: 10.3390/ijms12128449. Epub 2011 Nov 29.