• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于改进支持向量机方法的蛋白质二级结构预测

Protein secondary structure prediction based on an improved support vector machines approach.

作者信息

Kim Hyunsoo, Park Haesun

机构信息

Department of Computer Science and Engineering, University of Minnesota, Minneapolis, MN 55455, USA.

出版信息

Protein Eng. 2003 Aug;16(8):553-60. doi: 10.1093/protein/gzg072.

DOI:10.1093/protein/gzg072
PMID:12968073
Abstract

The prediction of protein secondary structure is an important step in the prediction of protein tertiary structure. A new protein secondary structure prediction method, SVMpsi, was developed to improve the current level of prediction by incorporating new tertiary classifiers and their jury decision system, and the PSI-BLAST PSSM profiles. Additionally, efficient methods to handle unbalanced data and a new optimization strategy for maximizing the Q(3) measure were developed. The SVMpsi produces the highest published Q(3) and SOV94 scores on both the RS126 and CB513 data sets to date. For a new KP480 set, the prediction accuracy of SVMpsi was Q(3) = 78.5% and SOV94 = 82.8%. Moreover, the blind test results for 136 non-redundant protein sequences which do not contain homologues of training data sets were Q(3) = 77.2% and SOV94 = 81.8%. The SVMpsi results in CASP5 illustrate that it is another competitive method to predict protein secondary structure.

摘要

蛋白质二级结构预测是蛋白质三级结构预测中的重要一步。一种新的蛋白质二级结构预测方法SVMpsi被开发出来,通过纳入新的三级分类器及其评判决策系统以及PSI-BLAST PSSM图谱来提高当前的预测水平。此外,还开发了处理不平衡数据的有效方法以及用于最大化Q(3)度量的新优化策略。到目前为止,SVMpsi在RS126和CB513数据集上产生了已发表的最高Q(3)和SOV94分数。对于新的KP480数据集,SVMpsi的预测准确率为Q(3)=78.5%,SOV94=82.8%。此外,对136个不包含训练数据集同源物的非冗余蛋白质序列的盲测结果为Q(3)=77.2%,SOV94=81.8%。SVMpsi在CASP5中的结果表明它是预测蛋白质二级结构的另一种有竞争力的方法。

相似文献

1
Protein secondary structure prediction based on an improved support vector machines approach.基于改进支持向量机方法的蛋白质二级结构预测
Protein Eng. 2003 Aug;16(8):553-60. doi: 10.1093/protein/gzg072.
2
Extracting physicochemical features to predict protein secondary structure.提取物理化学特征以预测蛋白质二级结构。
ScientificWorldJournal. 2013 May 14;2013:347106. doi: 10.1155/2013/347106. Print 2013.
3
Prediction of protein relative solvent accessibility with support vector machines and long-range interaction 3D local descriptor.使用支持向量机和远程相互作用3D局部描述符预测蛋白质相对溶剂可及性。
Proteins. 2004 Feb 15;54(3):557-62. doi: 10.1002/prot.10602.
4
Two-stage multi-class support vector machines to protein secondary structure prediction.用于蛋白质二级结构预测的两阶段多类支持向量机
Pac Symp Biocomput. 2005:346-57. doi: 10.1142/9789812702456_0033.
5
Rule generation for protein secondary structure prediction with support vector machines and decision tree.使用支持向量机和决策树进行蛋白质二级结构预测的规则生成
IEEE Trans Nanobioscience. 2006 Mar;5(1):46-53. doi: 10.1109/tnb.2005.864021.
6
Prediction of beta-turns at over 80% accuracy based on an ensemble of predicted secondary structures and multiple alignments.基于预测的二级结构集合和多重比对,以超过80%的准确率预测β转角。
BMC Bioinformatics. 2008 Oct 10;9:430. doi: 10.1186/1471-2105-9-430.
7
Predicting protein secondary structure by a support vector machine based on a new coding scheme.基于一种新编码方案的支持向量机预测蛋白质二级结构
Genome Inform. 2004;15(2):181-90.
8
Toward better understanding of protein secondary structure: extracting prediction rules.为了更好地理解蛋白质二级结构:提取预测规则。
IEEE/ACM Trans Comput Biol Bioinform. 2011 May-Jun;8(3):858-64. doi: 10.1109/TCBB.2010.16.
9
Improving protein secondary structure prediction using a multi-modal BP method.利用多模态 BP 方法改进蛋白质二级结构预测。
Comput Biol Med. 2011 Oct;41(10):946-59. doi: 10.1016/j.compbiomed.2011.08.005. Epub 2011 Aug 30.
10
Prediction of Protein Secondary Structure with two-stage multi-class SVMs.基于两阶段多分类支持向量机的蛋白质二级结构预测
Int J Data Min Bioinform. 2007;1(3):248-69. doi: 10.1504/ijdmb.2007.011612.

引用本文的文献

1
Deep Learning for Genomics: From Early Neural Nets to Modern Large Language Models.深度学习在基因组学中的应用:从早期神经网络到现代大型语言模型。
Int J Mol Sci. 2023 Nov 1;24(21):15858. doi: 10.3390/ijms242115858.
2
AI applications in functional genomics.人工智能在功能基因组学中的应用。
Comput Struct Biotechnol J. 2021 Oct 11;19:5762-5790. doi: 10.1016/j.csbj.2021.10.009. eCollection 2021.
3
A secondary structure-based position-specific scoring matrix applied to the improvement in protein secondary structure prediction.基于二级结构的位置特异性评分矩阵在提高蛋白质二级结构预测中的应用。
PLoS One. 2021 Jul 28;16(7):e0255076. doi: 10.1371/journal.pone.0255076. eCollection 2021.
4
Identification of Intrinsically Disordered Proteins and Regions by Length-Dependent Predictors Based on Conditional Random Fields.基于条件随机场的长度依赖性预测器识别内在无序蛋白质及区域
Mol Ther Nucleic Acids. 2019 Sep 6;17:396-404. doi: 10.1016/j.omtn.2019.06.004. Epub 2019 Jun 15.
5
DeepACLSTM: deep asymmetric convolutional long short-term memory neural models for protein secondary structure prediction.DeepACLSTM:用于蛋白质二级结构预测的深度非对称卷积长短时记忆神经模型。
BMC Bioinformatics. 2019 Jun 17;20(1):341. doi: 10.1186/s12859-019-2940-0.
6
IDP⁻CRF: Intrinsically Disordered Protein/Region Identification Based on Conditional Random Fields.IDP⁻CRF:基于条件随机场的无序蛋白/区域识别。
Int J Mol Sci. 2018 Aug 22;19(9):2483. doi: 10.3390/ijms19092483.
7
Protein Secondary Structure Prediction Based on Data Partition and Semi-Random Subspace Method.基于数据分区和半随机子空间方法的蛋白质二级结构预测。
Sci Rep. 2018 Jun 29;8(1):9856. doi: 10.1038/s41598-018-28084-8.
8
CNNH_PSS: protein 8-class secondary structure prediction by convolutional neural network with highway.CNN_H_PSS:基于卷积神经网络和高速公路的 8 类蛋白质二级结构预测。
BMC Bioinformatics. 2018 May 8;19(Suppl 4):60. doi: 10.1186/s12859-018-2067-8.
9
SOV_refine: A further refined definition of segment overlap score and its significance for protein structure similarity.SOV细化:片段重叠分数的进一步细化定义及其对蛋白质结构相似性的意义。
Source Code Biol Med. 2018 Apr 20;13:1. doi: 10.1186/s13029-018-0068-7. eCollection 2018.
10
Predicting Protein-Protein Interaction Sites Using Sequence Descriptors and Site Propensity of Neighboring Amino Acids.利用序列描述符和相邻氨基酸的位点倾向预测蛋白质-蛋白质相互作用位点
Int J Mol Sci. 2016 Oct 26;17(11):1788. doi: 10.3390/ijms17111788.