Extracting physicochemical features to predict protein secondary structure.

Authors

Huang Yin-Fu, Chen Shu-Ying

Affiliation

Department of Computer Science and Information Engineering, National Yunlin University of Science and Technology, 123 University Road, Section 3, Touliu, Yunlin 640, Taiwan.

Publication information

ScientificWorldJournal. 2013 May 14;2013:347106. doi: 10.1155/2013/347106. Print 2013.

DOI:10.1155/2013/347106
PMID:23766688
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3666292/

Abstract

We propose a protein secondary structure prediction method based on position-specific scoring matrix (PSSM) profiles and four physicochemical features: conformation parameters, net charges, hydrophobicity, and side chain mass. First, we determine the optimal window size and the optimal kernel function parameters for the support vector machine (SVM). Then, we train the SVM using PSSM profiles generated by PSI-BLAST and physicochemical features extracted from the CB513 data set. Finally, we apply a filter to refine the predictions of the trained SVM. Our method reaches a Q3 of 79.52, an SOV94 of 86.10, and an SOV99 of 74.60; all three measures are higher than those of the SVMpsi and SVMfreq methods. These results indicate that including these physicochemical features improves protein secondary structure prediction.
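
The abstract does not spell out how the per-residue features are turned into SVM inputs or how Q3 is computed, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes each residue is described by a fixed-length feature row (for example, PSSM columns followed by the physicochemical values), that the classifier input for a residue is the concatenation of the rows inside a centered window, and that positions beyond the chain ends are zero-padded. The function names `window_features` and `q3` are hypothetical. Q3 is computed in the usual way, as the percentage of residues whose three-state label (H, E, or C) is predicted correctly.

```python
from typing import List, Sequence


def window_features(per_residue: Sequence[Sequence[float]], window: int) -> List[List[float]]:
    """Build one input vector per residue from a centered sliding window.

    Assumption: positions that fall off either end of the chain are
    zero-padded; the paper may handle chain ends differently.
    """
    if window < 1 or window % 2 == 0:
        raise ValueError("window must be a positive odd integer")
    half = window // 2
    n = len(per_residue)
    width = len(per_residue[0]) if n else 0
    pad = [0.0] * width
    vectors = []
    for i in range(n):
        row: List[float] = []
        # Concatenate the feature rows of residues i-half .. i+half.
        for j in range(i - half, i + half + 1):
            row.extend(per_residue[j] if 0 <= j < n else pad)
        vectors.append(row)
    return vectors


def q3(predicted: str, observed: str) -> float:
    """Percentage of residues whose three-state label is predicted correctly."""
    if len(predicted) != len(observed) or not observed:
        raise ValueError("label strings must be non-empty and of equal length")
    correct = sum(p == o for p, o in zip(predicted, observed))
    return 100.0 * correct / len(observed)


# Toy data: three residues with two features each (illustrative values only).
toy = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
vectors = window_features(toy, window=3)
score = q3("HHEC", "HHCC")
```

With a window of 3 and two features per residue, each vector in `vectors` has six entries; the resulting vectors could then be passed to any SVM implementation for training.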
