• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于一种从蛋白质一级结构提取特征的新方法对多类同型寡聚体进行分类

[Classification of multi-class homo-oligomer based on a novel method of feature extraction from protein primary structure].

作者信息

Zhang Shaowu, Pan Quan, Zhao Chunhui, Cheng Yongmei

机构信息

School of Automatic Control, Northwestern Polytechnic University, Xi'an 710072, China.

出版信息

Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2007 Aug;24(4):721-6.

PMID:17899731
Abstract

A novel method of feature extraction from protein primary structure has been proposed and applied to classify the protein homodimer, homotrimer, homotetramer and homohexamer, i. e. one protein sequence can be represented by a feature vector composed of amino acid compositions and a set of weighted auto-correlation function factors of amino acid residue index. As a result, high classification accuracies are obtained. For example, with the same support vector machine (SVM), the total accuracies of QIANA, AIANB, MEEJ, ROBB and SNEP sets based on this novel feature extraction method are 77.63, 77.16, 76.46, 76.70 and 75.06% respectively in Jackknife test, which are 6.39, 5.92, 5.22, 5.46 and 3.82 percent points respectively higher than that of COMP set based on the conventional method composed of amino acid compositions. With the same QIANA set, the total accuracy of SVM is 77.63%, which is 16.29 percent points higher than that of covariant discriminant algorithm. These results show: (1) The novel feature extraction method is effective and feasible, and the feature vectors based on this method may contain more protein quaternary structure information and appear to capture essential information about the composition and hydrophobicity of residues in the surface patches buried in the interfaces of associated subunits; (2) SVM can be referred as a powerful computational tool for classifying the homo-oligomers of proteins.

摘要

一种从蛋白质一级结构中提取特征的新方法已被提出,并应用于对蛋白质同二聚体、同三聚体、同四聚体和同六聚体进行分类,即一个蛋白质序列可以由一个由氨基酸组成和一组氨基酸残基索引的加权自相关函数因子组成的特征向量来表示。结果,获得了较高的分类准确率。例如,使用相同的支持向量机(SVM),在留一法测试中,基于这种新特征提取方法的QIANA、AIANB、MEEJ、ROBB和SNEP集的总准确率分别为77.63%、77.16%、76.46%、76.70%和75.06%,分别比基于由氨基酸组成的传统方法的COMP集高出6.39、5.92、5.22、5.46和3.82个百分点。对于相同的QIANA集,SVM的总准确率为77.63%,比协变判别算法高出16.29个百分点。这些结果表明:(1)这种新的特征提取方法是有效可行的,基于该方法的特征向量可能包含更多的蛋白质四级结构信息,并且似乎捕捉到了关于埋藏在相关亚基界面中的表面斑块中残基组成和疏水性的基本信息;(2)SVM可被视为一种强大的计算工具,用于对蛋白质的同寡聚体进行分类。

相似文献

1
[Classification of multi-class homo-oligomer based on a novel method of feature extraction from protein primary structure].基于一种从蛋白质一级结构提取特征的新方法对多类同型寡聚体进行分类
Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2007 Aug;24(4):721-6.
2
Prediction of protein homo-oligomer types by pseudo amino acid composition: Approached with an improved feature extraction and Naive Bayes Feature Fusion.基于伪氨基酸组成预测蛋白质同源寡聚体类型:采用改进的特征提取和朴素贝叶斯特征融合方法
Amino Acids. 2006 Jun;30(4):461-8. doi: 10.1007/s00726-006-0263-8. Epub 2006 May 15.
3
Support Vector Machine-based classification of protein folds using the structural properties of amino acid residues and amino acid residue pairs.基于支持向量机,利用氨基酸残基和氨基酸残基对的结构特性对蛋白质折叠进行分类。
Bioinformatics. 2007 Dec 15;23(24):3320-7. doi: 10.1093/bioinformatics/btm527. Epub 2007 Nov 7.
4
Classification of protein quaternary structure with support vector machine.用支持向量机对蛋白质四级结构进行分类。
Bioinformatics. 2003 Dec 12;19(18):2390-6. doi: 10.1093/bioinformatics/btg331.
5
Prediction of protein structural classes by Chou's pseudo amino acid composition: approached using continuous wavelet transform and principal component analysis.基于周式伪氨基酸组成预测蛋白质结构类别:采用连续小波变换和主成分分析方法
Amino Acids. 2009 Jul;37(2):415-25. doi: 10.1007/s00726-008-0170-2. Epub 2008 Aug 23.
6
Prediction of protein structural class with Rough Sets.基于粗糙集的蛋白质结构类预测
BMC Bioinformatics. 2006 Jan 14;7:20. doi: 10.1186/1471-2105-7-20.
7
Prediction of protein folds: extraction of new features, dimensionality reduction, and fusion of heterogeneous classifiers.蛋白质折叠预测:新特征提取、降维及异构分类器融合
IEEE Trans Nanobioscience. 2009 Mar;8(1):100-10. doi: 10.1109/TNB.2009.2016488. Epub 2009 Mar 10.
8
Boosting classifier for predicting protein domain structural class.用于预测蛋白质结构域结构类别的增强分类器。
Biochem Biophys Res Commun. 2005 Aug 19;334(1):213-7. doi: 10.1016/j.bbrc.2005.06.075.
9
Prediction of protein structure class by coupling improved genetic algorithm and support vector machine.结合改进遗传算法与支持向量机预测蛋白质结构类别
Amino Acids. 2008 Oct;35(3):581-90. doi: 10.1007/s00726-008-0084-z. Epub 2008 Apr 22.
10
A novel method for apoptosis protein subcellular localization prediction combining encoding based on grouped weight and support vector machine.一种结合基于分组权重编码和支持向量机的凋亡蛋白亚细胞定位预测新方法。
FEBS Lett. 2006 Nov 13;580(26):6169-74. doi: 10.1016/j.febslet.2006.10.017. Epub 2006 Oct 17.