• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用位置特异性评分矩阵的三元概率和递归特征消除预测蛋白质结构类别。

Prediction of protein structural class using tri-gram probabilities of position-specific scoring matrix and recursive feature elimination.

作者信息

Tao Peiying, Liu Taigang, Li Xiaowei, Chen Lanming

机构信息

College of Food Science and Technology, Shanghai Ocean University, Shanghai, 201306, China.

出版信息

Amino Acids. 2015 Mar;47(3):461-8. doi: 10.1007/s00726-014-1878-9. Epub 2015 Jan 13.

DOI:10.1007/s00726-014-1878-9
PMID:25583603
Abstract

Knowledge of structural class plays an important role in understanding protein folding patterns. As a transitional stage in recognition of three-dimensional structure of a protein, protein structural class prediction is considered to be an important and challenging task. In this study, we firstly introduce a feature extraction technique which is based on tri-grams computed directly from position-specific scoring matrix (PSSM). A total of 8,000 features are extracted to represent a protein. Then, support vector machine-recursive feature elimination (SVM-RFE) is applied for feature selection and reduced features are input to a support vector machine (SVM) classifier to predict structural class of a given protein. To examine the effectiveness of our method, jackknife tests are performed on six widely used benchmark datasets, i.e., Z277, Z498, 1189, 25PDB, D640, and D1185. The overall accuracies of 97.1, 98.6, 92.5, 93.5, 94.2, and 95.9% are achieved on these datasets, respectively. Comparison of the proposed method with other prediction methods shows that our method is very promising to perform the prediction of protein structural class.

摘要

了解蛋白质结构类别在理解蛋白质折叠模式方面起着重要作用。作为识别蛋白质三维结构的一个过渡阶段,蛋白质结构类别预测被认为是一项重要且具有挑战性的任务。在本研究中,我们首先介绍一种基于直接从位置特异性得分矩阵(PSSM)计算得到的三元组的特征提取技术。共提取8000个特征来表示一个蛋白质。然后,应用支持向量机递归特征消除(SVM-RFE)进行特征选择,并将减少后的特征输入到支持向量机(SVM)分类器中以预测给定蛋白质的结构类别。为检验我们方法的有效性,在六个广泛使用的基准数据集,即Z277、Z498、1189、25PDB、D640和D1185上进行留一法测试。在这些数据集上分别获得了97.1%、98.6%、92.5%、93.5%、94.2%和95.9%的总体准确率。将所提出的方法与其他预测方法进行比较表明,我们的方法在进行蛋白质结构类别预测方面非常有前景。

相似文献

1
Prediction of protein structural class using tri-gram probabilities of position-specific scoring matrix and recursive feature elimination.利用位置特异性评分矩阵的三元概率和递归特征消除预测蛋白质结构类别。
Amino Acids. 2015 Mar;47(3):461-8. doi: 10.1007/s00726-014-1878-9. Epub 2015 Jan 13.
2
A highly accurate protein structural class prediction approach using auto cross covariance transformation and recursive feature elimination.一种使用自动交叉协方差变换和递归特征消除的高精度蛋白质结构类别预测方法。
Comput Biol Chem. 2015 Dec;59 Pt A:95-100. doi: 10.1016/j.compbiolchem.2015.08.012. Epub 2015 Sep 2.
3
Prediction of subcellular location of apoptosis proteins combining tri-gram encoding based on PSSM and recursive feature elimination.基于位置特异性得分矩阵(PSSM)的三元组编码与递归特征消除相结合预测凋亡蛋白的亚细胞定位
J Theor Biol. 2015 Feb 7;366:8-12. doi: 10.1016/j.jtbi.2014.11.010. Epub 2014 Nov 20.
4
A protein structural classes prediction method based on PSI-BLAST profile.一种基于PSI-BLAST序列谱的蛋白质结构类预测方法。
J Theor Biol. 2014 Jul 21;353:19-23. doi: 10.1016/j.jtbi.2014.02.034. Epub 2014 Mar 4.
5
Prediction of Protein Structural Class Based on Gapped-Dipeptides and a Recursive Feature Selection Approach.基于带间隙二肽和递归特征选择方法的蛋白质结构类预测
Int J Mol Sci. 2015 Dec 24;17(1):15. doi: 10.3390/ijms17010015.
6
Prediction of protein structural class for low-similarity sequences using support vector machine and PSI-BLAST profile.使用支持向量机和 PSI-BLAST 轮廓预测低相似度序列的蛋白质结构类别。
Biochimie. 2010 Oct;92(10):1330-4. doi: 10.1016/j.biochi.2010.06.013. Epub 2010 Jun 23.
7
PSSP-RFE: accurate prediction of protein structural class by recursive feature extraction from PSI-BLAST profile, physical-chemical property and functional annotations.PSSP-RFE:通过从PSI-BLAST序列谱、物理化学性质和功能注释中进行递归特征提取来准确预测蛋白质结构类别。
PLoS One. 2014 Mar 27;9(3):e92863. doi: 10.1371/journal.pone.0092863. eCollection 2014.
8
Prediction of Protein Structural Classes for Low-Similarity Sequences Based on Consensus Sequence and Segmented PSSM.基于一致序列和分段位置特异性得分矩阵预测低相似性序列的蛋白质结构类别
Comput Math Methods Med. 2015;2015:370756. doi: 10.1155/2015/370756. Epub 2015 Dec 15.
9
A tri-gram based feature extraction technique using linear probabilities of position specific scoring matrix for protein fold recognition.基于三字母词的特征提取技术,利用位置特定评分矩阵的线性概率进行蛋白质折叠识别。
IEEE Trans Nanobioscience. 2014 Mar;13(1):44-50. doi: 10.1109/TNB.2013.2296050.
10
A protein structural classes prediction method based on predicted secondary structure and PSI-BLAST profile.基于预测二级结构和 PSI-BLAST -profile 的蛋白质结构类预测方法。
Biochimie. 2014 Feb;97:60-5. doi: 10.1016/j.biochi.2013.09.013. Epub 2013 Sep 22.

引用本文的文献

1
Recursive Feature Elimination by Sensitivity Testing.通过敏感性测试进行递归特征消除
Proc Int Conf Mach Learn Appl. 2018 Dec;2018:40-47. doi: 10.1109/ICMLA.2018.00014. Epub 2019 Jan 17.
2
ProTstab - predictor for cellular protein stability.ProTstab - 细胞蛋白质稳定性预测工具
BMC Genomics. 2019 Nov 4;20(1):804. doi: 10.1186/s12864-019-6138-7.
3
iAPSL-IF: Identification of Apoptosis Protein Subcellular Location Using Integrative Features Captured from Amino Acid Sequences.iAPSL-IF:利用从氨基酸序列中提取的综合特征识别细胞凋亡蛋白亚细胞定位。
Int J Mol Sci. 2018 Apr 13;19(4):1190. doi: 10.3390/ijms19041190.
4
Predicting Presynaptic and Postsynaptic Neurotoxins by Developing Feature Selection Technique.通过开发特征选择技术预测突触前和突触后神经毒素。
Biomed Res Int. 2017;2017:3267325. doi: 10.1155/2017/3267325. Epub 2017 Feb 12.
5
A Novel Feature Extraction Method with Feature Selection to Identify Golgi-Resident Protein Types from Imbalanced Data.一种新型的特征提取方法,具有特征选择功能,可从不平衡数据中识别出高尔基驻留蛋白类型。
Int J Mol Sci. 2016 Feb 6;17(2):218. doi: 10.3390/ijms17020218.