• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过将预测的二级结构信息纳入周的伪氨基酸组成的通用形式,准确预测蛋白质结构类别。

Accurate prediction of protein structural classes by incorporating predicted secondary structure information into the general form of Chou's pseudo amino acid composition.

机构信息

College of Mathematics and Information Technology, Hebei Normal University of Science and Technology, Qinhuangdao 066004, PR China.

College of Marine Life Science, Ocean University of China, Yushan Road, Qingdao 266003, PR China.

出版信息

J Theor Biol. 2014 Mar 7;344:12-8. doi: 10.1016/j.jtbi.2013.11.021. Epub 2013 Dec 6.

DOI:10.1016/j.jtbi.2013.11.021
PMID:24316044
Abstract

Extracting good representation from protein sequence is fundamental for protein structural classes prediction tasks. In this paper, we propose a novel and powerful method to predict protein structural classes based on the predicted secondary structure information. At the feature extraction stage, a 13-dimensional feature vector is extracted to characterize general contents and spatial arrangements of the secondary structural elements of a given protein sequence. Specially, four segment-level features are designed to elevate discriminative ability for proteins from the α/β and α+β classes. After the features are extracted, a multi-class non-linear support vector machine classifier is used to implement protein structural classes prediction. We report extensive experiments comparing the proposed method to the state-of-the-art in protein structural classes prediction on three widely used low-similarity benchmark datasets: FC699, 1189 and 640. Our method achieves competitive performance on prediction accuracies, especially for the overall prediction accuracies which have exceeded the best reported results on all of the three datasets.

摘要

从蛋白质序列中提取良好的表示对于蛋白质结构类预测任务至关重要。在本文中,我们提出了一种基于预测的二级结构信息预测蛋白质结构类的新方法。在特征提取阶段,提取了一个 13 维特征向量,以表征给定蛋白质序列中二级结构元素的一般内容和空间排列。特别地,设计了四个分段级特征,以提高对来自 α/β 和 α+β 类别的蛋白质的判别能力。特征提取后,使用多类非线性支持向量机分类器实现蛋白质结构类预测。我们报告了广泛的实验,将所提出的方法与蛋白质结构类预测的最新技术在三个广泛使用的低相似度基准数据集上进行了比较:FC699、1189 和 640。我们的方法在预测精度上表现出竞争力,尤其是在整体预测精度方面,在所有三个数据集上都超过了最佳报道结果。

相似文献

1
Accurate prediction of protein structural classes by incorporating predicted secondary structure information into the general form of Chou's pseudo amino acid composition.通过将预测的二级结构信息纳入周的伪氨基酸组成的通用形式,准确预测蛋白质结构类别。
J Theor Biol. 2014 Mar 7;344:12-8. doi: 10.1016/j.jtbi.2013.11.021. Epub 2013 Dec 6.
2
Novel structure-driven features for accurate prediction of protein structural class.用于准确预测蛋白质结构类别的新型结构驱动特征。
Genomics. 2014 Apr;103(4):292-7. doi: 10.1016/j.ygeno.2014.04.002. Epub 2014 Apr 18.
3
Prediction of protein structural classes by Chou's pseudo amino acid composition: approached using continuous wavelet transform and principal component analysis.基于周式伪氨基酸组成预测蛋白质结构类别:采用连续小波变换和主成分分析方法
Amino Acids. 2009 Jul;37(2):415-25. doi: 10.1007/s00726-008-0170-2. Epub 2008 Aug 23.
4
Predict protein structural class by incorporating two different modes of evolutionary information into Chou's general pseudo amino acid composition.通过将两种不同模式的进化信息整合到周氏广义伪氨基酸组成中预测蛋白质结构类别。
J Mol Graph Model. 2017 Nov;78:110-117. doi: 10.1016/j.jmgm.2017.10.003. Epub 2017 Oct 7.
5
Discriminating protein structure classes by incorporating Pseudo Average Chemical Shift to Chou's general PseAAC and Support Vector Machine.通过将伪平均化学位移纳入周的广义伪氨基酸组成和支持向量机来区分蛋白质结构类别。
Comput Methods Programs Biomed. 2014 Oct;116(3):184-92. doi: 10.1016/j.cmpb.2014.06.007. Epub 2014 Jun 21.
6
A novel feature representation method based on Chou's pseudo amino acid composition for protein structural class prediction.基于 Chou 的伪氨基酸组成的新型蛋白质结构类预测特征表示方法。
Comput Biol Chem. 2010 Dec;34(5-6):320-7. doi: 10.1016/j.compbiolchem.2010.09.002. Epub 2010 Nov 5.
7
Prediction of protein structural class for low-similarity sequences using Chou's pseudo amino acid composition and wavelet denoising.基于周氏伪氨基酸组成和小波去噪的低相似性序列蛋白质结构类预测
J Mol Graph Model. 2017 Sep;76:260-273. doi: 10.1016/j.jmgm.2017.07.012. Epub 2017 Jul 14.
8
A protein structural class prediction method based on novel features.基于新型特征的蛋白质结构类别预测方法。
Biochimie. 2013 Sep;95(9):1741-4. doi: 10.1016/j.biochi.2013.05.017. Epub 2013 Jun 14.
9
Predict protein structural class for low-similarity sequences by evolutionary difference information into the general form of Chou's pseudo amino acid composition.通过进化差异信息将低相似性序列的蛋白质结构类预测转化为周氏伪氨基酸组成的一般形式。
J Theor Biol. 2014 Aug 21;355:105-10. doi: 10.1016/j.jtbi.2014.04.008. Epub 2014 Apr 13.
10
Supersecondary structure prediction using Chou's pseudo amino acid composition.利用周所建立的伪氨基酸组成预测超二级结构。
J Comput Chem. 2011 Jan 30;32(2):271-8. doi: 10.1002/jcc.21616.

引用本文的文献

1
Comparative Study on Feature Selection in Protein Structure and Function Prediction.蛋白质结构与功能预测中的特征选择比较研究。
Comput Math Methods Med. 2022 Oct 11;2022:1650693. doi: 10.1155/2022/1650693. eCollection 2022.
2
Using Recursive Feature Selection with Random Forest to Improve Protein Structural Class Prediction for Low-Similarity Sequences.使用递归特征选择和随机森林提高低相似度序列的蛋白质结构分类预测。
Comput Math Methods Med. 2021 May 7;2021:5529389. doi: 10.1155/2021/5529389. eCollection 2021.
3
Some illuminating remarks on molecular genetics and genomics as well as drug development.
关于分子遗传学和基因组学以及药物开发的一些有启发性的观点。
Mol Genet Genomics. 2020 Mar;295(2):261-274. doi: 10.1007/s00438-019-01634-z. Epub 2020 Jan 1.
4
A New Method for Recognizing Cytokines Based on Feature Combination and a Support Vector Machine Classifier.基于特征组合和支持向量机分类器的细胞因子识别新方法。
Molecules. 2018 Aug 11;23(8):2008. doi: 10.3390/molecules23082008.
5
Accurate prediction of subcellular location of apoptosis proteins combining Chou's PseAAC and PsePSSM based on wavelet denoising.基于小波去噪结合周氏伪氨基酸组成和伪位置特异性得分矩阵对凋亡蛋白亚细胞定位的准确预测
Oncotarget. 2017 Nov 21;8(64):107640-107665. doi: 10.18632/oncotarget.22585. eCollection 2017 Dec 8.
6
An Alignment-Free Algorithm in Comparing the Similarity of Protein Sequences Based on Pseudo-Markov Transition Probabilities among Amino Acids.一种基于氨基酸间伪马尔可夫转移概率比较蛋白质序列相似性的无比对算法。
PLoS One. 2016 Dec 5;11(12):e0167430. doi: 10.1371/journal.pone.0167430. eCollection 2016.
7
Protein remote homology detection by combining Chou's distance-pair pseudo amino acid composition and principal component analysis.结合周氏距离对伪氨基酸组成和主成分分析进行蛋白质远程同源性检测。
Mol Genet Genomics. 2015 Oct;290(5):1919-31. doi: 10.1007/s00438-015-1044-4. Epub 2015 Apr 21.
8
A high performance prediction of HPV genotypes by Chaos game representation and singular value decomposition.基于混沌博弈表示法和奇异值分解的人乳头瘤病毒基因型高性能预测
BMC Bioinformatics. 2015 Mar 5;16:71. doi: 10.1186/s12859-015-0493-4.
9
Identification of real microRNA precursors with a pseudo structure status composition approach.采用伪结构状态组成方法鉴定真实的微小RNA前体。
PLoS One. 2015 Mar 30;10(3):e0121501. doi: 10.1371/journal.pone.0121501. eCollection 2015.
10
iDNA-Prot|dis: identifying DNA-binding proteins by incorporating amino acid distance-pairs and reduced alphabet profile into the general pseudo amino acid composition.iDNA-Prot|dis:通过将氨基酸距离对和简化字母表概况纳入通用伪氨基酸组成来鉴定DNA结合蛋白。
PLoS One. 2014 Sep 3;9(9):e106691. doi: 10.1371/journal.pone.0106691. eCollection 2014.