• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过进化差异信息将低相似性序列的蛋白质结构类预测转化为周氏伪氨基酸组成的一般形式。

Predict protein structural class for low-similarity sequences by evolutionary difference information into the general form of Chou's pseudo amino acid composition.

作者信息

Zhang Lichao, Zhao Xiqiang, Kong Liang

机构信息

College of Marine Life Science, Ocean University of China, Yushan Road, Qingdao 266003, PR China.

College of Mathematical Science, Ocean University of China, Songling Road, Qingdao 266100, PR China.

出版信息

J Theor Biol. 2014 Aug 21;355:105-10. doi: 10.1016/j.jtbi.2014.04.008. Epub 2014 Apr 13.

DOI:10.1016/j.jtbi.2014.04.008
PMID:24735902
Abstract

Knowledge of protein structural class plays an important role in characterizing the overall folding type of a given protein. At present, it is still a challenge to extract sequence information solely using protein sequence for protein structural class prediction with low similarity sequence in the current computational biology. In this study, a novel sequence representation method is proposed based on position specific scoring matrix for protein structural class prediction. By defined evolutionary difference formula, varying length proteins are expressed as uniform dimensional vectors, which can represent evolutionary difference information between the adjacent residues of a given protein. To perform and evaluate the proposed method, support vector machine and jackknife tests are employed on three widely used datasets, 25PDB, 1189 and 640 datasets with sequence similarity lower than 25%, 40% and 25%, respectively. Comparison of our results with the previous methods shows that our method may provide a promising method to predict protein structural class especially for low-similarity sequences.

摘要

了解蛋白质结构类别对于表征给定蛋白质的整体折叠类型起着重要作用。目前,在当前计算生物学中,仅使用蛋白质序列来预测低相似性序列的蛋白质结构类别,仅从蛋白质序列中提取序列信息仍然是一项挑战。在本研究中,提出了一种基于位置特异性评分矩阵的新型序列表示方法用于蛋白质结构类别预测。通过定义进化差异公式,将不同长度的蛋白质表示为统一维度的向量,该向量可以表示给定蛋白质相邻残基之间的进化差异信息。为了执行和评估所提出的方法,在三个广泛使用的数据集(25PDB、1189和640数据集,序列相似性分别低于25%、40%和25%)上采用了支持向量机和留一法测试。将我们的结果与先前方法进行比较表明,我们的方法可能为预测蛋白质结构类别提供一种有前景的方法,特别是对于低相似性序列。

相似文献

1
Predict protein structural class for low-similarity sequences by evolutionary difference information into the general form of Chou's pseudo amino acid composition.通过进化差异信息将低相似性序列的蛋白质结构类预测转化为周氏伪氨基酸组成的一般形式。
J Theor Biol. 2014 Aug 21;355:105-10. doi: 10.1016/j.jtbi.2014.04.008. Epub 2014 Apr 13.
2
Using principal component analysis and support vector machine to predict protein structural class for low-similarity sequences via PSSM.基于 PSSM 利用主成分分析和支持向量机预测低相似度序列的蛋白质结构类别
J Biomol Struct Dyn. 2012;29(6):634-42. doi: 10.1080/07391102.2011.672627.
3
Prediction of protein structural class for low-similarity sequences using support vector machine and PSI-BLAST profile.使用支持向量机和 PSI-BLAST 轮廓预测低相似度序列的蛋白质结构类别。
Biochimie. 2010 Oct;92(10):1330-4. doi: 10.1016/j.biochi.2010.06.013. Epub 2010 Jun 23.
4
Accurate prediction of protein structural classes by incorporating predicted secondary structure information into the general form of Chou's pseudo amino acid composition.通过将预测的二级结构信息纳入周的伪氨基酸组成的通用形式,准确预测蛋白质结构类别。
J Theor Biol. 2014 Mar 7;344:12-8. doi: 10.1016/j.jtbi.2013.11.021. Epub 2013 Dec 6.
5
Predict protein structural class by incorporating two different modes of evolutionary information into Chou's general pseudo amino acid composition.通过将两种不同模式的进化信息整合到周氏广义伪氨基酸组成中预测蛋白质结构类别。
J Mol Graph Model. 2017 Nov;78:110-117. doi: 10.1016/j.jmgm.2017.10.003. Epub 2017 Oct 7.
6
A novel predictor for protein structural class based on integrated information of the secondary structure sequence.一种基于二级结构序列综合信息的蛋白质结构类新型预测器。
Biochimie. 2014 Aug;103:131-6. doi: 10.1016/j.biochi.2014.05.008. Epub 2014 May 22.
7
Prediction of protein structural class for low-similarity sequences using Chou's pseudo amino acid composition and wavelet denoising.基于周氏伪氨基酸组成和小波去噪的低相似性序列蛋白质结构类预测
J Mol Graph Model. 2017 Sep;76:260-273. doi: 10.1016/j.jmgm.2017.07.012. Epub 2017 Jul 14.
8
Prediction of protein structural classes by Chou's pseudo amino acid composition: approached using continuous wavelet transform and principal component analysis.基于周式伪氨基酸组成预测蛋白质结构类别:采用连续小波变换和主成分分析方法
Amino Acids. 2009 Jul;37(2):415-25. doi: 10.1007/s00726-008-0170-2. Epub 2008 Aug 23.
9
A novel feature representation method based on Chou's pseudo amino acid composition for protein structural class prediction.基于 Chou 的伪氨基酸组成的新型蛋白质结构类预测特征表示方法。
Comput Biol Chem. 2010 Dec;34(5-6):320-7. doi: 10.1016/j.compbiolchem.2010.09.002. Epub 2010 Nov 5.
10
Discriminating protein structure classes by incorporating Pseudo Average Chemical Shift to Chou's general PseAAC and Support Vector Machine.通过将伪平均化学位移纳入周的广义伪氨基酸组成和支持向量机来区分蛋白质结构类别。
Comput Methods Programs Biomed. 2014 Oct;116(3):184-92. doi: 10.1016/j.cmpb.2014.06.007. Epub 2014 Jun 21.

引用本文的文献

1
Enhancing the Feature Representation of Protein Sequence Descriptors in Protein-Protein Interaction Prediction.在蛋白质-蛋白质相互作用预测中增强蛋白质序列描述符的特征表示
Interdiscip Sci. 2025 Jun 2. doi: 10.1007/s12539-025-00723-5.
2
StackedEnC-AOP: prediction of antioxidant proteins using transform evolutionary and sequential features based multi-scale vector with stacked ensemble learning.StackedEnC-AOP:基于多尺度向量的转换进化和序列特征与堆叠集成学习预测抗氧化蛋白。
BMC Bioinformatics. 2024 Aug 4;25(1):256. doi: 10.1186/s12859-024-05884-6.
3
HPC-Atlas: Computationally Constructing A Comprehensive Atlas of Human Protein Complexes.
HPC图谱:通过计算构建人类蛋白质复合物综合图谱
Genomics Proteomics Bioinformatics. 2023 Oct;21(5):976-990. doi: 10.1016/j.gpb.2023.05.001. Epub 2023 Sep 18.
4
DNAPred_Prot: Identification of DNA-Binding Proteins Using Composition- and Position-Based Features.DNAPred_Prot:利用基于组成和位置的特征识别DNA结合蛋白。
Appl Bionics Biomech. 2022 Apr 13;2022:5483115. doi: 10.1155/2022/5483115. eCollection 2022.
5
PSSMCOOL: a comprehensive R package for generating evolutionary-based descriptors of protein sequences from PSSM profiles.PSSMCOOL:一个用于从PSSM谱生成基于进化的蛋白质序列描述符的综合R包。
Biol Methods Protoc. 2022 Mar 30;7(1):bpac008. doi: 10.1093/biomethods/bpac008. eCollection 2022.
6
Component Parts of Bacteriophage Virions Accurately Defined by a Machine-Learning Approach Built on Evolutionary Features.基于进化特征的机器学习方法精确界定噬菌体病毒粒子的组成部分。
mSystems. 2021 Jun 29;6(3):e0024221. doi: 10.1128/mSystems.00242-21. Epub 2021 May 27.
7
iT4SE-EP: Accurate Identification of Bacterial Type IV Secreted Effectors by Exploring Evolutionary Features from Two PSI-BLAST Profiles.iT4SE-EP:通过探索来自两个PSI-BLAST图谱的进化特征准确鉴定细菌IV型分泌效应蛋白
Molecules. 2021 Apr 24;26(9):2487. doi: 10.3390/molecules26092487.
8
Variable selection from a feature representing protein sequences: a case of classification on bacterial type IV secreted effectors.基于蛋白质序列特征的变量选择:以 IV 型细菌分泌效应子分类为例。
BMC Bioinformatics. 2020 Oct 27;21(1):480. doi: 10.1186/s12859-020-03826-6.
9
Some illuminating remarks on molecular genetics and genomics as well as drug development.关于分子遗传学和基因组学以及药物开发的一些有启发性的观点。
Mol Genet Genomics. 2020 Mar;295(2):261-274. doi: 10.1007/s00438-019-01634-z. Epub 2020 Jan 1.
10
Prediction of protein structural classes by different feature expressions based on 2-D wavelet denoising and fusion.基于二维小波去噪和融合的不同特征表达预测蛋白质结构类别。
BMC Bioinformatics. 2019 Dec 24;20(Suppl 25):701. doi: 10.1186/s12859-019-3276-5.