Predicting protein-ATP binding sites from primary sequence through fusing bi-profile sampling of multi-view features.

Author information

Department of Automation, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai 200240, China.

Publication information

BMC Bioinformatics. 2012 May 31;13:118. doi: 10.1186/1471-2105-13-118.

DOI:10.1186/1471-2105-13-118

PMID:22651691

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3424114/

Abstract

BACKGROUND

Adenosine-5'-triphosphate (ATP) is a multifunctional nucleotide that plays an important role in cell biology as a coenzyme interacting with proteins. Identifying the binding sites between proteins and ATP is important for understanding protein function and the mechanisms of protein-ATP complexes.

RESULTS

In this paper, we propose a novel framework for predicting the functional residues through which proteins bind ATP molecules. The prediction protocol combines sequence evolutionary information with bi-profile sampling of multi-view sequential features and sequence-derived structural features. The underlying hypothesis is that a single-view feature can represent only part of the knowledge about the target, whereas descriptors from multiple sources can be complementary.
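The multi-view idea can be illustrated with a minimal sketch: per-residue feature vectors from several views are collected over a sliding window and concatenated into one descriptor. This shows only generic feature fusion, not the authors' bi-profile sampling, whose details the abstract does not give. The view names (`evolution`, `structure`), the window half-width, the zero-padding at the termini, and the function names are all assumptions made for illustration.

```python
from typing import Dict, List


def window_features(view: List[List[float]], center: int, half_width: int) -> List[float]:
    """Flatten one view's per-residue vectors over a window centred on `center`.

    Positions that fall outside the sequence are zero-padded (an assumption).
    """
    dim = len(view[0])
    out: List[float] = []
    for pos in range(center - half_width, center + half_width + 1):
        if 0 <= pos < len(view):
            out.extend(view[pos])
        else:
            out.extend([0.0] * dim)
    return out


def fuse_views(views: Dict[str, List[List[float]]], center: int, half_width: int) -> List[float]:
    """Concatenate the window features of every view in a fixed (sorted) order."""
    fused: List[float] = []
    for name in sorted(views):
        fused.extend(window_features(views[name], center, half_width))
    return fused


# Toy protein with 4 residues and two hypothetical views.
views = {
    "evolution": [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6], [0.7, 0.8]],
    "structure": [[1.0], [0.0], [1.0], [0.0]],
}
vec = fuse_views(views, center=0, half_width=1)
print(len(vec))  # 3 window positions x (2 + 1) features per position
```

Sorting the view names keeps the feature order stable across residues, so the same column always refers to the same view and window position.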

CONCLUSIONS

Prediction performance, evaluated by both 5-fold and leave-one-out jackknife cross-validation on two benchmark datasets of 168 and 227 non-homologous ATP-binding proteins, respectively, demonstrates the efficacy of the proposed protocol. Our experimental results also reveal that the structural characteristics of residues at real protein-ATP binding sites differ significantly from those of non-binding residues: for example, binding residues do not show a propensity for high solvent accessibility, and binding tends to occur at the junctions between different secondary structure segments. Furthermore, experiments with multiple ratios of positive to negative samples show that performance is affected by imbalance in the training data. Increasing the dataset size is also shown to improve prediction performance.
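The evaluation setup described above can be sketched roughly as follows: negatives are randomly undersampled to a chosen negative-to-positive ratio, and the retained samples are split into k folds, with k equal to the number of samples giving leave-one-out. This is a minimal sketch under stated assumptions, not the authors' pipeline. The function names, the ratios, and the toy labels are hypothetical, and the abstract does not say whether the authors split by residue or by protein, or which sampling scheme they used.

```python
import random
from typing import List, Sequence, Tuple


def undersample(labels: Sequence[int], neg_per_pos: int, seed: int = 0) -> List[int]:
    """Keep every positive index and at most `neg_per_pos` negatives per positive."""
    rng = random.Random(seed)
    pos = [i for i, y in enumerate(labels) if y == 1]
    neg = [i for i, y in enumerate(labels) if y == 0]
    keep = min(len(neg), neg_per_pos * len(pos))
    return sorted(pos + rng.sample(neg, keep))


def k_fold(indices: Sequence[int], k: int) -> List[Tuple[List[int], List[int]]]:
    """Return k (train, test) index pairs; k == len(indices) is leave-one-out."""
    folds = [list(indices[i::k]) for i in range(k)]
    return [
        ([j for f in folds[:i] + folds[i + 1:] for j in f], folds[i])
        for i in range(k)
    ]


# Toy data: 2 binding residues (1) and 18 non-binding residues (0).
labels = [1, 1] + [0] * 18
for ratio in (1, 2, 5):
    print(ratio, len(undersample(labels, ratio)))

kept = undersample(labels, neg_per_pos=2)
splits = k_fold(kept, k=3)
```

In a real experiment the undersampling would normally be applied to the training portion only, so that the test folds keep the natural class imbalance.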

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7186/3424114/d2ba4e3c24cd/1471-2105-13-118-1.jpg
