• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种利用多条序列比对进行大规模蛋白质-蛋白质相互作用识别的整合方法。

An ensemble approach for large-scale identification of protein- protein interactions using the alignments of multiple sequences.

作者信息

Wang Lei, You Zhu-Hong, Chen Xing, Li Jian-Qiang, Yan Xin, Zhang Wei, Huang Yu-An

机构信息

School of Computer Science and Technology, China University of Mining and Technology, Xuzhou 221116, China.

College of Information Science and Engineering, Zaozhuang University, Zaozhuang, Shandong 277100, China.

出版信息

Oncotarget. 2017 Jan 17;8(3):5149-5159. doi: 10.18632/oncotarget.14103.

DOI:10.18632/oncotarget.14103
PMID:28029645
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5354898/
Abstract

Protein-Protein Interactions (PPI) is not only the critical component of various biological processes in cells, but also the key to understand the mechanisms leading to healthy and diseased states in organisms. However, it is time-consuming and cost-intensive to identify the interactions among proteins using biological experiments. Hence, how to develop a more efficient computational method rapidly became an attractive topic in the post-genomic era. In this paper, we propose a novel method for inference of protein-protein interactions from protein amino acids sequences only. Specifically, protein amino acids sequence is firstly transformed into Position-Specific Scoring Matrix (PSSM) generated by multiple sequences alignments; then the Pseudo PSSM is used to extract feature descriptors. Finally, ensemble Rotation Forest (RF) learning system is trained to predict and recognize PPIs based solely on protein sequence feature. When performed the proposed method on the three benchmark data sets (Yeast, H. pylori, and independent dataset) for predicting PPIs, our method can achieve good average accuracies of 98.38%, 89.75%, and 96.25%, respectively. In order to further evaluate the prediction performance, we also compare the proposed method with other methods using same benchmark data sets. The experiment results demonstrate that the proposed method consistently outperforms other state-of-the-art method. Therefore, our method is effective and robust and can be taken as a useful tool in exploring and discovering new relationships between proteins. A web server is made publicly available at the URL http://202.119.201.126:8888/PsePSSM/ for academic use.

摘要

蛋白质-蛋白质相互作用(PPI)不仅是细胞中各种生物过程的关键组成部分,也是理解生物体健康和疾病状态形成机制的关键。然而,通过生物学实验来鉴定蛋白质之间的相互作用既耗时又成本高昂。因此,如何开发一种更高效的计算方法迅速成为后基因组时代一个引人关注的话题。在本文中,我们提出了一种仅从蛋白质氨基酸序列推断蛋白质-蛋白质相互作用的新方法。具体而言,首先将蛋白质氨基酸序列转化为由多序列比对生成的位置特异性得分矩阵(PSSM);然后使用伪PSSM提取特征描述符。最后,训练集成旋转森林(RF)学习系统,仅基于蛋白质序列特征来预测和识别PPI。当在所提出的方法应用于用于预测PPI的三个基准数据集(酵母、幽门螺杆菌和独立数据集)时,我们的方法分别能够实现98.38%、89.75%和96.25%的良好平均准确率。为了进一步评估预测性能,我们还使用相同的基准数据集将所提出的方法与其他方法进行了比较。实验结果表明,所提出的方法始终优于其他现有最先进的方法。因此,我们的方法有效且稳健,可作为探索和发现蛋白质之间新关系的有用工具。一个网络服务器已在网址http://202.119.201.126:8888/PsePSSM/ 公开提供以供学术使用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/09b9/5354898/8a41e97df57b/oncotarget-08-5149-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/09b9/5354898/c3c0adb20ca3/oncotarget-08-5149-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/09b9/5354898/2b8f08719693/oncotarget-08-5149-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/09b9/5354898/72cdecf8f857/oncotarget-08-5149-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/09b9/5354898/8a41e97df57b/oncotarget-08-5149-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/09b9/5354898/c3c0adb20ca3/oncotarget-08-5149-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/09b9/5354898/2b8f08719693/oncotarget-08-5149-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/09b9/5354898/72cdecf8f857/oncotarget-08-5149-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/09b9/5354898/8a41e97df57b/oncotarget-08-5149-g004.jpg

相似文献

1
An ensemble approach for large-scale identification of protein- protein interactions using the alignments of multiple sequences.一种利用多条序列比对进行大规模蛋白质-蛋白质相互作用识别的整合方法。
Oncotarget. 2017 Jan 17;8(3):5149-5159. doi: 10.18632/oncotarget.14103.
2
RVMAB: Using the Relevance Vector Machine Model Combined with Average Blocks to Predict the Interactions of Proteins from Protein Sequences.RVMAB:使用相关向量机模型结合平均块从蛋白质序列预测蛋白质相互作用
Int J Mol Sci. 2016 May 18;17(5):757. doi: 10.3390/ijms17050757.
3
Advancing the prediction accuracy of protein-protein interactions by utilizing evolutionary information from position-specific scoring matrix and ensemble classifier.利用来自位置特异性得分矩阵的进化信息和集成分类器提高蛋白质-蛋白质相互作用的预测准确性。
J Theor Biol. 2017 Apr 7;418:105-110. doi: 10.1016/j.jtbi.2017.01.003. Epub 2017 Jan 11.
4
Predicting Protein-Protein Interactions via Random Ferns with Evolutionary Matrix Representation.基于进化矩阵表示的随机蕨类预测蛋白质-蛋白质相互作用。
Comput Math Methods Med. 2022 Feb 22;2022:7191684. doi: 10.1155/2022/7191684. eCollection 2022.
5
Predicting protein-protein interactions from primary protein sequences using a novel multi-scale local feature representation scheme and the random forest.使用一种新颖的多尺度局部特征表示方案和随机森林从蛋白质一级序列预测蛋白质-蛋白质相互作用。
PLoS One. 2015 May 6;10(5):e0125811. doi: 10.1371/journal.pone.0125811. eCollection 2015.
6
Accurate prediction of protein-protein interactions by integrating potential evolutionary information embedded in PSSM profile and discriminative vector machine classifier.通过整合PSSM概况中嵌入的潜在进化信息和判别向量机分类器来准确预测蛋白质-蛋白质相互作用。
Oncotarget. 2017 Apr 4;8(14):23638-23649. doi: 10.18632/oncotarget.15564.
7
Predicting Protein-Protein Interactions from Matrix-Based Protein Sequence Using Convolution Neural Network and Feature-Selective Rotation Forest.基于卷积神经网络和特征选择旋转森林的矩阵基蛋白序列预测蛋白-蛋白相互作用
Sci Rep. 2019 Jul 8;9(1):9848. doi: 10.1038/s41598-019-46369-4.
8
Global protein-protein interaction networks in yeast saccharomyces cerevisiae and helicobacter pylori.酵母酿酒酵母和幽门螺杆菌的全球蛋白质-蛋白质相互作用网络。
Talanta. 2023 Dec 1;265:124836. doi: 10.1016/j.talanta.2023.124836. Epub 2023 Jun 20.
9
PCVMZM: Using the Probabilistic Classification Vector Machines Model Combined with a Zernike Moments Descriptor to Predict Protein-Protein Interactions from Protein Sequences.PCVMZM:使用概率分类向量机模型结合泽尼克矩描述符从蛋白质序列预测蛋白质-蛋白质相互作用
Int J Mol Sci. 2017 May 11;18(5):1029. doi: 10.3390/ijms18051029.
10
Improving protein-protein interactions prediction accuracy using protein evolutionary information and relevance vector machine model.利用蛋白质进化信息和相关向量机模型提高蛋白质-蛋白质相互作用预测准确性
Protein Sci. 2016 Oct;25(10):1825-33. doi: 10.1002/pro.2991. Epub 2016 Aug 9.

引用本文的文献

1
ProSol-multi: Protein solubility prediction via amino acids multi-level correlation and discriminative distribution.ProSol-multi:基于氨基酸多级相关性和判别性分布的蛋白质溶解度预测
Heliyon. 2024 Aug 22;10(17):e36041. doi: 10.1016/j.heliyon.2024.e36041. eCollection 2024 Sep 15.
2
RRM2 promotes the proliferation of chicken myoblasts, inhibits their differentiation and muscle regeneration.RRM2 促进鸡成肌细胞的增殖,抑制其分化和肌肉再生。
Poult Sci. 2024 Mar;103(3):103407. doi: 10.1016/j.psj.2023.103407. Epub 2023 Dec 27.
3
TeM-DTBA: time-efficient drug target binding affinity prediction using multiple modalities with Lasso feature selection.

本文引用的文献

1
Highly Efficient Framework for Predicting Interactions Between Proteins.高效蛋白质相互作用预测框架。
IEEE Trans Cybern. 2017 Mar;47(3):731-743. doi: 10.1109/TCYB.2016.2524994. Epub 2016 Mar 30.
2
Ens-PPI: A Novel Ensemble Classifier for Predicting the Interactions of Proteins Using Autocovariance Transformation from PSSM.Ens-PPI:一种使用来自位置特异性得分矩阵(PSSM)的自协方差变换预测蛋白质相互作用的新型集成分类器。
Biomed Res Int. 2016;2016:4563524. doi: 10.1155/2016/4563524. Epub 2016 Jun 29.
3
Computational prediction of protein-protein interactions.
TeM-DTBA:使用具有套索特征选择的多模态进行高效药物靶点结合亲和力预测。
J Comput Aided Mol Des. 2023 Dec;37(12):573-584. doi: 10.1007/s10822-023-00533-1. Epub 2023 Sep 30.
4
Analysis of PPI networks of transcriptomic expression identifies hub genes associated with Newcastle disease virus persistent infection in bladder cancer.分析转录组表达的 PPI 网络鉴定与膀胱癌中新城疫病毒持续感染相关的 hub 基因。
Sci Rep. 2023 May 5;13(1):7323. doi: 10.1038/s41598-022-20521-z.
5
Predicting Protein-Protein Interactions Based on Ensemble Learning-Based Model from Protein Sequence.基于蛋白质序列的集成学习模型预测蛋白质-蛋白质相互作用
Biology (Basel). 2022 Jun 30;11(7):995. doi: 10.3390/biology11070995.
6
SIPGCN: A Novel Deep Learning Model for Predicting Self-Interacting Proteins from Sequence Information Using Graph Convolutional Networks.SIPGCN:一种使用图卷积网络从序列信息预测自相互作用蛋白的新型深度学习模型。
Biomedicines. 2022 Jun 29;10(7):1543. doi: 10.3390/biomedicines10071543.
7
MSPEDTI: Prediction of Drug-Target Interactions via Molecular Structure with Protein Evolutionary Information.MSPEDTI:通过具有蛋白质进化信息的分子结构预测药物-靶点相互作用
Biology (Basel). 2022 May 13;11(5):740. doi: 10.3390/biology11050740.
8
Incorporating chemical sub-structures and protein evolutionary information for inferring drug-target interactions.结合化学亚结构和蛋白质进化信息推断药物-靶标相互作用。
Sci Rep. 2020 Apr 20;10(1):6641. doi: 10.1038/s41598-020-62891-2.
9
An Ensemble Classifier to Predict Protein-Protein Interactions by Combining PSSM-based Evolutionary Information with Local Binary Pattern Model.基于 PSSM 进化信息与局部二值模式模型相结合的蛋白质-蛋白质相互作用预测的集成分类器。
Int J Mol Sci. 2019 Jul 17;20(14):3511. doi: 10.3390/ijms20143511.
10
A novel matrix of sequence descriptors for predicting protein-protein interactions from amino acid sequences.一种用于从氨基酸序列预测蛋白质-蛋白质相互作用的新型序列描述符矩阵。
PLoS One. 2019 Jun 7;14(6):e0217312. doi: 10.1371/journal.pone.0217312. eCollection 2019.
蛋白质-蛋白质相互作用的计算预测
Methods Mol Biol. 2015;1278:57-75. doi: 10.1007/978-1-4939-2425-7_4.
4
A highly efficient approach to protein interactome mapping based on collaborative filtering framework.一种基于协同过滤框架的蛋白质相互作用组图谱绘制的高效方法。
Sci Rep. 2015 Jan 9;5:7702. doi: 10.1038/srep07702.
5
Protein-protein interactions prediction based on iterative clique extension with gene ontology filtering.基于迭代团扩展和基因本体过滤的蛋白质-蛋白质相互作用预测
ScientificWorldJournal. 2014 Jan 22;2014:523634. doi: 10.1155/2014/523634. eCollection 2014.
6
QChIPat: a quantitative method to identify distinct binding patterns for two biological ChIP-seq samples in different experimental conditions.QChIPat:一种用于鉴定两种不同实验条件下的两个生物 ChIP-seq 样本的独特结合模式的定量方法。
BMC Genomics. 2013;14 Suppl 8(Suppl 8):S3. doi: 10.1186/1471-2164-14-S8-S3. Epub 2013 Dec 9.
7
Ensemble learning prediction of protein-protein interactions using proteins functional annotations.利用蛋白质功能注释进行蛋白质-蛋白质相互作用的集成学习预测
Mol Biosyst. 2014 Apr;10(4):820-30. doi: 10.1039/c3mb70486f. Epub 2014 Jan 27.
8
Prediction of protein-protein interactions from amino acid sequences with ensemble extreme learning machines and principal component analysis.利用集成极限学习机和主成分分析从氨基酸序列预测蛋白质-蛋白质相互作用。
BMC Bioinformatics. 2013;14 Suppl 8(Suppl 8):S10. doi: 10.1186/1471-2105-14-S8-S10. Epub 2013 May 9.
9
An empirical study on the matrix-based protein representations and their combination with sequence-based approaches.基于矩阵的蛋白质表示及其与基于序列方法相结合的实证研究。
Amino Acids. 2013 Mar;44(3):887-901. doi: 10.1007/s00726-012-1416-6. Epub 2012 Oct 30.
10
Human protein-protein interaction prediction by a novel sequence-based co-evolution method: co-evolutionary divergence.基于新型序列共进化方法的人类蛋白质-蛋白质相互作用预测:共进化分歧。
Bioinformatics. 2013 Jan 1;29(1):92-8. doi: 10.1093/bioinformatics/bts620. Epub 2012 Oct 18.