• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于集成学习的新型计算方法,用于从蛋白质一级序列预测蛋白质-蛋白质相互作用。

A Novel Ensemble Learning-Based Computational Method to Predict Protein-Protein Interactions from Protein Primary Sequences.

作者信息

Pan Jie, Wang Shiwei, Yu Changqing, Li Liping, You Zhuhong, Sun Yanmei

机构信息

Key Laboratory of Resources Biology and Biotechnology in Western China, Ministry of Education, College of Life Science, Northwest University, Xi'an 710069, China.

School of Information Engineering, Xijing University, Xi'an 710123, China.

出版信息

Biology (Basel). 2022 May 19;11(5):775. doi: 10.3390/biology11050775.

DOI:10.3390/biology11050775
PMID:35625503
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9139052/
Abstract

Protein-protein interactions (PPIs) are crucial for understanding the cellular processes, including signal cascade, DNA transcription, metabolic cycles, and repair. In the past decade, a multitude of high-throughput methods have been introduced to detect PPIs. However, these techniques are time-consuming, laborious, and always suffer from high false negative rates. Therefore, there is a great need of new computational methods as a supplemental tool for PPIs prediction. In this article, we present a novel sequence-based model to predict PPIs that combines Discrete Hilbert transform (DHT) and Rotation Forest (RoF). This method contains three stages: firstly, the Position-Specific Scoring Matrices (PSSM) was adopted to transform the amino acid sequence into a PSSM matrix, which can contain rich information about protein evolution. Then, the 400-dimensional DHT descriptor was constructed for each protein pair. Finally, these feature descriptors were fed to the RoF classifier for identifying the potential PPI class. When exploring the proposed model on the , , and PPIs datasets, it yielded excellent prediction accuracies of 91.93, 96.35, and 94.24%, respectively. In addition, we also conducted numerous experiments on cross-species PPIs datasets, and the predictive capacity of our method is also very excellent. To further access the prediction ability of the proposed approach, we present the comparison of RoF with four powerful classifiers, including Support Vector Machine (SVM), Random Forest (RF), K-nearest Neighbor (KNN), and AdaBoost. We also compared it with some existing superiority works. These comprehensive experimental results further confirm the excellent and feasibility of the proposed approach. In future work, we hope it can be a supplemental tool for the proteomics analysis.

摘要

蛋白质-蛋白质相互作用(PPIs)对于理解细胞过程至关重要,这些过程包括信号级联、DNA转录、代谢循环和修复。在过去十年中,已经引入了多种高通量方法来检测PPIs。然而,这些技术耗时、费力,并且总是存在高假阴性率。因此,迫切需要新的计算方法作为PPIs预测的补充工具。在本文中,我们提出了一种基于序列的新型模型来预测PPIs,该模型结合了离散希尔伯特变换(DHT)和旋转森林(RoF)。该方法包括三个阶段:首先,采用位置特异性评分矩阵(PSSM)将氨基酸序列转换为PSSM矩阵,该矩阵可以包含有关蛋白质进化的丰富信息。然后,为每个蛋白质对构建400维DHT描述符。最后,将这些特征描述符输入到RoF分类器中以识别潜在的PPI类别。当在、和PPIs数据集上探索所提出的模型时,它分别产生了91.93%、96.35%和94.24%的优异预测准确率。此外,我们还对跨物种PPIs数据集进行了大量实验,我们方法的预测能力也非常出色。为了进一步评估所提出方法的预测能力,我们将RoF与四个强大的分类器进行了比较,包括支持向量机(SVM)、随机森林(RF)、K近邻(KNN)和AdaBoost。我们还将其与一些现有的优势工作进行了比较。这些全面的实验结果进一步证实了所提出方法的优异性能和可行性。在未来的工作中,我们希望它能成为蛋白质组学分析的补充工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/968f/9139052/32e4155eea25/biology-11-00775-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/968f/9139052/c91bdfca5ce6/biology-11-00775-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/968f/9139052/212c8fc474fb/biology-11-00775-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/968f/9139052/c8ba3b51613f/biology-11-00775-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/968f/9139052/32e4155eea25/biology-11-00775-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/968f/9139052/c91bdfca5ce6/biology-11-00775-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/968f/9139052/212c8fc474fb/biology-11-00775-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/968f/9139052/c8ba3b51613f/biology-11-00775-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/968f/9139052/32e4155eea25/biology-11-00775-g004.jpg

相似文献

1
A Novel Ensemble Learning-Based Computational Method to Predict Protein-Protein Interactions from Protein Primary Sequences.一种基于集成学习的新型计算方法,用于从蛋白质一级序列预测蛋白质-蛋白质相互作用。
Biology (Basel). 2022 May 19;11(5):775. doi: 10.3390/biology11050775.
2
Sequence-Based Prediction of Plant Protein-Protein Interactions by Combining Discrete Sine Transformation With Rotation Forest.基于离散正弦变换与旋转森林相结合的植物蛋白质-蛋白质相互作用的序列预测
Evol Bioinform Online. 2021 Oct 12;17:11769343211050067. doi: 10.1177/11769343211050067. eCollection 2021.
3
Predicting Protein-Protein Interactions Based on Ensemble Learning-Based Model from Protein Sequence.基于蛋白质序列的集成学习模型预测蛋白质-蛋白质相互作用
Biology (Basel). 2022 Jun 30;11(7):995. doi: 10.3390/biology11070995.
4
CPIELA: Computational Prediction of Plant Protein-Protein Interactions by Ensemble Learning Approach From Protein Sequences and Evolutionary Information.CPIELA:基于蛋白质序列和进化信息的集成学习方法对植物蛋白质-蛋白质相互作用的计算预测
Front Genet. 2022 Mar 11;13:857839. doi: 10.3389/fgene.2022.857839. eCollection 2022.
5
RVMAB: Using the Relevance Vector Machine Model Combined with Average Blocks to Predict the Interactions of Proteins from Protein Sequences.RVMAB:使用相关向量机模型结合平均块从蛋白质序列预测蛋白质相互作用
Int J Mol Sci. 2016 May 18;17(5):757. doi: 10.3390/ijms17050757.
6
Accurate prediction of protein-protein interactions by integrating potential evolutionary information embedded in PSSM profile and discriminative vector machine classifier.通过整合PSSM概况中嵌入的潜在进化信息和判别向量机分类器来准确预测蛋白质-蛋白质相互作用。
Oncotarget. 2017 Apr 4;8(14):23638-23649. doi: 10.18632/oncotarget.15564.
7
Predicting Protein-Protein Interactions via Random Ferns with Evolutionary Matrix Representation.基于进化矩阵表示的随机蕨类预测蛋白质-蛋白质相互作用。
Comput Math Methods Med. 2022 Feb 22;2022:7191684. doi: 10.1155/2022/7191684. eCollection 2022.
8
Prediction of Drug-Target Interactions by Combining Dual-Tree Complex Wavelet Transform with Ensemble Learning Method.基于复树小波变换与集成学习方法融合的药物-靶点相互作用预测
Molecules. 2021 Sep 3;26(17):5359. doi: 10.3390/molecules26175359.
9
Using discriminative vector machine model with 2DPCA to predict interactions among proteins.使用带二维主成分分析的判别向量机模型预测蛋白质间的相互作用。
BMC Bioinformatics. 2019 Dec 24;20(Suppl 25):694. doi: 10.1186/s12859-019-3268-5.
10
Prediction of Protein-Protein Interactions in , Maize, and Rice by Combining Deep Neural Network With Discrete Hilbert Transform.通过结合深度神经网络与离散希尔伯特变换预测玉米和水稻中的蛋白质-蛋白质相互作用
Front Genet. 2021 Sep 20;12:745228. doi: 10.3389/fgene.2021.745228. eCollection 2021.

本文引用的文献

1
DWPPI: A Deep Learning Approach for Predicting Protein-Protein Interactions in Plants Based on Multi-Source Information With a Large-Scale Biological Network.DWPPI:一种基于多源信息和大规模生物网络预测植物中蛋白质-蛋白质相互作用的深度学习方法。
Front Bioeng Biotechnol. 2022 Mar 21;10:807522. doi: 10.3389/fbioe.2022.807522. eCollection 2022.
2
Robust and accurate prediction of protein-protein interactions by exploiting evolutionary information.利用进化信息进行稳健且准确的蛋白质-蛋白质相互作用预测。
Sci Rep. 2021 Aug 19;11(1):16910. doi: 10.1038/s41598-021-96265-z.
3
DoMY-Seq: A yeast two-hybrid-based technique for precision mapping of protein-protein interaction motifs.
DoMY-Seq:一种基于酵母双杂交的蛋白质-蛋白质相互作用基序精确定位技术。
J Biol Chem. 2021 Jan-Jun;296:100023. doi: 10.1074/jbc.RA120.014284. Epub 2020 Nov 23.
4
Using Random Forest Model Combined With Gabor Feature to Predict Protein-Protein Interaction From Protein Sequence.使用随机森林模型结合Gabor特征从蛋白质序列预测蛋白质-蛋白质相互作用
Evol Bioinform Online. 2020 Jun 30;16:1176934320934498. doi: 10.1177/1176934320934498. eCollection 2020.
5
Prediction of human-virus protein-protein interactions through a sequence embedding-based machine learning method.通过基于序列嵌入的机器学习方法预测人类与病毒的蛋白质-蛋白质相互作用。
Comput Struct Biotechnol J. 2019 Dec 26;18:153-161. doi: 10.1016/j.csbj.2019.12.005. eCollection 2020.
6
An Efficient Feature Extraction Technique Based on Local Coding PSSM and Multifeatures Fusion for Predicting Protein-Protein Interactions.一种基于局部编码位置特异性得分矩阵和多特征融合的高效蛋白质-蛋白质相互作用预测特征提取技术
Evol Bioinform Online. 2019 Oct 3;15:1176934319879920. doi: 10.1177/1176934319879920. eCollection 2019.
7
Protein-protein interaction site prediction through combining local and global features with deep neural networks.通过结合局部和全局特征与深度神经网络进行蛋白质-蛋白质相互作用位点预测。
Bioinformatics. 2020 Feb 15;36(4):1114-1120. doi: 10.1093/bioinformatics/btz699.
8
Affinity Purification of NF1 Protein-Protein Interactors Identifies Keratins and Neurofibromin Itself as Binding Partners.NF1 蛋白-蛋白相互作用的亲和纯化鉴定角蛋白和神经纤维瘤自身为结合伴侣。
Genes (Basel). 2019 Aug 28;10(9):650. doi: 10.3390/genes10090650.
9
The Methods Employed in Mass Spectrometric Analysis of Posttranslational Modifications (PTMs) and Protein-Protein Interactions (PPIs).用于翻译后修饰(PTMs)和蛋白质-蛋白质相互作用(PPIs)的质谱分析方法。
Adv Exp Med Biol. 2019;1140:169-198. doi: 10.1007/978-3-030-15950-4_10.
10
An integration of deep learning with feature embedding for protein-protein interaction prediction.用于蛋白质-蛋白质相互作用预测的深度学习与特征嵌入的集成。
PeerJ. 2019 Jun 17;7:e7126. doi: 10.7717/peerj.7126. eCollection 2019.