• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CPIELA:基于蛋白质序列和进化信息的集成学习方法对植物蛋白质-蛋白质相互作用的计算预测

CPIELA: Computational Prediction of Plant Protein-Protein Interactions by Ensemble Learning Approach From Protein Sequences and Evolutionary Information.

作者信息

Li Li-Ping, Zhang Bo, Cheng Li

机构信息

College of Grassland and Environment Sciences, Xinjiang Agricultural University, Urumqi, China.

Xinjiang Key Laboratory of Grassland Resources and Ecology, Urumqi, China.

出版信息

Front Genet. 2022 Mar 11;13:857839. doi: 10.3389/fgene.2022.857839. eCollection 2022.

DOI:10.3389/fgene.2022.857839
PMID:35360876
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8963800/
Abstract

Identification and characterization of plant protein-protein interactions (PPIs) are critical in elucidating the functions of proteins and molecular mechanisms in a plant cell. Although experimentally validated plant PPIs data have become increasingly available in diverse plant species, the high-throughput techniques are usually expensive and labor-intensive. With the incredibly valuable plant PPIs data accumulating in public databases, it is progressively important to propose computational approaches to facilitate the identification of possible PPIs. In this article, we propose an effective framework for predicting plant PPIs by combining the position-specific scoring matrix (PSSM), local optimal-oriented pattern (LOOP), and ensemble rotation forest (ROF) model. Specifically, the plant protein sequence is firstly transformed into the PSSM, in which the protein evolutionary information is perfectly preserved. Then, the local textural descriptor LOOP is employed to extract texture variation features from PSSM. Finally, the ROF classifier is adopted to infer the potential plant PPIs. The performance of CPIELA is evaluated via cross-validation on three plant PPIs datasets: , , and . The experimental results demonstrate that the CPIELA method achieved the high average prediction accuracies of 98.63%, 98.09%, and 94.02%, respectively. To further verify the high performance of CPIELA, we also compared it with the other state-of-the-art methods on three gold standard datasets. The experimental results illustrate that CPIELA is efficient and reliable for predicting plant PPIs. It is anticipated that the CPIELA approach could become a useful tool for facilitating the identification of possible plant PPIs.

摘要

鉴定和表征植物蛋白质-蛋白质相互作用(PPI)对于阐明植物细胞中蛋白质的功能和分子机制至关重要。尽管在多种植物物种中,经过实验验证的植物PPI数据越来越多,但高通量技术通常成本高昂且 labor-intensive。随着公共数据库中积累了极其有价值的植物PPI数据,提出计算方法以促进可能的PPI的鉴定变得越来越重要。在本文中,我们提出了一个有效的框架,通过结合位置特异性评分矩阵(PSSM)、局部最优方向模式(LOOP)和集成旋转森林(ROF)模型来预测植物PPI。具体而言,首先将植物蛋白质序列转化为PSSM,其中完美保留了蛋白质进化信息。然后,使用局部纹理描述符LOOP从PSSM中提取纹理变化特征。最后,采用ROF分类器推断潜在的植物PPI。通过在三个植物PPI数据集上进行交叉验证来评估CPIELA的性能: , ,和 。实验结果表明,CPIELA方法分别实现了98.63%、98.09%和94.02%的高平均预测准确率。为了进一步验证CPIELA的高性能,我们还在三个金标准数据集上与其他现有最先进方法进行了比较。实验结果表明,CPIELA在预测植物PPI方面是高效且可靠的。预计CPIELA方法可能会成为促进鉴定可能的植物PPI的有用工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d562/8963800/5355fbebf3c9/fgene-13-857839-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d562/8963800/820b625e2516/fgene-13-857839-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d562/8963800/51845fbc6126/fgene-13-857839-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d562/8963800/fc861319346f/fgene-13-857839-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d562/8963800/5355fbebf3c9/fgene-13-857839-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d562/8963800/820b625e2516/fgene-13-857839-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d562/8963800/51845fbc6126/fgene-13-857839-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d562/8963800/fc861319346f/fgene-13-857839-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d562/8963800/5355fbebf3c9/fgene-13-857839-g004.jpg

相似文献

1
CPIELA: Computational Prediction of Plant Protein-Protein Interactions by Ensemble Learning Approach From Protein Sequences and Evolutionary Information.CPIELA:基于蛋白质序列和进化信息的集成学习方法对植物蛋白质-蛋白质相互作用的计算预测
Front Genet. 2022 Mar 11;13:857839. doi: 10.3389/fgene.2022.857839. eCollection 2022.
2
Sequence-Based Prediction of Plant Protein-Protein Interactions by Combining Discrete Sine Transformation With Rotation Forest.基于离散正弦变换与旋转森林相结合的植物蛋白质-蛋白质相互作用的序列预测
Evol Bioinform Online. 2021 Oct 12;17:11769343211050067. doi: 10.1177/11769343211050067. eCollection 2021.
3
A Novel Ensemble Learning-Based Computational Method to Predict Protein-Protein Interactions from Protein Primary Sequences.一种基于集成学习的新型计算方法,用于从蛋白质一级序列预测蛋白质-蛋白质相互作用。
Biology (Basel). 2022 May 19;11(5):775. doi: 10.3390/biology11050775.
4
Predicting Protein-Protein Interactions via Random Ferns with Evolutionary Matrix Representation.基于进化矩阵表示的随机蕨类预测蛋白质-蛋白质相互作用。
Comput Math Methods Med. 2022 Feb 22;2022:7191684. doi: 10.1155/2022/7191684. eCollection 2022.
5
Accurate prediction of protein-protein interactions by integrating potential evolutionary information embedded in PSSM profile and discriminative vector machine classifier.通过整合PSSM概况中嵌入的潜在进化信息和判别向量机分类器来准确预测蛋白质-蛋白质相互作用。
Oncotarget. 2017 Apr 4;8(14):23638-23649. doi: 10.18632/oncotarget.15564.
6
An Ensemble Classifier to Predict Protein-Protein Interactions by Combining PSSM-based Evolutionary Information with Local Binary Pattern Model.基于 PSSM 进化信息与局部二值模式模型相结合的蛋白质-蛋白质相互作用预测的集成分类器。
Int J Mol Sci. 2019 Jul 17;20(14):3511. doi: 10.3390/ijms20143511.
7
PCVMZM: Using the Probabilistic Classification Vector Machines Model Combined with a Zernike Moments Descriptor to Predict Protein-Protein Interactions from Protein Sequences.PCVMZM:使用概率分类向量机模型结合泽尼克矩描述符从蛋白质序列预测蛋白质-蛋白质相互作用
Int J Mol Sci. 2017 May 11;18(5):1029. doi: 10.3390/ijms18051029.
8
An Efficient Feature Extraction Technique Based on Local Coding PSSM and Multifeatures Fusion for Predicting Protein-Protein Interactions.一种基于局部编码位置特异性得分矩阵和多特征融合的高效蛋白质-蛋白质相互作用预测特征提取技术
Evol Bioinform Online. 2019 Oct 3;15:1176934319879920. doi: 10.1177/1176934319879920. eCollection 2019.
9
Predicting Protein-Protein Interactions Based on Ensemble Learning-Based Model from Protein Sequence.基于蛋白质序列的集成学习模型预测蛋白质-蛋白质相互作用
Biology (Basel). 2022 Jun 30;11(7):995. doi: 10.3390/biology11070995.
10
Improving protein-protein interactions prediction accuracy using protein evolutionary information and relevance vector machine model.利用蛋白质进化信息和相关向量机模型提高蛋白质-蛋白质相互作用预测准确性
Protein Sci. 2016 Oct;25(10):1825-33. doi: 10.1002/pro.2991. Epub 2016 Aug 9.

引用本文的文献

1
Prediction of protein interactions between pine and pine wood nematode using deep learning and multi-dimensional feature fusion.基于深度学习和多维特征融合的松树与松材线虫蛋白质相互作用预测
Front Plant Sci. 2024 Dec 2;15:1489116. doi: 10.3389/fpls.2024.1489116. eCollection 2024.

本文引用的文献

1
Prediction of Protein-Protein Interactions in , Maize, and Rice by Combining Deep Neural Network With Discrete Hilbert Transform.通过结合深度神经网络与离散希尔伯特变换预测玉米和水稻中的蛋白质-蛋白质相互作用
Front Genet. 2021 Sep 20;12:745228. doi: 10.3389/fgene.2021.745228. eCollection 2021.
2
BioSeq-BLM: a platform for analyzing DNA, RNA and protein sequences based on biological language models.BioSeq-BLM:一个基于生物语言模型分析 DNA、RNA 和蛋白质序列的平台。
Nucleic Acids Res. 2021 Dec 16;49(22):e129. doi: 10.1093/nar/gkab829.
3
Reliable identification of protein-protein interactions by crosslinking mass spectrometry.
通过交联质谱可靠地鉴定蛋白质-蛋白质相互作用。
Nat Commun. 2021 Jun 11;12(1):3564. doi: 10.1038/s41467-021-23666-z.
4
Large-scale discovery of protein interactions at residue resolution using co-evolution calculated from genomic sequences.利用基因组序列计算的共进化进行残基分辨率下的大规模蛋白质相互作用发现。
Nat Commun. 2021 Mar 2;12(1):1396. doi: 10.1038/s41467-021-21636-z.
5
Vicinal Vertex Allocation for Matrix Factorization in Networks.网络中矩阵分解的邻接顶点分配
IEEE Trans Cybern. 2022 Aug;52(8):8047-8060. doi: 10.1109/TCYB.2021.3051606. Epub 2022 Jul 19.
6
Comprehensive characterization of protein-protein interactions perturbed by disease mutations.全面描绘受疾病突变影响的蛋白质-蛋白质相互作用。
Nat Genet. 2021 Mar;53(3):342-353. doi: 10.1038/s41588-020-00774-y. Epub 2021 Feb 8.
7
Predicting lncRNA-Protein Interactions With miRNAs as Mediators in a Heterogeneous Network Model.在异质网络模型中以miRNA为介导预测lncRNA-蛋白质相互作用
Front Genet. 2020 Jan 22;10:1341. doi: 10.3389/fgene.2019.01341. eCollection 2019.
8
BioSeq-Analysis2.0: an updated platform for analyzing DNA, RNA and protein sequences at sequence level and residue level based on machine learning approaches.BioSeq-Analysis2.0:一个基于机器学习方法的更新平台,用于在序列水平和残基水平上分析 DNA、RNA 和蛋白质序列。
Nucleic Acids Res. 2019 Nov 18;47(20):e127. doi: 10.1093/nar/gkz740.
9
The BioGRID interaction database: 2019 update.生物相互作用数据库(BioGRID):2019 年更新版。
Nucleic Acids Res. 2019 Jan 8;47(D1):D529-D541. doi: 10.1093/nar/gky1079.
10
CPPred-FL: a sequence-based predictor for large-scale identification of cell-penetrating peptides by feature representation learning.CPPred-FL:一种基于序列的预测器,用于通过特征表示学习大规模鉴定细胞穿透肽。
Brief Bioinform. 2020 Jan 17;21(1):11-23. doi: 10.1093/bib/bby091.