• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多视角异质分子网络表示学习在蛋白质相互作用预测中的应用。

Multi-view heterogeneous molecular network representation learning for protein-protein interaction prediction.

机构信息

Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi, 830011, China.

University of Chinese Academy of Sciences, Beijing, 100049, China.

出版信息

BMC Bioinformatics. 2022 Jun 16;23(1):234. doi: 10.1186/s12859-022-04766-z.

DOI:10.1186/s12859-022-04766-z
PMID:35710342
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9205098/
Abstract

BACKGROUND

Protein-protein interaction (PPI) plays an important role in regulating cells and signals. Despite the ongoing efforts of the bioassay group, continued incomplete data limits our ability to understand the molecular roots of human disease. Therefore, it is urgent to develop a computational method to predict PPIs from the perspective of molecular system.

METHODS

In this paper, a highly efficient computational model, MTV-PPI, is proposed for PPI prediction based on a heterogeneous molecular network by learning inter-view protein sequences and intra-view interactions between molecules simultaneously. On the one hand, the inter-view feature is extracted from the protein sequence by k-mer method. On the other hand, we use a popular embedding method LINE to encode the heterogeneous molecular network to obtain the intra-view feature. Thus, the protein representation used in MTV-PPI is constructed by the aggregation of its inter-view feature and intra-view feature. Finally, random forest is integrated to predict potential PPIs.

RESULTS

To prove the effectiveness of MTV-PPI, we conduct extensive experiments on a collected heterogeneous molecular network with the accuracy of 86.55%, sensitivity of 82.49%, precision of 89.79%, AUC of 0.9301 and AUPR of 0.9308. Further comparison experiments are performed with various protein representations and classifiers to indicate the effectiveness of MTV-PPI in predicting PPIs based on a complex network.

CONCLUSION

The achieved experimental results illustrate that MTV-PPI is a promising tool for PPI prediction, which may provide a new perspective for the future interactions prediction researches based on heterogeneous molecular network.

摘要

背景

蛋白质-蛋白质相互作用(PPI)在调节细胞和信号方面起着重要作用。尽管生物测定组不断努力,但数据的持续不完整限制了我们理解人类疾病分子根源的能力。因此,迫切需要开发一种从分子系统角度预测 PPI 的计算方法。

方法

本文提出了一种基于异构分子网络的高效计算模型 MTV-PPI,通过同时学习跨视图蛋白质序列和分子内视图相互作用来预测 PPI。一方面,通过 k-mer 方法从蛋白质序列中提取跨视图特征。另一方面,我们使用流行的嵌入方法 LINE 对异构分子网络进行编码,以获得内视图特征。因此,MTV-PPI 中使用的蛋白质表示是通过其跨视图特征和内视图特征的聚合构建的。最后,集成随机森林来预测潜在的 PPI。

结果

为了证明 MTV-PPI 的有效性,我们在收集的异构分子网络上进行了广泛的实验,准确率为 86.55%,灵敏度为 82.49%,精度为 89.79%,AUC 为 0.9301,AUPR 为 0.9308。进一步进行了各种蛋白质表示和分类器的比较实验,以表明 MTV-PPI 在基于复杂网络预测 PPI 方面的有效性。

结论

实验结果表明,MTV-PPI 是一种很有前途的 PPI 预测工具,它可能为未来基于异构分子网络的相互作用预测研究提供新的视角。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5c/9205098/dd4abb478dc9/12859_2022_4766_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5c/9205098/bdfb3dc45eac/12859_2022_4766_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5c/9205098/77510c0ca5fa/12859_2022_4766_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5c/9205098/bd50db3dc2ff/12859_2022_4766_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5c/9205098/d582e1406170/12859_2022_4766_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5c/9205098/28f65636aa85/12859_2022_4766_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5c/9205098/33efb0d6ccdd/12859_2022_4766_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5c/9205098/8ef1666c9322/12859_2022_4766_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5c/9205098/dd4abb478dc9/12859_2022_4766_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5c/9205098/bdfb3dc45eac/12859_2022_4766_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5c/9205098/77510c0ca5fa/12859_2022_4766_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5c/9205098/bd50db3dc2ff/12859_2022_4766_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5c/9205098/d582e1406170/12859_2022_4766_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5c/9205098/28f65636aa85/12859_2022_4766_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5c/9205098/33efb0d6ccdd/12859_2022_4766_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5c/9205098/8ef1666c9322/12859_2022_4766_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5c/9205098/dd4abb478dc9/12859_2022_4766_Fig8_HTML.jpg

相似文献

1
Multi-view heterogeneous molecular network representation learning for protein-protein interaction prediction.多视角异质分子网络表示学习在蛋白质相互作用预测中的应用。
BMC Bioinformatics. 2022 Jun 16;23(1):234. doi: 10.1186/s12859-022-04766-z.
2
SDNN-PPI: self-attention with deep neural network effect on protein-protein interaction prediction.SDNN-PPI:基于深度神经网络的自注意力在蛋白质-蛋白质相互作用预测中的应用。
BMC Genomics. 2022 Jun 27;23(1):474. doi: 10.1186/s12864-022-08687-2.
3
Predicting protein-protein interactions from primary protein sequences using a novel multi-scale local feature representation scheme and the random forest.使用一种新颖的多尺度局部特征表示方案和随机森林从蛋白质一级序列预测蛋白质-蛋白质相互作用。
PLoS One. 2015 May 6;10(5):e0125811. doi: 10.1371/journal.pone.0125811. eCollection 2015.
4
DSSGNN-PPI: A Protein-Protein Interactions prediction model based on Double Structure and Sequence graph neural networks.DSSGNN-PPI:一种基于双结构和序列图神经网络的蛋白质-蛋白质相互作用预测模型。
Comput Biol Med. 2024 Jul;177:108669. doi: 10.1016/j.compbiomed.2024.108669. Epub 2024 May 29.
5
Improved protein-protein interactions prediction via weighted sparse representation model combining continuous wavelet descriptor and PseAA composition.通过结合连续小波描述符和伪氨基酸组成的加权稀疏表示模型改进蛋白质-蛋白质相互作用预测
BMC Syst Biol. 2016 Dec 23;10(Suppl 4):120. doi: 10.1186/s12918-016-0360-6.
6
Improved Prediction of Protein-Protein Interaction Mapping on by Using Amino Acid Sequence Features in a Supervised Learning Framework.利用监督学习框架中的氨基酸序列特征改进蛋白质相互作用预测映射。
Protein Pept Lett. 2021;28(1):74-83. doi: 10.2174/0929866527666200610141258.
7
Predicting Protein-Protein Interactions via Random Ferns with Evolutionary Matrix Representation.基于进化矩阵表示的随机蕨类预测蛋白质-蛋白质相互作用。
Comput Math Methods Med. 2022 Feb 22;2022:7191684. doi: 10.1155/2022/7191684. eCollection 2022.
8
Prediction of protein-protein interactions from protein sequence using local descriptors.利用局部描述符从蛋白质序列预测蛋白质-蛋白质相互作用。
Protein Pept Lett. 2010 Sep;17(9):1085-90. doi: 10.2174/092986610791760306.
9
Prediction of Protein-Protein Interaction via co-occurring Aligned Pattern Clusters.通过共现比对模式簇预测蛋白质-蛋白质相互作用
Methods. 2016 Nov 1;110:26-34. doi: 10.1016/j.ymeth.2016.07.018. Epub 2016 Jul 27.
10
Temporal Protein Complex Identification Based on Dynamic Heterogeneous Protein Information Network Representation Learning.基于动态异构蛋白质信息网络表示学习的时间蛋白质复合物识别
IEEE/ACM Trans Comput Biol Bioinform. 2024 Sep-Oct;21(5):1154-1164. doi: 10.1109/TCBB.2024.3351078. Epub 2024 Oct 9.

引用本文的文献

1
Protein Sequence Analysis landscape: A Systematic Review of Task Types, Databases, Datasets, Word Embeddings Methods, and Language Models.蛋白质序列分析全景:任务类型、数据库、数据集、词嵌入方法和语言模型的系统综述
Database (Oxford). 2025 May 30;2025. doi: 10.1093/database/baaf027.
2
DNA sequence analysis landscape: a comprehensive review of DNA sequence analysis task types, databases, datasets, word embedding methods, and language models.DNA序列分析全景:对DNA序列分析任务类型、数据库、数据集、词嵌入方法和语言模型的全面综述。
Front Med (Lausanne). 2025 Apr 8;12:1503229. doi: 10.3389/fmed.2025.1503229. eCollection 2025.
3

本文引用的文献

1
A deep learning method for repurposing antiviral drugs against new viruses via multi-view nonnegative matrix factorization and its application to SARS-CoV-2.一种通过多视图非负矩阵分解对新病毒进行抗病毒药物再利用的深度学习方法及其在 SARS-CoV-2 中的应用。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab526.
2
An Efficient Computational Model for Large-Scale Prediction of Protein-Protein Interactions Based on Accurate and Scalable Graph Embedding.基于准确且可扩展的图嵌入的蛋白质-蛋白质相互作用大规模预测的高效计算模型。
Front Genet. 2021 Feb 26;12:635451. doi: 10.3389/fgene.2021.635451. eCollection 2021.
3
A survey on computational models for predicting protein-protein interactions.
Prediction of protein interactions with function in protein (de-)phosphorylation.
蛋白质(去)磷酸化过程中具有功能的蛋白质相互作用预测。
PLoS One. 2025 Mar 3;20(3):e0319084. doi: 10.1371/journal.pone.0319084. eCollection 2025.
4
Transitioning from wet lab to artificial intelligence: a systematic review of AI predictors in CRISPR.从湿实验室到人工智能的转变:对CRISPR中人工智能预测因子的系统综述
J Transl Med. 2025 Feb 4;23(1):153. doi: 10.1186/s12967-024-06013-w.
5
GNNGL-PPI: multi-category prediction of protein-protein interactions using graph neural networks based on global graphs and local subgraphs.GNNGL-PPI:基于全局图和局部子图的图神经网络的蛋白质-蛋白质相互作用多类别预测。
BMC Genomics. 2024 May 9;25(1):406. doi: 10.1186/s12864-024-10299-x.
6
Dual-channel hypergraph convolutional network for predicting herb-disease associations.双通道超图卷积网络用于预测草药-疾病关联。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae067.
7
Intelligent Protein Design and Molecular Characterization Techniques: A Comprehensive Review.智能蛋白质设计与分子特征技术:全面综述。
Molecules. 2023 Nov 30;28(23):7865. doi: 10.3390/molecules28237865.
8
RLFDDA: a meta-path based graph representation learning model for drug-disease association prediction.RLFDDA:一种基于元路径的图表示学习模型,用于药物-疾病关联预测。
BMC Bioinformatics. 2022 Dec 1;23(1):516. doi: 10.1186/s12859-022-05069-z.
9
Effectively predicting HIV-1 protease cleavage sites by using an ensemble learning approach.利用集成学习方法有效预测 HIV-1 蛋白酶切割位点。
BMC Bioinformatics. 2022 Oct 27;23(1):447. doi: 10.1186/s12859-022-04999-y.
蛋白质-蛋白质相互作用预测的计算模型研究综述。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab036.
4
HiSCF: leveraging higher-order structures for clustering analysis in biological networks.HiSCF:利用高级结构进行生物网络中的聚类分析。
Bioinformatics. 2021 May 1;37(4):542-550. doi: 10.1093/bioinformatics/btaa775.
5
Multifaceted protein-protein interaction prediction based on Siamese residual RCNN.基于孪生残差 RCNN 的多功能蛋白质-蛋白质相互作用预测。
Bioinformatics. 2019 Jul 15;35(14):i305-i314. doi: 10.1093/bioinformatics/btz328.
6
Protein-protein interaction prediction using a hybrid feature representation and a stacked generalization scheme.使用混合特征表示和堆叠泛化方案进行蛋白质-蛋白质相互作用预测。
BMC Bioinformatics. 2019 Jun 10;20(1):308. doi: 10.1186/s12859-019-2907-1.
7
PPI-Detect: A support vector machine model for sequence-based prediction of protein-protein interactions.PPI-Detect:一种基于序列的蛋白质-蛋白质相互作用预测的支持向量机模型。
J Comput Chem. 2019 Apr 30;40(11):1233-1242. doi: 10.1002/jcc.25780. Epub 2019 Feb 15.
8
miRBase: from microRNA sequences to function.miRBase:从 microRNA 序列到功能。
Nucleic Acids Res. 2019 Jan 8;47(D1):D155-D162. doi: 10.1093/nar/gky1141.
9
Predicting protein-protein interactions through sequence-based deep learning.基于序列的深度学习预测蛋白质-蛋白质相互作用。
Bioinformatics. 2018 Sep 1;34(17):i802-i810. doi: 10.1093/bioinformatics/bty573.
10
LncRNA2Target v2.0: a comprehensive database for target genes of lncRNAs in human and mouse.LncRNA2Target v2.0:一个综合性的数据库,包含人类和小鼠中 lncRNA 的靶基因。
Nucleic Acids Res. 2019 Jan 8;47(D1):D140-D144. doi: 10.1093/nar/gky1051.