• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于张量的双随机游走模型在蛋白质功能预测中的应用。

A tensor-based bi-random walks model for protein function prediction.

机构信息

College of Computer Engineering and Applied Mathematics, Changsha University, Changsha, 410022, Hunan, China.

Hunan Provincial Key Laboratory of Industrial Internet Technology and Security, Changsha University, Changsha, 410022, Hunan, China.

出版信息

BMC Bioinformatics. 2022 May 30;23(1):199. doi: 10.1186/s12859-022-04747-2.

DOI:10.1186/s12859-022-04747-2
PMID:35637427
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9150346/
Abstract

BACKGROUND

The accurate characterization of protein functions is critical to understanding life at the molecular level and has a huge impact on biomedicine and pharmaceuticals. Computationally predicting protein function has been studied in the past decades. Plagued by noise and errors in protein-protein interaction (PPI) networks, researchers have undertaken to focus on the fusion of multi-omics data in recent years. A data model that appropriately integrates network topologies with biological data and preserves their intrinsic characteristics is still a bottleneck and an aspirational goal for protein function prediction.

RESULTS

In this paper, we propose the RWRT (Random Walks with Restart on Tensor) method to accomplish protein function prediction by applying bi-random walks on the tensor. RWRT firstly constructs a functional similarity tensor by combining protein interaction networks with multi-omics data derived from domain annotation and protein complex information. After this, RWRT extends the bi-random walks algorithm from a two-dimensional matrix to the tensor for scoring functional similarity between proteins. Finally, RWRT filters out possible pretenders based on the concept of cohesiveness coefficient and annotates target proteins with functions of the remaining functional partners. Experimental results indicate that RWRT performs significantly better than the state-of-the-art methods and improves the area under the receiver-operating curve (AUROC) by no less than 18%.

CONCLUSIONS

The functional similarity tensor offers us an alternative, in that it is a collection of networks sharing the same nodes; however, the edges belong to different categories or represent interactions of different nature. We demonstrate that the tensor-based random walk model can not only discover more partners with similar functions but also free from the constraints of errors in protein interaction networks effectively. We believe that the performance of function prediction depends greatly on whether we can extract and exploit proper functional similarity information on protein correlations.

摘要

背景

准确描述蛋白质功能对于理解分子水平的生命至关重要,对生物医学和制药领域有巨大影响。在过去几十年中,研究人员一直在研究计算预测蛋白质功能。由于蛋白质-蛋白质相互作用(PPI)网络中的噪声和错误,近年来,研究人员已着手专注于融合多组学数据。适当整合网络拓扑结构和生物数据并保留其内在特征的数据模型仍然是蛋白质功能预测的瓶颈和理想目标。

结果

在本文中,我们提出了 RWRT(张量上的带重启动随机游走)方法,通过在张量上应用双随机游走来完成蛋白质功能预测。RWRT 首先通过将蛋白质相互作用网络与源自域注释和蛋白质复合物信息的多组学数据相结合,构建功能相似性张量。然后,RWRT 将双随机游走算法从二维矩阵扩展到张量,以对蛋白质之间的功能相似性进行评分。最后,RWRT 根据内聚系数的概念过滤掉可能的冒充者,并根据剩余功能伙伴的功能对目标蛋白质进行注释。实验结果表明,RWRT 明显优于最先进的方法,并且将接收者操作特征曲线下的面积(AUROC)提高了至少 18%。

结论

功能相似性张量提供了一种替代方法,因为它是一个共享相同节点的网络集合;但是,边属于不同的类别或代表不同性质的相互作用。我们证明,基于张量的随机游走模型不仅可以发现更多具有相似功能的伙伴,而且还可以有效地摆脱蛋白质相互作用网络中错误的限制。我们相信,功能预测的性能在很大程度上取决于我们是否能够提取和利用蛋白质相关性上的适当功能相似信息。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa71/9150346/e4641796b8ff/12859_2022_4747_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa71/9150346/a064ca9d18b2/12859_2022_4747_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa71/9150346/345e3231d8c4/12859_2022_4747_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa71/9150346/d68a223f4fe4/12859_2022_4747_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa71/9150346/9c0e19793ad6/12859_2022_4747_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa71/9150346/8199f3ab14d8/12859_2022_4747_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa71/9150346/01fea1bc5516/12859_2022_4747_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa71/9150346/41721bc591fd/12859_2022_4747_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa71/9150346/8492bb770e4e/12859_2022_4747_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa71/9150346/e4641796b8ff/12859_2022_4747_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa71/9150346/a064ca9d18b2/12859_2022_4747_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa71/9150346/345e3231d8c4/12859_2022_4747_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa71/9150346/d68a223f4fe4/12859_2022_4747_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa71/9150346/9c0e19793ad6/12859_2022_4747_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa71/9150346/8199f3ab14d8/12859_2022_4747_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa71/9150346/01fea1bc5516/12859_2022_4747_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa71/9150346/41721bc591fd/12859_2022_4747_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa71/9150346/8492bb770e4e/12859_2022_4747_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa71/9150346/e4641796b8ff/12859_2022_4747_Fig9_HTML.jpg

相似文献

1
A tensor-based bi-random walks model for protein function prediction.基于张量的双随机游走模型在蛋白质功能预测中的应用。
BMC Bioinformatics. 2022 May 30;23(1):199. doi: 10.1186/s12859-022-04747-2.
2
Protein function annotation based on heterogeneous biological networks.基于异质生物网络的蛋白质功能注释。
BMC Bioinformatics. 2022 Nov 18;23(1):493. doi: 10.1186/s12859-022-05057-3.
3
Isoform function prediction based on bi-random walks on a heterogeneous network.基于异构网络上双随机游走的同工型功能预测。
Bioinformatics. 2020 Jan 1;36(1):303-310. doi: 10.1093/bioinformatics/btz535.
4
Predicting protein functions by using unbalanced bi-random walk algorithm on protein-protein interaction network and functional interrelationship network.基于蛋白质-蛋白质相互作用网络和功能相互关系网络,运用非平衡双随机游走算法预测蛋白质功能。
Curr Protein Pept Sci. 2014;15(6):529-39. doi: 10.2174/1389203715666140724085224.
5
NPF:network propagation for protein function prediction.NPF:用于蛋白质功能预测的网络传播。
BMC Bioinformatics. 2020 Aug 12;21(1):355. doi: 10.1186/s12859-020-03663-7.
6
Identifying Cancer genes by combining two-rounds RWR based on multiple biological data.基于多种生物数据的两轮 RWR 相结合识别癌症基因。
BMC Bioinformatics. 2019 Nov 25;20(Suppl 18):518. doi: 10.1186/s12859-019-3123-8.
7
A novel method to predict essential proteins based on tensor and HITS algorithm.一种基于张量和HITS算法预测必需蛋白质的新方法。
Hum Genomics. 2020 Apr 6;14(1):14. doi: 10.1186/s40246-020-00263-7.
8
Multiple similarity drug-target interaction prediction with random walks and matrix factorization.基于随机游走和矩阵分解的多重相似药物-靶标相互作用预测。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac353.
9
Laplacian normalization and bi-random walks on heterogeneous networks for predicting lncRNA-disease associations.基于异质网络的拉普拉斯归一化和双随机游走预测lncRNA与疾病的关联
BMC Syst Biol. 2018 Dec 31;12(Suppl 9):122. doi: 10.1186/s12918-018-0660-0.
10
Effective comparative analysis of protein-protein interaction networks by measuring the steady-state network flow using a Markov model.通过使用马尔可夫模型测量稳态网络流对蛋白质-蛋白质相互作用网络进行有效的比较分析。
BMC Bioinformatics. 2016 Oct 6;17(Suppl 13):395. doi: 10.1186/s12859-016-1215-2.

引用本文的文献

1
Protein function annotation based on heterogeneous biological networks.基于异质生物网络的蛋白质功能注释。
BMC Bioinformatics. 2022 Nov 18;23(1):493. doi: 10.1186/s12859-022-05057-3.

本文引用的文献

1
Improving cancer driver gene identification using multi-task learning on graph convolutional network.利用图卷积网络的多任务学习提高癌症驱动基因识别
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab432.
2
A multi-objective genetic algorithm to find active modules in multiplex biological networks.一种用于在多重生物网络中发现活性模块的多目标遗传算法。
PLoS Comput Biol. 2021 Aug 30;17(8):e1009263. doi: 10.1371/journal.pcbi.1009263. eCollection 2021 Aug.
3
Structure-based protein function prediction using graph convolutional networks.
基于结构的蛋白质功能预测使用图卷积网络。
Nat Commun. 2021 May 26;12(1):3168. doi: 10.1038/s41467-021-23303-9.
4
NPF:network propagation for protein function prediction.NPF:用于蛋白质功能预测的网络传播。
BMC Bioinformatics. 2020 Aug 12;21(1):355. doi: 10.1186/s12859-020-03663-7.
5
Identifying and ranking potential cancer drivers using representation learning on attributed network.基于属性网络的表示学习识别和排序潜在的癌症驱动基因。
Methods. 2021 Aug;192:13-24. doi: 10.1016/j.ymeth.2020.07.013. Epub 2020 Aug 3.
6
Metric learning on expression data for gene function prediction.基于表达数据的度量学习进行基因功能预测。
Bioinformatics. 2020 Feb 15;36(4):1182-1190. doi: 10.1093/bioinformatics/btz731.
7
An iteration method for identifying yeast essential proteins from heterogeneous network.从异质网络中鉴定酵母必需蛋白的迭代方法。
BMC Bioinformatics. 2019 Jun 24;20(1):355. doi: 10.1186/s12859-019-2930-2.
8
INGA 2.0: improving protein function prediction for the dark proteome.INGA 2.0:改进黑暗蛋白质组中蛋白质功能的预测。
Nucleic Acids Res. 2019 Jul 2;47(W1):W373-W378. doi: 10.1093/nar/gkz375.
9
An Entropy-Based Method for Identifying Mutual Exclusive Driver Genes in Cancer.基于熵的方法鉴定癌症中相互排斥的驱动基因
IEEE/ACM Trans Comput Biol Bioinform. 2020 May-Jun;17(3):758-768. doi: 10.1109/TCBB.2019.2897931. Epub 2019 Feb 7.
10
The BioGRID interaction database: 2019 update.生物相互作用数据库(BioGRID):2019 年更新版。
Nucleic Acids Res. 2019 Jan 8;47(D1):D529-D541. doi: 10.1093/nar/gky1079.