基于扩散的蛋白质功能网络预测的新方向：置信度整合途径。

New directions for diffusion-based network prediction of protein function: incorporating pathways with confidence.

机构信息

Department of Computer Science, Tufts University, Medford, MA 02155, USA and Department of Computer Science, University of Minnesota, Minneapolis, MN 55455, USA.

出版信息

Bioinformatics. 2014 Jun 15;30(12):i219-27. doi: 10.1093/bioinformatics/btu263.

DOI:10.1093/bioinformatics/btu263

PMID:24931987

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4058952/

Abstract

MOTIVATION

It has long been hypothesized that incorporating models of network noise as well as edge directions and known pathway information into the representation of protein-protein interaction (PPI) networks might improve their utility for functional inference. However, a simple way to do this has not been obvious. We find that diffusion state distance (DSD), our recent diffusion-based metric for measuring dissimilarity in PPI networks, has natural extensions that incorporate confidence, directions and can even express coherent pathways by calculating DSD on an augmented graph.

RESULTS

We define three incremental versions of DSD which we term cDSD, caDSD and capDSD, where the capDSD matrix incorporates confidence, known directed edges, and pathways into the measure of how similar each pair of nodes is according to the structure of the PPI network. We test four popular function prediction methods (majority vote, weighted majority vote, multi-way cut and functional flow) using these different matrices on the Baker's yeast PPI network in cross-validation. The best performing method is weighted majority vote using capDSD. We then test the performance of our augmented DSD methods on an integrated heterogeneous set of protein association edges from the STRING database. The superior performance of capDSD in this context confirms that treating the pathways as probabilistic units is more powerful than simply incorporating pathway edges independently into the network.

AVAILABILITY

All source code for calculating the confidences, for extracting pathway information from KEGG XML files, and for calculating the cDSD, caDSD and capDSD matrices are available from http://dsd.cs.tufts.edu/capdsd

摘要

动机

长期以来，人们一直假设将网络噪声模型以及边缘方向和已知的途径信息纳入蛋白质-蛋白质相互作用（PPI）网络的表示中，可以提高其进行功能推断的效用。然而，一种简单的方法并不明显。我们发现，我们最近基于扩散的 PPI 网络差异度量方法扩散状态距离（DSD）具有自然扩展，可以通过在扩充图上计算 DSD 来纳入置信度、方向，甚至可以表达连贯的途径。

结果

我们定义了三个增量版本的 DSD，分别称为 cDSD、caDSD 和 capDSD，其中 capDSD 矩阵将置信度、已知有向边和途径纳入了根据 PPI 网络结构测量每对节点相似程度的度量中。我们在贝克氏酵母 PPI 网络的交叉验证中使用这些不同的矩阵测试了四种流行的功能预测方法（多数投票、加权多数投票、多向切割和功能流）。表现最好的方法是使用 capDSD 的加权多数投票。然后，我们在 STRING 数据库中的综合异构蛋白质关联边缘集上测试了我们扩充的 DSD 方法的性能。在这种情况下，capDSD 的优越性能证实了将途径视为概率单元比简单地将途径边缘独立地纳入网络更有效。

可用性

计算置信度、从 KEGG XML 文件提取途径信息以及计算 cDSD、caDSD 和 capDSD 矩阵的所有源代码均可从 http://dsd.cs.tufts.edu/capdsd 获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d8a/4058952/33b3f34faf8b/btu263f1.jpg

相似文献

New directions for diffusion-based network prediction of protein function: incorporating pathways with confidence.

Bioinformatics. 2014 Jun 15;30(12):i219-27. doi: 10.1093/bioinformatics/btu263.

Fitting a geometric graph to a protein-protein interaction network.

Bioinformatics. 2008 Apr 15;24(8):1093-9. doi: 10.1093/bioinformatics/btn079. Epub 2008 Mar 14.

A novel link prediction algorithm for reconstructing protein-protein interaction networks by topological similarity.

Bioinformatics. 2013 Feb 1;29(3):355-64. doi: 10.1093/bioinformatics/bts688. Epub 2012 Dec 11.

Graphlet-based edge clustering reveals pathogen-interacting proteins.

Bioinformatics. 2012 Sep 15;28(18):i480-i486. doi: 10.1093/bioinformatics/bts376.

Majority Vote Cascading: A Semi-Supervised Framework for Improving Protein Function Prediction.

IEEE/ACM Trans Comput Biol Bioinform. 2022 Jul-Aug;19(4):1933-1945. doi: 10.1109/TCBB.2021.3059812. Epub 2022 Aug 8.

Connectedness of PPI network neighborhoods identifies regulatory hub proteins.

Bioinformatics. 2011 Apr 15;27(8):1135-42. doi: 10.1093/bioinformatics/btr099. Epub 2011 Mar 2.

Topological and functional comparison of community detection algorithms in biological networks.

BMC Bioinformatics. 2019 Apr 27;20(1):212. doi: 10.1186/s12859-019-2746-0.

Detangling PPI networks to uncover functionally meaningful clusters.

BMC Syst Biol. 2018 Mar 21;12(Suppl 3):24. doi: 10.1186/s12918-018-0550-5.

A multi-network clustering method for detecting protein complexes from multiple heterogeneous networks.

BMC Bioinformatics. 2017 Dec 1;18(Suppl 13):463. doi: 10.1186/s12859-017-1877-4.

RedNemo: topology-based PPI network reconstruction via repeated diffusion with neighborhood modifications.

Bioinformatics. 2017 Feb 15;33(4):537-544. doi: 10.1093/bioinformatics/btw655.

引用本文的文献

Exploring weighting schemes for the discovery of informative generalized between pathway models to uncover pathways in genetic interaction networks.

Sci Rep. 2025 Aug 18;15(1):30169. doi: 10.1038/s41598-025-16353-2.

Cerebrospinal Fluid (CSF) Proteomic Signature in Preclinical and Clinical Alzheimer's disease (AD): Role of Adhesion Molecules.

Res Sq. 2025 Jan 16:rs.3.rs-5404760. doi: 10.21203/rs.3.rs-5404760/v1.

Decoding the Functional Interactome of Non-Model Organisms with PHILHARMONIC.

bioRxiv. 2025 Jan 14:2024.10.25.620267. doi: 10.1101/2024.10.25.620267.

D'or: deep orienter of protein-protein interaction networks.

Bioinformatics. 2024 Jul 1;40(7). doi: 10.1093/bioinformatics/btae355.

Identifying Protein Phosphorylation Site-Disease Associations Based on Multi-Similarity Fusion and Negative Sample Selection by Convolutional Neural Network.

Interdiscip Sci. 2024 Sep;16(3):649-664. doi: 10.1007/s12539-024-00615-0. Epub 2024 Mar 8.

The Extent of Edgetic Perturbations in the Human Interactome Caused by Population-Specific Mutations.

Biomolecules. 2023 Dec 27;14(1):0. doi: 10.3390/biom14010040.

Explainable Multilayer Graph Neural Network for cancer gene prediction.

Bioinformatics. 2023 Nov 1;39(11). doi: 10.1093/bioinformatics/btad643.

iLncDA-RSN: identification of lncRNA-disease associations based on reliable similarity networks.

Front Genet. 2023 Aug 8;14:1249171. doi: 10.3389/fgene.2023.1249171. eCollection 2023.

Assessing network-based methods in the context of system toxicology.

Front Pharmacol. 2023 Jul 12;14:1225697. doi: 10.3389/fphar.2023.1225697. eCollection 2023.

Diffusion-enhanced characterization of 3D chromatin structure reveals its linkage to gene regulatory networks and the interactome.

Genome Res. 2023 Aug;33(8):1354-1368. doi: 10.1101/gr.277737.123. Epub 2023 Jul 25.

本文引用的文献

Going the distance for protein function prediction: a new distance metric for protein interaction networks.

PLoS One. 2013 Oct 23;8(10):e76339. doi: 10.1371/journal.pone.0076339. eCollection 2013.

Protein function prediction by massive integration of evolutionary analyses and multiple data sources.

BMC Bioinformatics. 2013;14 Suppl 3(Suppl 3):S1. doi: 10.1186/1471-2105-14-S3-S1. Epub 2013 Feb 28.

A gene ontology inferred from molecular networks.

Nat Biotechnol. 2013 Jan;31(1):38-45. doi: 10.1038/nbt.2463.

STRING v9.1: protein-protein interaction networks, with increased coverage and integration.

Nucleic Acids Res. 2013 Jan;41(Database issue):D808-15. doi: 10.1093/nar/gks1094. Epub 2012 Nov 29.

Systematic differences in signal emitting and receiving revealed by PageRank analysis of a human protein interactome.

PLoS One. 2012;7(9):e44872. doi: 10.1371/journal.pone.0044872. Epub 2012 Sep 19.

MINT, the molecular interaction database: 2012 update.

Nucleic Acids Res. 2012 Jan;40(Database issue):D857-61. doi: 10.1093/nar/gkr930. Epub 2011 Nov 16.

Vavien: an algorithm for prioritizing candidate disease genes based on topological similarity of proteins in interaction networks.

J Comput Biol. 2011 Nov;18(11):1561-74. doi: 10.1089/cmb.2011.0154. Epub 2011 Oct 28.

Discovering pathways by orienting edges in protein interaction networks.

Nucleic Acids Res. 2011 Mar;39(4):e22. doi: 10.1093/nar/gkq1207. Epub 2010 Nov 24.

Associating genes and protein complexes with disease via network propagation.

PLoS Comput Biol. 2010 Jan 15;6(1):e1000641. doi: 10.1371/journal.pcbi.1000641.

Spectral affinity in protein networks.

BMC Syst Biol. 2009 Nov 29;3:112. doi: 10.1186/1752-0509-3-112.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于扩散的蛋白质功能网络预测的新方向：置信度整合途径。

New directions for diffusion-based network prediction of protein function: incorporating pathways with confidence.

机构信息

出版信息

MOTIVATION

RESULTS

AVAILABILITY

动机

结果

可用性

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献