• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于蛋白质复合物预测的二进制矩阵分解算法。

A binary matrix factorization algorithm for protein complex prediction.

机构信息

Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong.

Bioinformatics Laboratory and National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101.

出版信息

Proteome Sci. 2011 Oct 14;9 Suppl 1(Suppl 1):S18. doi: 10.1186/1477-5956-9-S1-S18.

DOI:10.1186/1477-5956-9-S1-S18
PMID:22166008
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3724484/
Abstract

BACKGROUND

Identifying biologically relevant protein complexes from a large protein-protein interaction (PPI) network, is essential to understand the organization of biological systems. However, high-throughput experimental techniques that can produce a large amount of PPIs are known to yield non-negligible rates of false-positives and false-negatives, making the protein complexes difficult to be identified.

RESULTS

We propose a binary matrix factorization (BMF) algorithm under the Bayesian Ying-Yang (BYY) harmony learning, to detect protein complexes by clustering the proteins which share similar interactions through factorizing the binary adjacent matrix of a PPI network. The proposed BYY-BMF algorithm automatically determines the cluster number while this number is pre-given for most existing BMF algorithms. Also, BYY-BMF's clustering results does not depend on any parameters or thresholds, unlike the Markov Cluster Algorithm (MCL) that relies on a so-called inflation parameter. On synthetic PPI networks, the predictions evaluated by the known annotated complexes indicate that BYY-BMF is more robust than MCL for most cases. On real PPI networks from the MIPS and DIP databases, BYY-BMF obtains a better balanced prediction accuracies than MCL and a spectral analysis method, while MCL has its own advantages, e.g., with good separation values.

摘要

背景

从大型蛋白质相互作用(PPI)网络中识别具有生物学意义的蛋白质复合物对于理解生物系统的组织至关重要。然而,能够产生大量 PPI 的高通量实验技术已知会产生不可忽略的假阳性和假阴性率,使得蛋白质复合物难以识别。

结果

我们提出了一种基于贝叶斯阴阳(BYY)和谐学习的二值矩阵分解(BMF)算法,通过对 PPI 网络的二值邻接矩阵进行因子分解,对具有相似相互作用的蛋白质进行聚类,从而检测蛋白质复合物。所提出的 BYY-BMF 算法在大多数现有 BMF 算法中预先给定聚类数量的情况下自动确定聚类数量。此外,BYY-BMF 的聚类结果不依赖于任何参数或阈值,而与依赖于所谓的膨胀参数的 Markov 聚类算法(MCL)不同。在合成 PPI 网络上,通过已知注释复合物评估的预测表明,对于大多数情况,BYY-BMF 比 MCL 更稳健。在来自 MIPS 和 DIP 数据库的真实 PPI 网络上,BYY-BMF 获得了比 MCL 和光谱分析方法更好的平衡预测准确性,而 MCL 有其自身的优势,例如具有良好的分离值。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d549/3724484/701dd728a11f/1477-5956-9-S1-S18-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d549/3724484/4857edf200b3/1477-5956-9-S1-S18-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d549/3724484/8d1b4a02ae60/1477-5956-9-S1-S18-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d549/3724484/701dd728a11f/1477-5956-9-S1-S18-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d549/3724484/4857edf200b3/1477-5956-9-S1-S18-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d549/3724484/8d1b4a02ae60/1477-5956-9-S1-S18-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d549/3724484/701dd728a11f/1477-5956-9-S1-S18-3.jpg

相似文献

1
A binary matrix factorization algorithm for protein complex prediction.一种用于蛋白质复合物预测的二进制矩阵分解算法。
Proteome Sci. 2011 Oct 14;9 Suppl 1(Suppl 1):S18. doi: 10.1186/1477-5956-9-S1-S18.
2
MCL-CAw: a refinement of MCL for detecting yeast complexes from weighted PPI networks by incorporating core-attachment structure.MCL-CAw:一种改进的 MCL 方法,用于通过整合核心附着结构,从加权 PPI 网络中检测酵母复合物。
BMC Bioinformatics. 2010 Oct 12;11:504. doi: 10.1186/1471-2105-11-504.
3
Evaluation of clustering algorithms for protein-protein interaction networks.蛋白质-蛋白质相互作用网络聚类算法的评估
BMC Bioinformatics. 2006 Nov 6;7:488. doi: 10.1186/1471-2105-7-488.
4
FCMAC-BYY: fuzzy CMAC using Bayesian Ying-Yang learning.FCMAC - BYY:使用贝叶斯阴阳学习的模糊小脑模型关节控制器
IEEE Trans Syst Man Cybern B Cybern. 2006 Oct;36(5):1180-90. doi: 10.1109/tsmcb.2006.874691.
5
BYY harmony learning, structural RPCL, and topological self-organizing on mixture models.基于混合模型的BYY和谐学习、结构RPCL和拓扑自组织。
Neural Netw. 2002 Oct-Nov;15(8-9):1125-51. doi: 10.1016/s0893-6080(02)00084-9.
6
Markov clustering versus affinity propagation for the partitioning of protein interaction graphs.用于蛋白质相互作用图划分的马尔可夫聚类与亲和传播算法
BMC Bioinformatics. 2009 Mar 30;10:99. doi: 10.1186/1471-2105-10-99.
7
Detecting overlapping protein complexes in PPI networks based on robustness.基于稳健性检测蛋白质相互作用网络中的重叠蛋白质复合物。
Proteome Sci. 2013 Nov 7;11(Suppl 1):S18. doi: 10.1186/1477-5956-11-S1-S18.
8
A multi-network clustering method for detecting protein complexes from multiple heterogeneous networks.一种用于从多个异构网络中检测蛋白质复合物的多网络聚类方法。
BMC Bioinformatics. 2017 Dec 1;18(Suppl 13):463. doi: 10.1186/s12859-017-1877-4.
9
Protein complex prediction via verifying and reconstructing the topology of domain-domain interactions.通过验证和重建结构域-结构域相互作用的拓扑结构来预测蛋白质复合物。
BMC Bioinformatics. 2010 Jun 28;11:350. doi: 10.1186/1471-2105-11-350.
10
Refining Markov Clustering for protein complex prediction by incorporating core-attachment structure.通过整合核心-附属结构优化马尔可夫聚类用于蛋白质复合物预测
Genome Inform. 2009 Oct;23(1):159-68.

引用本文的文献

1
Hematological and gene co-expression network analyses of high-risk beef cattle defines immunological mechanisms and biological complexes involved in bovine respiratory disease and weight gain.高危肉牛的血液学和基因共表达网络分析定义了牛呼吸道疾病和增重相关的免疫机制和生物学复合物。
PLoS One. 2022 Nov 3;17(11):e0277033. doi: 10.1371/journal.pone.0277033. eCollection 2022.
2
Identifying complexes from protein interaction networks according to different types of neighborhood density.根据不同类型的邻域密度从蛋白质相互作用网络中识别复合物。
J Comput Biol. 2012 Dec;19(12):1284-94. doi: 10.1089/cmb.2012.0195.

本文引用的文献

1
NeAT: a toolbox for the analysis of biological networks, clusters, classes and pathways.NeAT:用于分析生物网络、簇、类别和通路的工具箱。
Nucleic Acids Res. 2008 Jul 1;36(Web Server issue):W444-51. doi: 10.1093/nar/gkn336. Epub 2008 Jun 4.
2
Network-based prediction of protein function.基于网络的蛋白质功能预测。
Mol Syst Biol. 2007;3:88. doi: 10.1038/msb4100129. Epub 2007 Mar 13.
3
Evaluation of clustering algorithms for protein-protein interaction networks.蛋白质-蛋白质相互作用网络聚类算法的评估
BMC Bioinformatics. 2006 Nov 6;7:488. doi: 10.1186/1471-2105-7-488.
4
A systematic comparison and evaluation of biclustering methods for gene expression data.基因表达数据双聚类方法的系统比较与评估
Bioinformatics. 2006 May 1;22(9):1122-9. doi: 10.1093/bioinformatics/btl060. Epub 2006 Feb 24.
5
How biologically relevant are interaction-based modules in protein networks?蛋白质网络中基于相互作用的模块在生物学上的相关性如何?
Genome Biol. 2004;5(11):R93. doi: 10.1186/gb-2004-5-11-r93. Epub 2004 Nov 1.
6
Structure-based assembly of protein complexes in yeast.酵母中基于结构的蛋白质复合物组装
Science. 2004 Mar 26;303(5666):2026-9. doi: 10.1126/science.1092645.
7
Topological structure analysis of the protein-protein interaction network in budding yeast.芽殖酵母中蛋白质-蛋白质相互作用网络的拓扑结构分析
Nucleic Acids Res. 2003 May 1;31(9):2443-50. doi: 10.1093/nar/gkg340.
8
An efficient algorithm for large-scale detection of protein families.一种用于大规模检测蛋白质家族的高效算法。
Nucleic Acids Res. 2002 Apr 1;30(7):1575-84. doi: 10.1093/nar/30.7.1575.
9
DIP, the Database of Interacting Proteins: a research tool for studying cellular networks of protein interactions.DIP,即相互作用蛋白质数据库:一种用于研究蛋白质相互作用细胞网络的研究工具。
Nucleic Acids Res. 2002 Jan 1;30(1):303-5. doi: 10.1093/nar/30.1.303.
10
Saccharomyces Genome Database (SGD) provides secondary gene annotation using the Gene Ontology (GO).酵母基因组数据库(SGD)使用基因本体论(GO)提供二级基因注释。
Nucleic Acids Res. 2002 Jan 1;30(1):69-72. doi: 10.1093/nar/30.1.69.