利用模糊机器学习模型鉴定蛋白质复合物。

Identifying protein complexes with fuzzy machine learning model.

出版信息

Proteome Sci. 2013 Nov 7;11(Suppl 1):S21. doi: 10.1186/1477-5956-11-S1-S21.

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3908516/

Abstract

BACKGROUND

Many computational approaches have been developed to detect protein complexes from protein-protein interaction (PPI) networks. However, these PPI networks are always built from high-throughput experiments. The presence of unreliable interactions in PPI network makes this task very challenging.

METHODS

In this study, we proposed a Genetic-Algorithm Fuzzy Naïve Bayes (GAFNB) filter to classify the protein complexes from candidate subgraphs. It takes unreliability into consideration and tackles the presence of unreliable interactions in protein complex. We first got candidate protein complexes through existed popular methods. Each candidate protein complex is represented by 29 graph features and 266 biological property based features. GAFNB model is then applied to classify the candidate complexes into positive or negative.

RESULTS

Our evaluation indicates that the protein complex identification algorithms using the GAFNB model filtering outperform original ones. For evaluation of GAFNB model, we also compared the performance of GAFNB with Naïve Bayes (NB). Results show that GAFNB performed better than NB. It indicates that a fuzzy model is more suitable when unreliability is present.

CONCLUSIONS

We conclude that filtering candidate protein complexes with GAFNB model can improve the effectiveness of protein complex identification. It is necessary to consider the unreliability in this task.

摘要

背景

已经开发出许多计算方法来从蛋白质-蛋白质相互作用（PPI）网络中检测蛋白质复合物。然而，这些 PPI 网络通常是从高通量实验中构建的。PPI 网络中存在不可靠的相互作用使得这项任务极具挑战性。

方法

在本研究中，我们提出了一种遗传算法模糊朴素贝叶斯（GAFNB）过滤器，用于从候选子图中分类蛋白质复合物。它考虑了不可靠性，并解决了蛋白质复合物中存在不可靠相互作用的问题。我们首先通过现有的流行方法获得候选蛋白质复合物。每个候选蛋白质复合物由 29 个图特征和 266 个基于生物学特性的特征表示。然后，将 GAFNB 模型应用于分类候选复合物为阳性或阴性。

结果

我们的评估表明，使用 GAFNB 模型过滤候选蛋白质复合物的蛋白质复合物识别算法优于原始算法。为了评估 GAFNB 模型，我们还比较了 GAFNB 与朴素贝叶斯（NB）的性能。结果表明，GAFNB 的性能优于 NB。这表明在存在不可靠性时，模糊模型更适用。

结论

我们得出结论，使用 GAFNB 模型过滤候选蛋白质复合物可以提高蛋白质复合物识别的有效性。在这项任务中，考虑不可靠性是必要的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6581/3908516/ffbcc29f1a20/1477-5956-11-S1-S21-1.jpg

相似文献

Identifying protein complexes with fuzzy machine learning model.利用模糊机器学习模型鉴定蛋白质复合物。

Proteome Sci. 2013 Nov 7;11(Suppl 1):S21. doi: 10.1186/1477-5956-11-S1-S21.

Protein complex identification by integrating protein-protein interaction evidence from multiple sources.通过整合来自多个来源的蛋白质-蛋白质相互作用证据来鉴定蛋白质复合物。

PLoS One. 2013 Dec 27;8(12):e83841. doi: 10.1371/journal.pone.0083841. eCollection 2013.

Filtering Gene Ontology semantic similarity for identifying protein complexes in large protein interaction networks.过滤基因本体语义相似性以识别大型蛋白质相互作用网络中的蛋白质复合物。

Proteome Sci. 2012 Jun 21;10 Suppl 1(Suppl 1):S18. doi: 10.1186/1477-5956-10-S1-S18.

PCE-FR: A Novel Method for Identifying Overlapping Protein Complexes in Weighted Protein-Protein Interaction Networks Using Pseudo-Clique Extension Based on Fuzzy Relation.PCE-FR：一种基于模糊关系的伪团扩展在加权蛋白质-蛋白质相互作用网络中识别重叠蛋白质复合物的新方法。

IEEE Trans Nanobioscience. 2016 Oct;15(7):728-738. doi: 10.1109/TNB.2016.2611683. Epub 2016 Sep 20.

Identifying Protein Complexes With Clear Module Structure Using Pairwise Constraints in Protein Interaction Networks.利用蛋白质相互作用网络中的成对约束识别具有清晰模块结构的蛋白质复合物

Front Genet. 2021 Aug 27;12:664786. doi: 10.3389/fgene.2021.664786. eCollection 2021.

Protein Complexes Detection Based on Semi-Supervised Network Embedding Model.基于半监督网络嵌入模型的蛋白质复合物检测。

IEEE/ACM Trans Comput Biol Bioinform. 2021 Mar-Apr;18(2):797-803. doi: 10.1109/TCBB.2019.2944809. Epub 2021 Apr 8.

Identifying protein complexes based on node embeddings obtained from protein-protein interaction networks.基于从蛋白质-蛋白质相互作用网络中获得的节点嵌入来识别蛋白质复合物。

BMC Bioinformatics. 2018 Sep 21;19(1):332. doi: 10.1186/s12859-018-2364-2.

Combining SVM and ECOC for Identification of Protein Complexes from Protein Protein Interaction Networks by Integrating Amino Acids' Physical Properties and Complex Topology.结合 SVM 和 ECOC 从蛋白质相互作用网络中鉴定蛋白质复合物，整合氨基酸的物理性质和复合物拓扑结构。

Interdiscip Sci. 2020 Sep;12(3):264-275. doi: 10.1007/s12539-020-00369-5. Epub 2020 May 21.

MOEPGA: A novel method to detect protein complexes in yeast protein-protein interaction networks based on MultiObjective Evolutionary Programming Genetic Algorithm.MOEPGA：一种基于多目标进化规划遗传算法检测酵母蛋白质-蛋白质相互作用网络中蛋白质复合物的新方法。

Comput Biol Chem. 2015 Oct;58:173-81. doi: 10.1016/j.compbiolchem.2015.06.006. Epub 2015 Jul 7.

Discovering functional interdependence relationship in PPI networks for protein complex identification.发现蛋白质相互作用网络中的功能相互依赖关系，用于蛋白质复合物识别。

IEEE Trans Biomed Eng. 2012 Apr;59(4):899-908. doi: 10.1109/TBME.2010.2093524. Epub 2010 Nov 18.

引用本文的文献

Genetic Optimization-Based Consensus Control of Multi-Agent 6-DoF UAV System.基于遗传优化的多智能体六自由度无人机系统一致性控制

Sensors (Basel). 2020 Jun 24;20(12):3576. doi: 10.3390/s20123576.

本文引用的文献

Identification of protein complexes from co-immunoprecipitation data.从共免疫沉淀数据中鉴定蛋白质复合物。

Bioinformatics. 2011 Jan 1;27(1):111-7. doi: 10.1093/bioinformatics/btq652. Epub 2010 Nov 25.

Computational approaches for detecting protein complexes from protein interaction networks: a survey.从蛋白质相互作用网络中检测蛋白质复合物的计算方法：综述。

BMC Genomics. 2010 Feb 10;11 Suppl 1(Suppl 1):S3. doi: 10.1186/1471-2164-11-S1-S3.

Identifying protein complexes using hybrid properties.利用混合特性鉴定蛋白质复合物。

J Proteome Res. 2009 Nov;8(11):5212-8. doi: 10.1021/pr900554a.

A core-attachment based method to detect protein complexes in PPI networks.一种基于核心附着的方法来检测蛋白质-蛋白质相互作用网络中的蛋白质复合物。

BMC Bioinformatics. 2009 Jun 2;10:169. doi: 10.1186/1471-2105-10-169.

Complex discovery from weighted PPI networks.基于加权 PPI 网络的复杂发现。

Bioinformatics. 2009 Aug 1;25(15):1891-7. doi: 10.1093/bioinformatics/btp311. Epub 2009 May 12.

Using indirect protein-protein interactions for protein complex prediction.利用间接蛋白质-蛋白质相互作用进行蛋白质复合物预测。

J Bioinform Comput Biol. 2008 Jun;6(3):435-66. doi: 10.1142/s0219720008003497.

Proteome survey reveals modularity of the yeast cell machinery.蛋白质组研究揭示酵母细胞机制的模块化特性。

Nature. 2006 Mar 30;440(7084):631-6. doi: 10.1038/nature04532. Epub 2006 Jan 22.

Discovering reliable protein interactions from high-throughput experimental data using network topology.利用网络拓扑结构从高通量实验数据中发现可靠的蛋白质相互作用。

Artif Intell Med. 2005 Sep-Oct;35(1-2):37-47. doi: 10.1016/j.artmed.2005.02.004.

GO::TermFinder--open source software for accessing Gene Ontology information and finding significantly enriched Gene Ontology terms associated with a list of genes.GO::TermFinder——用于访问基因本体论信息并查找与基因列表相关的显著富集基因本体论术语的开源软件。

Bioinformatics. 2004 Dec 12;20(18):3710-5. doi: 10.1093/bioinformatics/bth456. Epub 2004 Aug 5.

MIPS: analysis and annotation of proteins from whole genomes.MIPS：全基因组蛋白质的分析与注释

Nucleic Acids Res. 2004 Jan 1;32(Database issue):D41-4. doi: 10.1093/nar/gkh092.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

利用模糊机器学习模型鉴定蛋白质复合物。

Identifying protein complexes with fuzzy machine learning model.

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献