Identifying protein complexes with fuzzy machine learning model.

Publication information

Proteome Sci. 2013 Nov 7;11(Suppl 1):S21. doi: 10.1186/1477-5956-11-S1-S21.

Abstract

BACKGROUND

Many computational approaches have been developed to detect protein complexes from protein-protein interaction (PPI) networks. However, these PPI networks are typically built from high-throughput experiments, and the presence of unreliable interactions in them makes this task very challenging.

METHODS

In this study, we propose a Genetic-Algorithm Fuzzy Naïve Bayes (GAFNB) filter that classifies candidate subgraphs as protein complexes or non-complexes. The filter takes unreliability into account and so copes with unreliable interactions inside protein complexes. We first obtain candidate protein complexes with existing popular methods. Each candidate protein complex is represented by 29 graph features and 266 features based on biological properties. The GAFNB model is then applied to classify the candidate complexes as positive or negative.
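To make the feature representation concrete, here is a minimal sketch of how a few graph features could be computed for one candidate subgraph. The abstract does not list the 29 graph features, so the five below (size, density, reliability-weighted density, mean and maximum degree) are illustrative assumptions, as are the function name `graph_features` and the edge format.

```python
def graph_features(nodes, weighted_edges):
    """Return a few illustrative graph features for one candidate subgraph.

    nodes: iterable of protein identifiers in the candidate.
    weighted_edges: dict mapping frozenset({u, v}) -> reliability in [0, 1]
        (hypothetical format; the edges may cover the whole PPI network).
    """
    node_set = set(nodes)
    n = len(node_set)
    possible = n * (n - 1) / 2  # number of node pairs inside the candidate

    # Keep only interactions whose two endpoints are both in the candidate.
    inside = {e: w for e, w in weighted_edges.items() if e <= node_set}

    degree = {v: 0 for v in node_set}
    for edge in inside:
        for v in edge:
            degree[v] += 1

    return {
        "size": n,
        "density": len(inside) / possible if possible else 0.0,
        # Same as density, but each edge counts by its reliability.
        "weighted_density": sum(inside.values()) / possible if possible else 0.0,
        "mean_degree": sum(degree.values()) / n if n else 0.0,
        "max_degree": max(degree.values(), default=0),
    }
```

A candidate would be passed as, for example, `graph_features(["A", "B", "C"], edges)`, and the resulting dictionary would form part of the feature vector handed to the classifier.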

RESULTS

Our evaluation indicates that protein complex identification algorithms whose output is filtered by the GAFNB model outperform the original algorithms. To evaluate the GAFNB model itself, we also compared its performance with that of Naïve Bayes (NB). GAFNB performed better than NB, which indicates that a fuzzy model is more suitable when unreliability is present.
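The difference between a fuzzy and a crisp Naïve Bayes model can be sketched as follows. This is one possible formulation, not the GAFNB model itself: the abstract does not give the membership functions or say what the genetic algorithm optimizes, so the triangular sets `low`/`mid`/`high`, the Laplace smoothing, and all function names are assumptions, and the genetic-algorithm step is left out entirely. Each feature value in [0, 1] belongs partly to several fuzzy sets, and those degrees are used as soft counts instead of assigning the value to a single bin.

```python
from collections import defaultdict

def memberships(x):
    """Degrees of membership of x in three triangular fuzzy sets.

    The sets are centred at 0, 0.5 and 1 and the degrees sum to 1.
    """
    x = min(max(x, 0.0), 1.0)
    low = max(0.0, 1.0 - 2.0 * x)
    high = max(0.0, 2.0 * x - 1.0)
    return {"low": low, "mid": 1.0 - low - high, "high": high}

def crisp(x):
    """One-hot counterpart of memberships(): ordinary discretisation."""
    m = memberships(x)
    best = max(m, key=m.get)
    return {name: float(name == best) for name in m}

def train(samples, labels, member=memberships):
    """Estimate P(class) and P(fuzzy set | class) for each feature."""
    class_count = defaultdict(float)
    set_count = defaultdict(float)  # (class, feature index, set) -> soft count
    for x, y in zip(samples, labels):
        class_count[y] += 1.0
        for i, xi in enumerate(x):
            for name, mu in member(xi).items():
                set_count[(y, i, name)] += mu
    total = sum(class_count.values())
    prior = {c: n / total for c, n in class_count.items()}

    def cond(c, i, name):
        # Laplace smoothing over the three fuzzy sets.
        return (set_count[(c, i, name)] + 1.0) / (class_count[c] + 3.0)

    return prior, cond

def posterior(model, x, member=memberships):
    """Normalised class scores: prior times the per-feature fuzzy likelihoods."""
    prior, cond = model
    score = {}
    for c, p in prior.items():
        for i, xi in enumerate(x):
            p *= sum(mu * cond(c, i, name) for name, mu in member(xi).items())
        score[c] = p
    z = sum(score.values())
    return {c: s / z for c, s in score.items()}
```

Passing `member=crisp` to both `train` and `posterior` gives the ordinary discretised Naïve Bayes baseline, so the two variants can be compared on the same data.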

CONCLUSIONS

We conclude that filtering candidate protein complexes with the GAFNB model can improve the effectiveness of protein complex identification, and that unreliability needs to be considered in this task.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6581/3908516/ffbcc29f1a20/1477-5956-11-S1-S21-1.jpg
