InPrePPI: an integrated evaluation method based on genomic context for predicting protein-protein interactions in prokaryotic genomes.

Author information

Sun Jingchun, Sun Yan, Ding Guohui, Liu Qi, Wang Chuan, He Youyu, Shi Tieliu, Li Yixue, Zhao Zhongming

Affiliation

Virginia Institute for Psychiatric and Behavioral Genetics and Department of Psychiatry, Virginia Commonwealth University, Richmond, VA 23298, USA.

Publication information

BMC Bioinformatics. 2007 Oct 26;8:414. doi: 10.1186/1471-2105-8-414.

Abstract

BACKGROUND

Although many genomic features have been used in the prediction of protein-protein interactions (PPIs), frequently only one is used in a computational method. After realizing the limited power in the prediction using only one genomic feature, investigators are now moving toward integration. So far, there have been few integration studies for PPI prediction; one failed to yield appreciable improvement of prediction and the others did not conduct performance comparison. It remains unclear whether an integration of multiple genomic features can improve the PPI prediction and, if it can, how to integrate these features.

RESULTS

In this study, we first performed a systematic evaluation on the PPI prediction in Escherichia coli (E. coli) by four genomic context based methods: the phylogenetic profile method, the gene cluster method, the gene fusion method, and the gene neighbor method. The number of predicted PPIs and the average degree in the predicted PPI networks varied greatly among the four methods. Further, no method outperformed the others when we tested using three well-defined positive datasets from the KEGG, EcoCyc, and DIP databases. Based on these comparisons, we developed a novel integrated method, named InPrePPI. InPrePPI first normalizes the AC value (an integrated value of the accuracy and coverage) of each method using three positive datasets, then calculates a weight for each method, and finally uses the weight to calculate an integrated score for each protein pair predicted by the four genomic context based methods. We demonstrate that InPrePPI outperforms each of the four individual methods and, in general, the other two existing integrated methods: the joint observation method and the integrated prediction method in STRING. These four methods and InPrePPI are implemented in a user-friendly web interface.
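The three steps described above (normalize AC values per positive dataset, derive one weight per method, sum the weights into a score per protein pair) can be sketched as follows. This is only an illustrative reading of the abstract, not the authors' implementation: the abstract does not give the formulas, so the geometric-mean AC value, the sum-to-one normalization, the averaging across datasets, and the additive score are all assumptions, and every accuracy, coverage, and predicted pair below is made-up toy data.

```python
from collections import defaultdict
from math import sqrt


def ac_value(accuracy, coverage):
    # ASSUMPTION: combine accuracy and coverage as a geometric mean.
    # The abstract only says AC is "an integrated value" of the two.
    return sqrt(accuracy * coverage)


def method_weights(performance):
    """performance: {dataset: {method: (accuracy, coverage)}} -> {method: weight}."""
    totals = defaultdict(float)
    for per_method in performance.values():
        ac = {m: ac_value(a, c) for m, (a, c) in per_method.items()}
        norm = sum(ac.values())
        for m, value in ac.items():
            # ASSUMPTION: normalize so AC values sum to 1 within each dataset.
            totals[m] += value / norm if norm else 0.0
    # ASSUMPTION: a method's weight is its mean normalized AC across datasets.
    return {m: total / len(performance) for m, total in totals.items()}


def integrated_scores(predictions, weights):
    """predictions: {method: set of protein pairs} -> {pair: integrated score}."""
    scores = defaultdict(float)
    for method, pairs in predictions.items():
        for pair in pairs:
            # ASSUMPTION: a pair's score is the sum of the weights of the
            # methods that predict it.
            scores[pair] += weights[method]
    return dict(scores)


# Toy (accuracy, coverage) values, invented for illustration only.
performance = {
    "KEGG": {"phylogenetic_profile": (0.40, 0.10), "gene_cluster": (0.60, 0.20),
             "gene_fusion": (0.80, 0.05), "gene_neighbor": (0.50, 0.30)},
    "EcoCyc": {"phylogenetic_profile": (0.35, 0.12), "gene_cluster": (0.55, 0.25),
               "gene_fusion": (0.70, 0.04), "gene_neighbor": (0.45, 0.28)},
    "DIP": {"phylogenetic_profile": (0.20, 0.08), "gene_cluster": (0.30, 0.15),
            "gene_fusion": (0.50, 0.03), "gene_neighbor": (0.25, 0.20)},
}

# Toy predictions: each pair is a sorted tuple of protein names.
predictions = {
    "phylogenetic_profile": {("thrA", "thrB")},
    "gene_cluster": {("thrA", "thrB")},
    "gene_fusion": {("thrA", "thrB")},
    "gene_neighbor": {("thrA", "thrB"), ("lacY", "lacZ")},
}

weights = method_weights(performance)
scores = integrated_scores(predictions, weights)
```

Under these assumptions the weights sum to 1, so a pair supported by all four methods scores 1 and a pair supported by a single method scores that method's weight. A real pipeline would also need a score threshold chosen against the positive datasets.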

CONCLUSION

This study evaluated PPI prediction by four genomic context based methods and presents an integrated evaluation method, InPrePPI, that outperforms each of the four individual methods in E. coli.
