利用网络上下文信息预测人类癌症相关基因。

Predicting genes involved in human cancer using network contextual information.

作者信息

Rahmani Hossein, Blockeel Hendrik, Bender Andreas

机构信息

Leiden Institute of Advanced Computer Science, Universiteit Leiden, Niels Bohrweg 1, 2333 CA Leiden, The Netherlands.

出版信息

J Integr Bioinform. 2012 Sep 5;9(1):210. doi: 10.2390/biecoll-jib-2012-210.

DOI:10.2390/biecoll-jib-2012-210

PMID:22948007

Abstract

Protein-Protein Interaction (PPI) networks have been widely used for the task of predicting proteins involved in cancer. Previous research has shown that functional information about the protein for which a prediction is made, proximity to specific other proteins in the PPI network, as well as local network structure are informative features in this respect. In this work, we introduce two new types of input features, reflecting additional information: (1) Functional Context: the functions of proteins interacting with the target protein (rather than the protein itself); and (2) Structural Context: the relative position of the target protein with respect to specific other proteins selected according to a novel ANOVA (analysis of variance) based measure. We also introduce a selection strategy to pinpoint the most informative features. Results show that the proposed feature types and feature selection strategy yield informative features. A standard machine learning method (Naive Bayes) that uses the features proposed here outperforms the current state-of-the-art methods by more than 5% with respect to F-measure. In addition, manual inspection confirms the biological relevance of the top-ranked features.

摘要

蛋白质-蛋白质相互作用（PPI）网络已被广泛用于预测参与癌症的蛋白质的任务。先前的研究表明，关于进行预测的蛋白质的功能信息、在PPI网络中与特定其他蛋白质的接近程度以及局部网络结构在这方面都是有信息价值的特征。在这项工作中，我们引入了两种反映额外信息的新型输入特征：（1）功能上下文：与目标蛋白质相互作用的蛋白质的功能（而非蛋白质本身）；（2）结构上下文：目标蛋白质相对于根据基于新颖的方差分析（ANOVA）的度量选择的特定其他蛋白质的相对位置。我们还引入了一种选择策略来确定最具信息价值的特征。结果表明，所提出的特征类型和特征选择策略产生了有信息价值的特征。使用此处提出的特征的标准机器学习方法（朴素贝叶斯）在F值方面比当前的最先进方法高出5%以上。此外，人工检查证实了排名靠前的特征的生物学相关性。

相似文献

Predicting genes involved in human cancer using network contextual information.利用网络上下文信息预测人类癌症相关基因。

J Integr Bioinform. 2012 Sep 5;9(1):210. doi: 10.2390/biecoll-jib-2012-210.

PPIevo: protein-protein interaction prediction from PSSM based evolutionary information.PPIevo：基于 PSSM 的进化信息的蛋白质-蛋白质相互作用预测。

Genomics. 2013 Oct;102(4):237-42. doi: 10.1016/j.ygeno.2013.05.006. Epub 2013 Jun 6.

Globally predicting protein functions based on co-expressed protein-protein interaction networks and ontology taxonomy similarities.基于共表达蛋白质-蛋白质相互作用网络和本体分类相似性对全球蛋白质功能进行预测。

Gene. 2007 Apr 15;391(1-2):113-9. doi: 10.1016/j.gene.2006.12.008. Epub 2006 Dec 22.

A discriminative approach for identifying domain-domain interactions from protein-protein interactions.一种从蛋白质相互作用中识别结构域-结构域相互作用的判别方法。

Proteins. 2010 Apr;78(5):1243-53. doi: 10.1002/prot.22643.

Assessment of protein domain fusions in human protein interaction networks prediction: application to the human kinetochore model.评估人类蛋白质相互作用网络预测中的蛋白质结构域融合：在人类着丝粒模型中的应用。

N Biotechnol. 2010 Dec 31;27(6):755-65. doi: 10.1016/j.nbt.2010.09.005. Epub 2010 Sep 17.

Prediction of protein-RNA binding sites by a random forest method with combined features.基于组合特征的随机森林方法预测蛋白质-RNA 结合位点。

Bioinformatics. 2010 Jul 1;26(13):1616-22. doi: 10.1093/bioinformatics/btq253. Epub 2010 May 18.

BioPPISVMExtractor: a protein-protein interaction extractor for biomedical literature using SVM and rich feature sets.BioPPISVMExtractor：一种使用 SVM 和丰富特征集的生物医学文献蛋白质-蛋白质相互作用提取器。

J Biomed Inform. 2010 Feb;43(1):88-96. doi: 10.1016/j.jbi.2009.08.013. Epub 2009 Aug 23.

Detection of functional modules from protein interaction networks with an enhanced random walk based algorithm.基于增强随机游走算法从蛋白质相互作用网络中检测功能模块

Int J Comput Biol Drug Des. 2011;4(3):290-306. doi: 10.1504/IJCBDD.2011.041416. Epub 2011 Jul 21.

Protein complex prediction based on simultaneous protein interaction network.基于蛋白质相互作用网络的蛋白质复合物预测。

Bioinformatics. 2010 Feb 1;26(3):385-91. doi: 10.1093/bioinformatics/btp668. Epub 2009 Dec 4.

A novel method for prediction of protein interaction sites based on integrated RBF neural networks.基于集成 RBF 神经网络的蛋白质相互作用位点预测新方法。

Comput Biol Med. 2012 Apr;42(4):402-7. doi: 10.1016/j.compbiomed.2011.12.007. Epub 2012 Jan 9.

引用本文的文献

An integrated network of Arabidopsis growth regulators and its use for gene prioritization.拟南芥生长调节因子的整合网络及其在基因优先级排序中的应用。

Sci Rep. 2015 Dec 1;5:17617. doi: 10.1038/srep17617.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

利用网络上下文信息预测人类癌症相关基因。

Predicting genes involved in human cancer using network contextual information.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献