从异构数据预测基因的癌症相关性。

Predicting cancer involvement of genes from heterogeneous data.

作者信息

Aragues Ramon, Sander Chris, Oliva Baldo

机构信息

Structural Bioinformatics Lab, (GRIB), Universitat Pompeu Fabra-IMIM, Barcelona Research Park of Biomedicine (PRBB), 08003-Barcelona, Catalonia, Spain.

出版信息

BMC Bioinformatics. 2008 Mar 27;9:172. doi: 10.1186/1471-2105-9-172.

DOI:10.1186/1471-2105-9-172

PMID:18371197

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2330045/

Abstract

BACKGROUND

Systematic approaches for identifying proteins involved in different types of cancer are needed. Experimental techniques such as microarrays are being used to characterize cancer, but validating their results can be a laborious task. Computational approaches are used to prioritize between genes putatively involved in cancer, usually based on further analyzing experimental data.

RESULTS

We implemented a systematic method using the PIANA software that predicts cancer involvement of genes by integrating heterogeneous datasets. Specifically, we produced lists of genes likely to be involved in cancer by relying on: (i) protein-protein interactions; (ii) differential expression data; and (iii) structural and functional properties of cancer genes. The integrative approach that combines multiple sources of data obtained positive predictive values ranging from 23% (on a list of 811 genes) to 73% (on a list of 22 genes), outperforming the use of any of the data sources alone. We analyze a list of 20 cancer gene predictions, finding that most of them have been recently linked to cancer in literature.

CONCLUSION

Our approach to identifying and prioritizing candidate cancer genes can be used to produce lists of genes likely to be involved in cancer. Our results suggest that differential expression studies yielding high numbers of candidate cancer genes can be filtered using protein interaction networks.

摘要

背景

需要有系统的方法来识别参与不同类型癌症的蛋白质。诸如微阵列等实验技术正被用于表征癌症，但验证其结果可能是一项艰巨的任务。计算方法通常基于对实验数据的进一步分析，用于在假定参与癌症的基因之间进行优先级排序。

结果

我们使用PIANA软件实施了一种系统方法，通过整合异构数据集来预测基因与癌症的关联。具体而言，我们通过依赖以下方面生成可能参与癌症的基因列表：（i）蛋白质-蛋白质相互作用；（ii）差异表达数据；以及（iii）癌症基因的结构和功能特性。结合多种数据来源的综合方法获得的阳性预测值范围从23%（在811个基因的列表上）到73%（在22个基因的列表上），优于单独使用任何一种数据来源。我们分析了一份包含20个癌症基因预测的列表，发现其中大多数最近在文献中已与癌症相关联。

结论

我们识别候选癌症基因并对其进行优先级排序的方法可用于生成可能参与癌症的基因列表。我们的结果表明，可使用蛋白质相互作用网络对产生大量候选癌症基因的差异表达研究结果进行筛选。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2181/2330045/757c9635f97a/1471-2105-9-172-1.jpg

相似文献

Predicting cancer involvement of genes from heterogeneous data.

BMC Bioinformatics. 2008 Mar 27;9:172. doi: 10.1186/1471-2105-9-172.

Characterization of protein-interaction networks in tumors.

BMC Bioinformatics. 2007 Jun 27;8:224. doi: 10.1186/1471-2105-8-224.

CGI: a new approach for prioritizing genes by combining gene expression and protein-protein interaction data.

Bioinformatics. 2007 Jan 15;23(2):215-21. doi: 10.1093/bioinformatics/btl569. Epub 2006 Nov 10.

Probabilistic model of the human protein-protein interaction network.

Nat Biotechnol. 2005 Aug;23(8):951-9. doi: 10.1038/nbt1103.

Cluster analysis of networks generated through homology: automatic identification of important protein communities involved in cancer metastasis.

BMC Bioinformatics. 2006 Jan 6;7:2. doi: 10.1186/1471-2105-7-2.

Can we identify cellular pathways implicated in cancer using gene expression data?

Proc IEEE Comput Soc Bioinform Conf. 2003;2:94-103.

Discovering distinct functional modules of specific cancer types using protein-protein interaction networks.

Biomed Res Int. 2015;2015:146365. doi: 10.1155/2015/146365. Epub 2015 Sep 30.

Systems biology approach to identification of biomarkers for metastatic progression in cancer.

BMC Bioinformatics. 2008 Aug 12;9 Suppl 9(Suppl 9):S8. doi: 10.1186/1471-2105-9-S9-S8.

Cancer Progression Prediction Using Gene Interaction Regularized Elastic Net.

IEEE/ACM Trans Comput Biol Bioinform. 2017 Jan-Feb;14(1):145-154. doi: 10.1109/TCBB.2015.2511758. Epub 2015 Dec 23.

Global topological features of cancer proteins in the human interactome.

Bioinformatics. 2006 Sep 15;22(18):2291-7. doi: 10.1093/bioinformatics/btl390. Epub 2006 Jul 14.

引用本文的文献

Predicting links between tumor samples and genes using 2-Layered graph based diffusion approach.

BMC Bioinformatics. 2019 Sep 9;20(1):462. doi: 10.1186/s12859-019-3056-2.

Interactome Analysis of Microtubule-targeting Agents Reveals Cytotoxicity Bases in Normal Cells.

Genomics Proteomics Bioinformatics. 2017 Dec;15(6):352-360. doi: 10.1016/j.gpb.2017.04.006. Epub 2017 Dec 12.

Distinctive Behaviors of Druggable Proteins in Cellular Networks.

PLoS Comput Biol. 2015 Dec 23;11(12):e1004597. doi: 10.1371/journal.pcbi.1004597. eCollection 2015 Dec.

Associations of SNPs located at candidate genes to bovine growth traits, prioritized with an interaction networks construction approach.

BMC Genet. 2015 Jul 22;16:91. doi: 10.1186/s12863-015-0247-3.

Prediction of cancer proteins by integrating protein interaction, domain frequency, and domain interaction data using machine learning algorithms.

Biomed Res Int. 2015;2015:312047. doi: 10.1155/2015/312047. Epub 2015 Mar 17.

Survey of network-based approaches to research of cardiovascular diseases.

Biomed Res Int. 2014;2014:527029. doi: 10.1155/2014/527029. Epub 2014 Mar 20.

A systematic in silico mining of the mechanistic implications and therapeutic potentials of estrogen receptor (ER)-α in breast cancer.

PLoS One. 2014 Mar 10;9(3):e91894. doi: 10.1371/journal.pone.0091894. eCollection 2014.

Network topology reveals key cardiovascular disease genes.

PLoS One. 2013 Aug 15;8(8):e71537. doi: 10.1371/journal.pone.0071537. eCollection 2013.

Search for signatures in miRNAs associated with cancer.

Bioinformation. 2013 Jun 8;9(10):524-7. doi: 10.6026/97320630009524. Print 2013.

A transcriptome-proteome integrated network identifies endoplasmic reticulum thiol oxidoreductase (ERp57) as a hub that mediates bone metastasis.

Mol Cell Proteomics. 2013 Aug;12(8):2111-25. doi: 10.1074/mcp.M112.022772. Epub 2013 Apr 26.

本文引用的文献

The human disease network.

Proc Natl Acad Sci U S A. 2007 May 22;104(21):8685-90. doi: 10.1073/pnas.0701361104. Epub 2007 May 14.

The macrophage-stimulating protein pathway promotes metastasis in a mouse model for breast cancer and predicts poor prognosis in humans.

Proc Natl Acad Sci U S A. 2007 May 1;104(18):7570-5. doi: 10.1073/pnas.0702095104. Epub 2007 Apr 24.

Genetic determinants of cancer metastasis.

Nat Rev Genet. 2007 May;8(5):341-52. doi: 10.1038/nrg2101.

Contribution of oncoproteomics to cancer biomarker discovery.

Mol Cancer. 2007 Apr 2;6:25. doi: 10.1186/1476-4598-6-25.

A human phenome-interactome network of protein complexes implicated in genetic disorders.

Nat Biotechnol. 2007 Mar;25(3):309-16. doi: 10.1038/nbt1295.

From bytes to bedside: data integration and computational biology for translational cancer research.

PLoS Comput Biol. 2007 Feb 23;3(2):e12. doi: 10.1371/journal.pcbi.0030012.

Integrative molecular concept modeling of prostate cancer progression.

Nat Genet. 2007 Jan;39(1):41-51. doi: 10.1038/ng1935. Epub 2006 Dec 17.

Computational prediction of cancer-gene function.

Nat Rev Cancer. 2007 Jan;7(1):23-34. doi: 10.1038/nrc2036. Epub 2006 Dec 14.

IntAct--open source resource for molecular interaction data.

Nucleic Acids Res. 2007 Jan;35(Database issue):D561-5. doi: 10.1093/nar/gkl958. Epub 2006 Dec 1.

The Universal Protein Resource (UniProt).

Nucleic Acids Res. 2007 Jan;35(Database issue):D193-7. doi: 10.1093/nar/gkl929. Epub 2006 Nov 16.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

从异构数据预测基因的癌症相关性。

Predicting cancer involvement of genes from heterogeneous data.

作者信息

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSION

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献