使用输出核树推断生物网络。

Inferring biological networks with output kernel trees.

作者信息

Geurts Pierre, Touleimat Nizar, Dutreix Marie, d'Alché-Buc Florence

机构信息

IBISC FRE CNRS 2873 & Epigenomics project, GENOPOLE, Evry, France.

出版信息

BMC Bioinformatics. 2007 May 3;8 Suppl 2(Suppl 2):S4. doi: 10.1186/1471-2105-8-S2-S4.

DOI:10.1186/1471-2105-8-S2-S4

PMID:17493253

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1892073/

Abstract

BACKGROUND

Elucidating biological networks between proteins appears nowadays as one of the most important challenges in systems biology. Computational approaches to this problem are important to complement high-throughput technologies and to help biologists in designing new experiments. In this work, we focus on the completion of a biological network from various sources of experimental data.

RESULTS

We propose a new machine learning approach for the supervised inference of biological networks, which is based on a kernelization of the output space of regression trees. It inherits several features of tree-based algorithms such as interpretability, robustness to irrelevant variables, and input scalability. We applied this method to the inference of two kinds of networks in the yeast S. cerevisiae: a protein-protein interaction network and an enzyme network. In both cases, we obtained results competitive with existing approaches. We also show that our method provides relevant insights on input data regarding their potential relationship with the existence of interactions. Furthermore, we confirm the biological validity of our predictions in the context of an analysis of gene expression data.

CONCLUSION

Output kernel tree based methods provide an efficient tool for the inference of biological networks from experimental data. Their simplicity and interpretability should make them of great value for biologists.

摘要

背景

如今，阐明蛋白质之间的生物网络似乎是系统生物学中最重要的挑战之一。针对这个问题的计算方法对于补充高通量技术以及帮助生物学家设计新实验而言至关重要。在这项工作中，我们专注于从各种实验数据源完成生物网络。

结果

我们提出了一种用于生物网络监督推理的新机器学习方法，该方法基于回归树输出空间的核化。它继承了基于树的算法的几个特征，如可解释性、对无关变量的鲁棒性和输入可扩展性。我们将此方法应用于酿酒酵母中两种网络的推理：蛋白质 - 蛋白质相互作用网络和酶网络。在这两种情况下，我们都获得了与现有方法相竞争的结果。我们还表明，我们的方法在输入数据与其相互作用存在的潜在关系方面提供了相关见解。此外，在基因表达数据分析的背景下，我们证实了我们预测的生物学有效性。

结论

基于输出核树的方法为从实验数据推理生物网络提供了一种有效工具。它们的简单性和可解释性应该使其对生物学家具有很大价值。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a9d/1892073/dd40fb41387d/1471-2105-8-S2-S4-1.jpg

相似文献

Inferring biological networks with output kernel trees.

BMC Bioinformatics. 2007 May 3;8 Suppl 2(Suppl 2):S4. doi: 10.1186/1471-2105-8-S2-S4.

Supervised inference of gene-regulatory networks.

BMC Bioinformatics. 2008 Jan 4;9:2. doi: 10.1186/1471-2105-9-2.

Selective integration of multiple biological data for supervised network inference.

Bioinformatics. 2005 May 15;21(10):2488-95. doi: 10.1093/bioinformatics/bti339. Epub 2005 Feb 22.

Validating module network learning algorithms using simulated data.

BMC Bioinformatics. 2007 May 3;8 Suppl 2(Suppl 2):S5. doi: 10.1186/1471-2105-8-S2-S5.

Supervised reconstruction of biological networks with local models.

Bioinformatics. 2007 Jul 1;23(13):i57-65. doi: 10.1093/bioinformatics/btm204.

Boolean dynamics of genetic regulatory networks inferred from microarray time series data.

Bioinformatics. 2007 Apr 1;23(7):866-74. doi: 10.1093/bioinformatics/btm021. Epub 2007 Jan 31.

Gene regulatory network inference: data integration in dynamic models-a review.

Biosystems. 2009 Apr;96(1):86-103. doi: 10.1016/j.biosystems.2008.12.004. Epub 2008 Dec 27.

Learning kernels from biological networks by maximizing entropy.

Bioinformatics. 2004 Aug 4;20 Suppl 1:i326-33. doi: 10.1093/bioinformatics/bth906.

Protein network inference from multiple genomic data: a supervised approach.

Bioinformatics. 2004 Aug 4;20 Suppl 1:i363-70. doi: 10.1093/bioinformatics/bth910.

Learning regulatory programs that accurately predict differential expression with MEDUSA.

Ann N Y Acad Sci. 2007 Dec;1115:178-202. doi: 10.1196/annals.1407.020. Epub 2007 Oct 12.

引用本文的文献

Longitudinal multiple sclerosis lesion segmentation: Resource and challenge.

Neuroimage. 2017 Mar 1;148:77-102. doi: 10.1016/j.neuroimage.2016.12.064. Epub 2017 Jan 11.

Multi-Output Decision Trees for Lesion Segmentation in Multiple Sclerosis.

Proc SPIE Int Soc Opt Eng. 2015 Feb;9413. doi: 10.1117/12.2082157. Epub 2015 Mar 20.

Machine Learning of Protein Interactions in Fungal Secretory Pathways.

PLoS One. 2016 Jul 21;11(7):e0159302. doi: 10.1371/journal.pone.0159302. eCollection 2016.

Classifying pairs with trees for supervised biological network inference.

Mol Biosyst. 2015 Aug;11(8):2116-25. doi: 10.1039/c5mb00174a.

On protocols and measures for the validation of supervised methods for the inference of biological networks.

Front Genet. 2013 Dec 3;4:262. doi: 10.3389/fgene.2013.00262.

本文引用的文献

The budding yeast rRNA and ribosome biosynthesis (RRB) regulon contains over 200 genes.

Yeast. 2006 Mar;23(4):293-306. doi: 10.1002/yea.1353.

A haploid-specific transcriptional response to irradiation in Saccharomyces cerevisiae.

Nucleic Acids Res. 2005 Nov 30;33(20):6635-43. doi: 10.1093/nar/gki959. Print 2005.

BiNGO: a Cytoscape plugin to assess overrepresentation of gene ontology categories in biological networks.

Bioinformatics. 2005 Aug 15;21(16):3448-9. doi: 10.1093/bioinformatics/bti551. Epub 2005 Jun 21.

Supervised enzyme network inference from the integration of genomic data and chemical information.

Bioinformatics. 2005 Jun;21 Suppl 1:i468-77. doi: 10.1093/bioinformatics/bti1012.

Kernel methods for predicting protein-protein interactions.

Bioinformatics. 2005 Jun;21 Suppl 1:i38-46. doi: 10.1093/bioinformatics/bti1016.

Selective integration of multiple biological data for supervised network inference.

Bioinformatics. 2005 May 15;21(10):2488-95. doi: 10.1093/bioinformatics/bti339. Epub 2005 Feb 22.

STRING: known and predicted protein-protein associations, integrated and transferred across organisms.

Nucleic Acids Res. 2005 Jan 1;33(Database issue):D433-7. doi: 10.1093/nar/gki005.

Protein network inference from multiple genomic data: a supervised approach.

Bioinformatics. 2004 Aug 4;20 Suppl 1:i363-70. doi: 10.1093/bioinformatics/bth910.

Evidence for dynamically organized modularity in the yeast protein-protein interaction network.

Nature. 2004 Jul 1;430(6995):88-93. doi: 10.1038/nature02555. Epub 2004 Jun 9.

The KEGG resource for deciphering the genome.

Nucleic Acids Res. 2004 Jan 1;32(Database issue):D277-80. doi: 10.1093/nar/gkh063.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

使用输出核树推断生物网络。

Inferring biological networks with output kernel trees.

作者信息

Geurts Pierre, Touleimat Nizar, Dutreix Marie, d'Alché-Buc Florence

机构信息

IBISC FRE CNRS 2873 & Epigenomics project, GENOPOLE, Evry, France.