Institut Curie, Paris, France; INSERM U900, Paris, France; Mines ParisTech, Fontainebleau, France.
Division of Genetics & Genomics, The Roslin Institute, The University of Edinburgh, Easter Bush, Midlothian, United Kingdom.
PLoS Comput Biol. 2015 Feb 13;11(2):e1003983. doi: 10.1371/journal.pcbi.1003983. eCollection 2015 Feb.
Module network inference is an established statistical method to reconstruct co-expression modules and their upstream regulatory programs from integrated multi-omics datasets measuring the activity levels of various cellular components across different individuals, experimental conditions or time points of a dynamic process. We have developed Lemon-Tree, an open-source, platform-independent, modular, extensible software package implementing state-of-the-art ensemble methods for module network inference. We benchmarked Lemon-Tree using large-scale tumor datasets and showed that Lemon-Tree algorithms compare favorably with state-of-the-art module network inference software. We also analyzed a large dataset of somatic copy-number alterations and gene expression levels measured in glioblastoma samples from The Cancer Genome Atlas and found that Lemon-Tree correctly identifies known glioblastoma oncogenes and tumor suppressors as master regulators in the inferred module network. Novel candidate driver genes predicted by Lemon-Tree were validated using tumor pathway and survival analyses. Lemon-Tree is available from http://lemon-tree.googlecode.com under the GNU General Public License version 2.0.
模块网络推断是一种成熟的统计方法,可用于从整合的多组学数据集中重建共表达模块及其上游调控程序,这些数据集中测量了不同个体、实验条件或动态过程中不同时间点的各种细胞成分的活性水平。我们开发了 Lemon-Tree,这是一个开源的、与平台无关的、模块化的、可扩展的软件包,实现了用于模块网络推断的最先进的集成方法。我们使用大规模肿瘤数据集对 Lemon-Tree 进行了基准测试,并表明 Lemon-Tree 算法与最先进的模块网络推断软件相比具有优势。我们还分析了来自癌症基因组图谱的胶质母细胞瘤样本中测量的体细胞拷贝数改变和基因表达水平的大型数据集,发现 Lemon-Tree 可以正确识别已知的胶质母细胞瘤癌基因和肿瘤抑制基因作为推断的模块网络中的主要调控因子。使用肿瘤途径和生存分析验证了 Lemon-Tree 预测的新候选驱动基因。Lemon-Tree 可从 http://lemon-tree.googlecode.com 获得,根据 GNU 通用公共许可证第 2.0 版授权。