Telethon Institute of Genetics and Medicine, Via P. Castellino, Naples, Italy.
Nucleic Acids Res. 2011 Nov 1;39(20):8677-88. doi: 10.1093/nar/gkr593. Epub 2011 Jul 23.
We collected a massive and heterogeneous dataset of 20 255 gene expression profiles (GEPs) from a variety of human samples and experimental conditions, as well as 8895 GEPs from mouse samples. We developed a mutual information (MI) reverse-engineering approach to quantify the extent to which the mRNA levels of two genes are related to each other across the dataset. The resulting networks consist of 4 817 629 connections among 20 255 transcripts in human and 14 461 095 connections among 45 101 transcripts in mouse, with a inter-species conservation of 12%. The inferred connections were compared against known interactions to assess their biological significance. We experimentally validated a subset of not previously described protein-protein interactions. We discovered co-expressed modules within the networks, consisting of genes strongly connected to each other, which carry out specific biological functions, and tend to be in physical proximity at the chromatin level in the nucleus. We show that the network can be used to predict the biological function and subcellular localization of a protein, and to elucidate the function of a disease gene. We experimentally verified that granulin precursor (GRN) gene, whose mutations cause frontotemporal lobar degeneration, is involved in lysosome function. We have developed an online tool to explore the human and mouse gene networks.
我们收集了来自各种人类样本和实验条件的 20255 个基因表达谱(GEP)以及 8895 个来自小鼠样本的 GEP,构建了一个大规模的、异质的数据集。我们开发了一种互信息(MI)反向工程方法,以量化两个基因在整个数据集之间的 mRNA 水平相关性。由此产生的网络包含 20255 个人类转录本之间的 4817629 个连接和 45101 个小鼠转录本之间的 14461095 个连接,种间保守性为 12%。推断的连接与已知的相互作用进行了比较,以评估它们的生物学意义。我们实验验证了一部分以前未描述的蛋白质-蛋白质相互作用。我们在网络中发现了共表达模块,这些模块由彼此强烈连接的基因组成,它们执行特定的生物学功能,并且往往在核染色质水平上具有物理接近性。我们表明,该网络可用于预测蛋白质的生物学功能和亚细胞定位,并阐明疾病基因的功能。我们通过实验验证了颗粒蛋白前体(GRN)基因,其突变导致额颞叶变性,与溶酶体功能有关。我们已经开发了一个在线工具来探索人类和小鼠的基因网络。