Murali Thilakam, Pacifico Svetlana, Finley Russell L
Center for Molecular Medicine and Genetics, Wayne State University School of Medicine, Detroit, Michigan 48201, USA.
BMC Bioinformatics. 2014 Jun 10;15:177. doi: 10.1186/1471-2105-15-177.
Networks of interacting genes and gene products mediate most cellular and developmental processes. High throughput screening methods combined with literature curation are identifying many of the protein-protein interactions (PPI) and protein-DNA interactions (PDI) that constitute these networks. Most of the detection methods, however, fail to identify the in vivo spatial or temporal context of the interactions. Thus, the interaction data are a composite of the individual networks that may operate in specific tissues or developmental stages. Genome-wide expression data may be useful for filtering interaction data to identify the subnetworks that operate in specific spatial or temporal contexts. Here we take advantage of the extensive interaction and expression data available for Drosophila to analyze how interaction networks may be unique to specific tissues and developmental stages.
We ranked genes on a scale from ubiquitously expressed to tissue or stage specific and examined their interaction patterns. Interestingly, ubiquitously expressed genes have many more interactions among themselves than do non-ubiquitously expressed genes both in PPI and PDI networks. While the PDI network is enriched for interactions between tissue-specific transcription factors and their tissue-specific targets, a preponderance of the PDI interactions are between ubiquitous and non-ubiquitously expressed genes and proteins. In contrast to PDI, PPI networks are depleted for interactions among tissue- or stage- specific proteins, which instead interact primarily with widely expressed proteins. In light of these findings, we present an approach to filter interaction data based on gene expression levels normalized across tissues or developmental stages. We show that this filter (the percent maximum or pmax filter) can be used to identify subnetworks that function within individual tissues or developmental stages.
These observations suggest that protein networks are frequently organized into hubs of widely expressed proteins to which are attached various tissue- or stage-specific proteins. This is consistent with earlier analyses of human PPI data and suggests a similar organization of interaction networks across species. This organization implies that tissue or stage specific networks can be best identified from interactome data by using filters designed to include both ubiquitously expressed and specifically expressed genes and proteins.
相互作用的基因和基因产物网络介导了大多数细胞和发育过程。高通量筛选方法与文献整理相结合,正在识别构成这些网络的许多蛋白质-蛋白质相互作用(PPI)和蛋白质-DNA相互作用(PDI)。然而,大多数检测方法无法识别相互作用的体内空间或时间背景。因此,相互作用数据是可能在特定组织或发育阶段起作用的各个网络的综合。全基因组表达数据可能有助于筛选相互作用数据,以识别在特定空间或时间背景下起作用的子网络。在这里,我们利用果蝇可获得的广泛相互作用和表达数据,来分析相互作用网络如何可能在特定组织和发育阶段具有独特性。
我们根据基因从普遍表达到组织或阶段特异性的程度进行排名,并检查它们的相互作用模式。有趣的是,在PPI和PDI网络中,普遍表达的基因之间的相互作用比非普遍表达的基因之间的相互作用更多。虽然PDI网络富含组织特异性转录因子与其组织特异性靶标之间的相互作用,但PDI相互作用的大多数是在普遍表达和非普遍表达的基因与蛋白质之间。与PDI相反,PPI网络中组织或阶段特异性蛋白质之间的相互作用较少,这些蛋白质主要与广泛表达的蛋白质相互作用。鉴于这些发现,我们提出了一种基于跨组织或发育阶段标准化的基因表达水平来筛选相互作用数据的方法。我们表明,这种筛选器(最大百分比或pmax筛选器)可用于识别在单个组织或发育阶段起作用的子网络。
这些观察结果表明,蛋白质网络经常组织成广泛表达的蛋白质中心,各种组织或阶段特异性蛋白质附着在这些中心上。这与早期对人类PPI数据的分析一致,并表明跨物种的相互作用网络具有类似的组织方式。这种组织方式意味着,通过使用旨在纳入普遍表达和特异性表达的基因与蛋白质的筛选器,可以从相互作用组数据中最好地识别组织或阶段特异性网络。