Department of Behavioral Neuroscience, Oregon Health & Science University, Portland, OR, USA.
BMC Genomics. 2010 Oct 19;11:585. doi: 10.1186/1471-2164-11-585.
The current study focused on the extent genetic diversity within a species (Mus musculus) affects gene co-expression network structure. To examine this issue, we have created a new mouse resource, a heterogeneous stock (HS) formed from the same eight inbred strains that have been used to create the collaborative cross (CC). The eight inbred strains capture > 90% of the genetic diversity available within the species. For contrast with the HS-CC, a C57BL/6J (B6) × DBA/2J (D2) F2 intercross and the HS4, derived from crossing the B6, D2, BALB/cJ and LP/J strains, were used. Brain (striatum) gene expression data were obtained using the Illumina Mouse WG 6.1 array, and the data sets were interrogated using a weighted gene co-expression network analysis (WGCNA).
Genes reliably detected as expressed were similar in all three data sets as was the variability of expression. As measured by the WGCNA, the modular structure of the transcriptome networks was also preserved both on the basis of module assignment and from the perspective of the topological overlap maps. Details of the HS-CC gene modules are provided; essentially identical results were obtained for the HS4 and F2 modules. Gene ontology annotation of the modules revealed a significant overrepresentation in some modules for neuronal processes, e.g., central nervous system development. Integration with known protein-protein interactions data indicated significant enrichment among co-expressed genes. We also noted significant overlap with markers of central nervous system cell types (neurons, oligodendrocytes and astrocytes). Using the Allen Brain Atlas, we found evidence of spatial co-localization within the striatum for several modules. Finally, for some modules it was possible to detect an enrichment of transcription binding sites. The binding site for Wt1, which is associated with neurodegeneration, was the most significantly overrepresented.
Despite the marked differences in genetic diversity, the transcriptome structure was remarkably similar for the F2, HS4 and HS-CC. These data suggest that it should be possible to integrate network data from simple and complex crosses. A careful examination of the HS-CC transcriptome revealed the expected structure for striatal gene expression. Importantly, we demonstrate the integration of anatomical and network expression data.
本研究旨在探讨物种内遗传多样性对基因共表达网络结构的影响。为此,我们构建了一个新的小鼠资源,即由用于构建杂交群体的八个近交系组成的异质群体(HS)。这八个近交系捕获了物种内 90%以上的遗传多样性。为了与 HS-CC(B6 和 D2 的 F2 杂交群体)和 HS4(由 B6、D2、BALB/cJ 和 LP/J 杂交形成)进行对比,我们使用了 C57BL/6J(B6)×DBA/2J(D2)F2 杂交群体和 HS4。使用 Illumina Mouse WG 6.1 芯片获得大脑(纹状体)的基因表达数据,并使用加权基因共表达网络分析(WGCNA)对数据集进行分析。
在所有三个数据集的表达检测中,可靠检测到的基因以及表达的可变性都相似。通过 WGCNA 分析,基于模块分配和拓扑重叠图,转录组网络的模块结构也得以保留。HS-CC 基因模块的详细信息提供;HS4 和 F2 模块得到了几乎相同的结果。对模块的基因本体论注释显示,一些模块中神经元过程的显著过表达,例如中枢神经系统的发育。与已知的蛋白质-蛋白质相互作用数据的整合表明,在共表达基因中存在显著的富集。我们还注意到与中枢神经系统细胞类型(神经元、少突胶质细胞和星形胶质细胞)标志物的显著重叠。使用 Allen 大脑图谱,我们发现了一些模块在纹状体中的空间共定位证据。最后,对于一些模块,可以检测到转录结合位点的富集。与神经退行性变相关的 Wt1 的结合位点是最显著的过表达。
尽管遗传多样性存在显著差异,但 F2、HS4 和 HS-CC 的转录组结构非常相似。这些数据表明,应该有可能整合简单和复杂杂交的网络数据。对 HS-CC 转录组的仔细研究揭示了纹状体基因表达的预期结构。重要的是,我们展示了解剖学和网络表达数据的整合。