RIKEN Plant Science Center, Yokohama, Kanagawa 230-0045, Japan.
Plant Physiol. 2012 Apr;158(4):1487-502. doi: 10.1104/pp.111.188367. Epub 2012 Feb 3.
Gene-to-gene coexpression analysis provides fundamental information and is a promising approach for predicting unknown gene functions in plants. We investigated various associations in the gene expression of tomato (Solanum lycopersicum) to predict unknown gene functions in an unbiased manner. We obtained more than 300 microarrays from publicly available databases and our own hybridizations, and here, we present tomato coexpression networks and coexpression modules. The topological characteristics of the networks were highly heterogenous. We extracted 465 total coexpression modules from the data set by graph clustering, which allows users to divide a graph effectively into a set of clusters. Of these, 88% were assigned systematically by Gene Ontology terms. Our approaches revealed functional modules in the tomato transcriptome data; the predominant functions of coexpression modules were biologically relevant. We also investigated differential coexpression among data sets consisting of leaf, fruit, and root samples to gain further insights into the tomato transcriptome. We now demonstrate that (1) duplicated genes, as well as metabolic genes, exhibit a small but significant number of differential coexpressions, and (2) a reversal of gene coexpression occurred in two metabolic pathways involved in lycopene and flavonoid biosynthesis. Independent experimental verification of the findings for six selected genes was done using quantitative real-time polymerase chain reaction. Our findings suggest that differential coexpression may assist in the investigation of key regulatory steps in metabolic pathways. The approaches and results reported here will be useful to prioritize candidate genes for further functional genomics studies of tomato metabolism.
基因间共表达分析提供了基本信息,是一种预测植物中未知基因功能的有前途的方法。我们以无偏倚的方式研究了番茄(Solanum lycopersicum)基因表达中的各种关联,以预测未知基因的功能。我们从公共数据库和我们自己的杂交获得了 300 多个微阵列,在这里,我们展示了番茄共表达网络和共表达模块。网络的拓扑特征高度异质。我们通过图聚类从数据集提取了 465 个总共表达模块,这允许用户有效地将图划分为一组簇。其中,88%的模块通过基因本体论术语系统地分配。我们的方法揭示了番茄转录组数据中的功能模块;共表达模块的主要功能是生物学相关的。我们还研究了由叶、果实和根样本组成的数据集之间的差异共表达,以进一步深入了解番茄转录组。我们现在表明:(1)复制基因和代谢基因表现出少量但显著的差异共表达;(2)番茄红素和类黄酮生物合成两个代谢途径中的基因共表达发生逆转。对六个选定基因的发现进行了定量实时聚合酶链反应的独立实验验证。我们的研究结果表明,差异共表达可能有助于研究代谢途径中的关键调控步骤。这里报告的方法和结果将有助于确定番茄代谢的进一步功能基因组学研究的候选基因。