Department 1 (Willmitzer), Central Metabolism, Max Planck Institute for Plant Physiology Potsdam, Germany.
Front Plant Sci. 2014 Mar 17;5:75. doi: 10.3389/fpls.2014.00075. eCollection 2014.
Whole genome sequencing and the relative ease of transcript profiling have facilitated the collection and data warehousing of immense quantities of expression data. However, a substantial proportion of genes are not yet functionally annotated a problem which is particularly acute for transport proteins. In Arabidopsis, for example, only a minor fraction of the estimated 700 intracellular transporters have been identified at the molecular genetic level. Furthermore it is only within the last couple of years that critical genes such as those encoding the final transport step required for the long distance transport of sucrose and the first transporter of the core photorespiratory pathway have been identified. Here we will describe how transcriptional coordination between genes of known function and non-annotated genes allows the identification of putative transporters on the premise that such co-expressed genes tend to be functionally related. We will additionally extend this to include the expansion of this approach to include phenotypic information from other levels of cellular organization such as proteomic and metabolomic data and provide case studies wherein this approach has successfully been used to fill knowledge gaps in important metabolic pathways and physiological processes.
全基因组测序和相对容易的转录谱分析促进了大量表达数据的收集和数据存储。然而,相当一部分基因尚未进行功能注释——这对于转运蛋白来说是一个特别严重的问题。例如,在拟南芥中,只有一小部分估计的 700 种细胞内转运蛋白在分子遗传水平上被鉴定出来。此外,直到最近几年,关键基因如编码蔗糖长距离运输所需的最后运输步骤的基因和核心光呼吸途径的第一个转运蛋白的基因才被鉴定出来。在这里,我们将描述如何在已知功能基因和未注释基因之间进行转录协调,从而识别推定的转运蛋白,前提是这种共表达的基因往往具有功能相关性。我们还将把这种方法扩展到包括来自细胞组织其他层次的表型信息,如蛋白质组学和代谢组学数据,并提供成功使用这种方法来填补重要代谢途径和生理过程中的知识空白的案例研究。