Töpfer Nadine, Fuchs Lisa-Maria, Aharoni Asaph
Department of Plant and Environmental Sciences, Weizmann Institute of Science, Rehovot, Israel.
Nucleic Acids Res. 2017 Jul 7;45(12):7049-7063. doi: 10.1093/nar/gkx404.
The existence of Metabolic Gene Clusters (MGCs) in plant genomes has recently raised increased interest. Thus far, MGCs were commonly identified for pathways of specialized metabolism, mostly those associated with terpene type products. For efficient identification of novel MGCs, computational approaches are essential. Here, we present PhytoClust; a tool for the detection of candidate MGCs in plant genomes. The algorithm employs a collection of enzyme families related to plant specialized metabolism, translated into hidden Markov models, to mine given genome sequences for physically co-localized metabolic enzymes. Our tool accurately identifies previously characterized plant MGCs. An exhaustive search of 31 plant genomes detected 1232 and 5531 putative gene cluster types and candidates, respectively. Clustering analysis of putative MGCs types by species reflected plant taxonomy. Furthermore, enrichment analysis revealed taxa- and species-specific enrichment of certain enzyme families in MGCs. When operating through our web-interface, PhytoClust users can mine a genome either based on a list of known cluster types or by defining new cluster rules. Moreover, for selected plant species, the output can be complemented by co-expression analysis. Altogether, we envisage PhytoClust to enhance novel MGCs discovery which will in turn impact the exploration of plant metabolism.
植物基因组中代谢基因簇(MGCs)的存在最近引起了越来越多的关注。到目前为止,MGCs通常在特殊代谢途径中被鉴定出来,主要是那些与萜类产物相关的途径。为了有效地鉴定新的MGCs,计算方法至关重要。在这里,我们介绍了PhytoClust;一种用于检测植物基因组中候选MGCs的工具。该算法采用了一系列与植物特殊代谢相关的酶家族,并将其转化为隐马尔可夫模型,以在给定的基因组序列中挖掘物理上共定位的代谢酶。我们的工具能够准确识别先前已表征的植物MGCs。对31个植物基因组进行详尽搜索,分别检测到1232种和5531种推定的基因簇类型及候选基因。按物种对推定的MGCs类型进行聚类分析反映了植物分类学。此外,富集分析揭示了MGCs中某些酶家族的分类群和物种特异性富集。通过我们的网络界面运行时,PhytoClust用户可以根据已知的簇类型列表或通过定义新的簇规则来挖掘基因组。此外,对于选定的植物物种,输出结果可以通过共表达分析进行补充。总之,我们设想PhytoClust能够加强新MGCs的发现,这反过来又将影响对植物代谢的探索。