Departamento de Ingeniería de Sistemas Computacionales y Automatización, Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas, Universidad Nacional Autónoma de México, Ciudad Universitaria, México DF, México.
PLoS One. 2013 Jul 29;8(7):e69707. doi: 10.1371/journal.pone.0069707. Print 2013.
In this work, the content of enzymes and DNA-binding transcription factors (TFs) in 794 non-redundant prokaryotic genomes was evaluated. The identification of enzymes was based on annotations deposited in the KEGG database as well as in databases of functional domains (COG and PFAM) and structural domains (Superfamily). For identifications of the TFs, hidden Markov profiles were constructed based on well-known transcriptional regulatory families. From these analyses, we obtained diverse and interesting results, such as the negative rate of incremental changes in the number of detected enzymes with respect to the genome size. On the contrary, for TFs the rate incremented as the complexity of genome increased. This inverse related performance shapes the diversity of metabolic and regulatory networks and impacts the availability of enzymes and TFs. Furthermore, the intersection of the derivatives between enzymes and TFs was identified at 9,659 genes, after this point, the regulatory complexity grows faster than metabolic complexity. In addition, TFs have a low number of duplications, in contrast to the apparent high number of duplications associated with enzymes. Despite the greater number of duplicated enzymes versus TFs, the increment by which duplicates appear is higher in TFs. A lower proportion of enzymes among archaeal genomes (22%) than in the bacterial ones (27%) was also found. This low proportion might be compensated by the interconnection between the metabolic pathways in Archaea. A similar proportion was also found for the archaeal TFs, for which the formation of regulatory complexes has been proposed. Finally, an enrichment of multifunctional enzymes in Bacteria, as a mechanism of ecological adaptation, was detected.
在这项工作中,评估了 794 个非冗余原核基因组中酶和 DNA 结合转录因子(TFs)的含量。酶的鉴定基于 KEGG 数据库以及功能域(COG 和 PFAM)和结构域(Superfamily)数据库中存储的注释。对于 TFs 的鉴定,基于已知的转录调控家族构建了隐马尔可夫轮廓。从这些分析中,我们获得了各种各样的有趣结果,例如,检测到的酶的数量与基因组大小的增量变化的负率。相反,对于 TFs,随着基因组复杂度的增加,速率增加。这种相反的表现形式塑造了代谢和调控网络的多样性,并影响了酶和 TFs 的可用性。此外,在 9659 个基因处鉴定了酶和 TFs 的导数的交集,在此之后,调控复杂性的增长速度快于代谢复杂性。此外,TFs 的复制数量较少,而与酶相关的明显高数量的复制则相反。尽管与 TFs 相比,酶的重复数量更多,但 TFs 中的重复出现的增量更高。在古菌基因组(22%)中发现的酶的比例低于细菌(27%)。这一低比例可能被古菌代谢途径之间的互联所弥补。在古菌 TFs 中也发现了类似的比例,有人提出了形成调控复合物的机制。最后,检测到细菌中多功能酶的富集,作为生态适应的一种机制。