Lawrence Berkeley National Laboratory, Berkeley, California, USA.
J Bacteriol. 2013 Jan;195(1):29-38. doi: 10.1128/JB.01977-12. Epub 2012 Oct 19.
Accurate detection of transcriptional regulatory elements is essential for high-quality genome annotation, metabolic reconstruction, and modeling of regulatory networks. We developed a computational approach for reconstruction of regulons operated by transcription factors (TFs) from large protein families and applied this novel approach to three TF families in 10 Desulfovibrionales genomes. Phylogenetic analyses of 125 regulators from the ArsR, Crp/Fnr, and GntR families revealed that 65% of these regulators (termed reference TFs) are well conserved in Desulfovibrionales, while the remaining 35% of regulators (termed singleton TFs) are species specific and show a mosaic distribution. For regulon reconstruction in the group of singleton TFs, the standard orthology-based approach was inefficient, and thus, we developed a novel approach based on the simultaneous study of all homologous TFs from the same family in a group of genomes. As a result, we identified binding for 21 singleton TFs and for all reference TFs in all three analyzed families. Within each TF family we observed structural similarities between DNA-binding motifs of different reference and singleton TFs. The collection of reconstructed regulons is available at the RegPrecise database (http://regprecise.lbl.gov/RegPrecise/Desulfovibrionales.jsp).
准确检测转录调控元件对于高质量的基因组注释、代谢重建和调控网络建模至关重要。我们开发了一种从大型蛋白质家族中重建转录因子(TFs)调控子的计算方法,并将该新方法应用于 10 个脱硫弧菌目中的 3 个 TF 家族。来自 ArsR、Crp/Fnr 和 GntR 家族的 125 个调节剂的系统发育分析表明,这些调节剂中有 65%(称为参考 TF)在脱硫弧菌目中高度保守,而其余 35%(称为单态 TF)是物种特异性的,呈镶嵌分布。对于单态 TF 组的调控子重建,基于标准直系同源的方法效率低下,因此,我们开发了一种基于同时研究同一家族中一组基因组中所有同源 TF 的新方法。结果,我们鉴定了 21 个单态 TF 和所有三种分析家族中所有参考 TF 的结合。在每个 TF 家族中,我们观察到不同参考和单态 TF 的 DNA 结合基序之间存在结构相似性。重建的调控子集合可在 RegPrecise 数据库(http://regprecise.lbl.gov/RegPrecise/Desulfovibrionales.jsp)中获得。