Shelest Ekaterina
Systems biology/Bioinformatics group, Leibniz Institute for Natural Product Research and Infection Biology, Hans Knoell InstituteJena, Germany.
Front Genet. 2017 May 4;8:53. doi: 10.3389/fgene.2017.00053. eCollection 2017.
Transcription factors (TFs) are essential regulators of gene expression in a cell; the entire repertoire of TFs (TFome) of a species reflects its regulatory potential and the evolutionary history of the regulatory mechanisms. In this work, I give an overview of fungal TFs, analyze TFome dynamics, and discuss TF families and types of particular interest. Whole-genome annotation of TFs in more than 200 fungal species revealed ~80 families of TFs that are typically found in fungi. Almost half of the considered genomes belonged to basidiomycetes and zygomycetes, which have been underrepresented in earlier annotations due to dearth of sequenced genomes. The TFomes were analyzed in terms of expansion strategies genome- and lineage-wise. Generally, TFomes are known to correlate with genome size; but what happens to particular families when a TFome is expanding? By dissecting TFomes into single families and estimating the impact of each of them, I show that in fungi the TFome increment is largely limited to three families (C6 Zn clusters, C2H2-like Zn fingers, and homeodomain-like). To see whether this is a fungal peculiarity or a ubiquitous eukaryotic feature, I also analyzed metazoan TFomes, where I observed a similar trend (limited number of TFome-shaping families) but also some important differences connected mostly with the increased complexity in animals. The expansion strategies of TF families are lineage-specific; I demonstrate how the patterns of the TF families' distributions, designated as "TF signatures," can be used as a taxonomic feature, e.g., for allocation of uncertain phyla. In addition, both fungal and metazoan genomes contain an intriguing type of TFs. While usually TFs have a single DNA-binding domain, these TFs possess two (or more) different DNA-binding specificities. I demonstrate that dual-specific TFs comprising various combinations of all major TF families are a typical feature of fungal and animal genomes and have an interesting evolutionary history involving gene duplications and domain losses.
转录因子(TFs)是细胞中基因表达的关键调节因子;一个物种的转录因子全集(转录因子组)反映了其调节潜力以及调节机制的进化历史。在这项工作中,我概述了真菌转录因子,分析了转录因子组动态,并讨论了特别感兴趣的转录因子家族和类型。对200多种真菌物种的转录因子进行全基因组注释,发现了约80个通常在真菌中发现的转录因子家族。几乎一半的被考虑基因组属于担子菌门和接合菌门,由于测序基因组的缺乏,它们在早期注释中代表性不足。从基因组和谱系角度分析了转录因子组的扩展策略。一般来说,已知转录因子组与基因组大小相关;但是当转录因子组扩张时,特定家族会发生什么呢?通过将转录因子组分解为单个家族并估计每个家族的影响,我表明在真菌中,转录因子组的增加在很大程度上局限于三个家族(C6锌簇、C2H2样锌指和类同源域)。为了了解这是真菌特有的现象还是真核生物普遍存在的特征,我还分析了后生动物的转录因子组,在那里我观察到了类似的趋势(转录因子组形成家族数量有限),但也有一些重要的差异,主要与动物中增加的复杂性有关。转录因子家族的扩展策略是谱系特异性的;我展示了如何将转录因子家族分布模式(称为“转录因子特征”)用作分类特征,例如用于不确定门的分类。此外,真菌和后生动物基因组都包含一种有趣的转录因子类型。虽然通常转录因子具有单个DNA结合结构域,但这些转录因子具有两种(或更多)不同的DNA结合特异性。我证明包含所有主要转录因子家族各种组合的双特异性转录因子是真菌和动物基因组的典型特征,并且具有涉及基因重复和结构域丢失的有趣进化历史。