Stubbs Lisa, Sun Younguk, Caetano-Anolles Derek
Department of Cell and Developmental Biology, Institute for Genomic Biology, University of Illinois, Urbana, IL, 61801, USA,
Subcell Biochem. 2011;52:75-94. doi: 10.1007/978-90-481-9069-0_4.
Krüppel-type or C2H2 zinc fingers represent a dominant DNA-binding motif in eukaryotic transcription factor (TF) proteins. In Krüppel-type (KZNF) TFs, KZNF motifs are arranged in arrays of three to as many as 40 tandem units, which cooperate to define the unique DNA recognition properties of the protein. Each finger contains four amino acids located at specific positions, which are brought into direct contact with adjacent nucleotides in the DNA sequence as the KZNF array winds around the major groove of the alpha helix. This arrangement creates an intimate and potentially predictable relationship between the amino acid sequence of KZNF arrays and the nucleotide sequence of target binding sites. The large number of possible combinations and arrangements of modular KZNF motifs, and the increasing lengths of KZNF arrays in vertebrate species, has created huge repertoires of functionally unique TF proteins. The properties of this versatile DNA-binding motif have been exploited independently many times over the course of evolution, through attachment to effector motifs that confer activating, repressing or other activities to the proteins. Once created, some of these novel inventions have expanded in specific evolutionary clades, creating large families of TFs that are lineage- or species-unique. This chapter reviews the properties and their remarkable evolutionary history of eukaryotic KZNF TF proteins, with special focus on large families that dominate the TF landscapes in different metazoan species.
克鲁ppel型或C2H2锌指是真核转录因子(TF)蛋白中占主导地位的DNA结合基序。在克鲁ppel型(KZNF)转录因子中,KZNF基序以三到多达40个串联单元的阵列形式排列,这些单元协同作用以定义蛋白质独特的DNA识别特性。每个锌指包含位于特定位置的四个氨基酸,当KZNF阵列围绕α螺旋的大沟缠绕时,这些氨基酸与DNA序列中的相邻核苷酸直接接触。这种排列在KZNF阵列的氨基酸序列与靶结合位点的核苷酸序列之间建立了密切且可能可预测的关系。模块化KZNF基序的大量可能组合和排列,以及脊椎动物物种中KZNF阵列长度的增加,产生了功能独特的TF蛋白的巨大库。在进化过程中,这种通用DNA结合基序的特性通过与赋予蛋白质激活、抑制或其他活性的效应基序连接而被多次独立利用。一旦产生,这些新发明中的一些在特定的进化分支中扩展,形成了谱系或物种特有的TF大家族。本章回顾了真核KZNF TF蛋白的特性及其非凡的进化史,特别关注在不同后生动物物种中主导TF格局的大家族。