Laboratory for Behavioral and Developmental Disorders, RIKEN Brain Science Institute, Wako-shi, Saitama 351-0198, Japan.
BMC Evol Biol. 2010 Feb 19;10:53. doi: 10.1186/1471-2148-10-53.
The C2H2 zinc finger (ZF) domain is widely conserved among eukaryotic proteins. In Zic/Gli/Zap1 C2H2 ZF proteins, the two N-terminal ZFs form a single structural unit by sharing a hydrophobic core. This structural unit defines a new motif comprised of two tryptophan side chains at the center of the hydrophobic core. Because each tryptophan residue is located between the two cysteine residues of the C2H2 motif, we have named this structure the tandem CWCH2 (tCWCH2) motif.
Here, we characterized 587 tCWCH2-containing genes using data derived from public databases. We categorized genes into 11 classes including Zic/Gli/Glis, Arid2/Rsc9, PacC, Mizf, Aebp2, Zap1/ZafA, Fungl, Zfp106, Twincl, Clr1, and Fungl-4ZF, based on sequence similarity, domain organization, and functional similarities. tCWCH2 motifs are mostly found in organisms belonging to the Opisthokonta (metazoa, fungi, and choanoflagellates) and Amoebozoa (amoeba, Dictyostelium discoideum). By comparison, the C2H2 ZF motif is distributed widely among the eukaryotes. The structure and organization of the tCWCH2 motif, its phylogenetic distribution, and molecular phylogenetic analysis suggest that prototypical tCWCH2 genes existed in the Opisthokonta ancestor. Within-group or between-group comparisons of the tCWCH2 amino acid sequence identified three additional sequence features (site-specific amino acid frequencies, longer linker sequence between two C2H2 ZFs, and frequent extra-sequences within C2H2 ZF motifs).
These features suggest that the tCWCH2 motif is a specialized motif involved in inter-zinc finger interactions.
C2H2 锌指 (ZF) 结构域在真核蛋白中广泛保守。在 Zic/Gli/Zap1 C2H2 ZF 蛋白中,两个 N 端 ZF 通过共享一个疏水性核心形成单个结构单元。这个结构单元定义了一个新的基序,由疏水性核心中心的两个色氨酸侧链组成。由于每个色氨酸残基位于 C2H2 基序的两个半胱氨酸残基之间,我们将这个结构命名为串联 CWCH2(tCWCH2)基序。
在这里,我们使用公共数据库中的数据对包含 587 个 tCWCH2 的基因进行了特征描述。我们根据序列相似性、结构域组织和功能相似性,将基因分为 11 类,包括 Zic/Gli/Glis、Arid2/Rsc9、PacC、Mizf、Aebp2、Zap1/ZafA、Fungl、Zfp106、Twincl、Clr1 和 Fungl-4ZF。tCWCH2 基序主要存在于后生动物(动物、真菌和领鞭毛虫)和变形动物(变形虫、Dictyostelium discoideum)中。相比之下,C2H2 ZF 基序广泛分布于真核生物中。tCWCH2 基序的结构和组织、其系统发育分布以及分子系统发育分析表明,原型 tCWCH2 基因存在于后生动物祖先中。对 tCWCH2 氨基酸序列的组内或组间比较确定了另外三个序列特征(特定位点的氨基酸频率、两个 C2H2 ZF 之间更长的连接序列以及 C2H2 ZF 基序内频繁的额外序列)。
这些特征表明,tCWCH2 基序是一个专门的基序,参与锌指之间的相互作用。