Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA.
Genome Res. 2011 Nov;21(11):1800-12. doi: 10.1101/gr.121749.111. Epub 2011 Jul 22.
Vertebrate genomes encode large and highly variable numbers of tandem C2H2 zinc finger (tandem ZF) transcription factor proteins. In mammals, most tandem ZF genes also encode a KRAB domain (KZNF proteins). Very little is known about what forces have driven the number and diversity of tandem ZF genes. Recent studies suggest that one role of KZNF proteins is to bind and repress transcription of exogenous retroviruses and their endogenous counterpart LTR retroelements. We report a striking correlation across vertebrate genomes between the number of LTR retroelements and the number of host tandem ZF genes. This correlation is specific to LTR retroelements and ZF genes and was not explained by covariation in other genomic features. We further show that recently active LTR retroelements are correlated with recent tandem ZF gene duplicates across vertebrates. On branches of the primate phylogeny, we find that the appearance of new families of endogenous retroviruses is strongly predictive of the appearance of new duplicate KZNF genes. We hypothesize that retroviral and LTR retroelement burden drives evolution of host tandem ZF genes. This hypothesis is consistent with previously described molecular evolutionary patterns in duplicate ZF genes throughout vertebrates. To further explore these patterns, we investigated 34 duplicate human KZNF gene pairs, all of which underwent an early burst of divergence in the major nucleotide contact residues of their ZF domains, followed by purifying selection in both duplicates. Our results support a host-pathogen model for tandem ZF gene evolution, in which new LTR retroelement challenges drive duplication and divergence of host tandem ZF genes.
脊椎动物基因组编码大量数量和高度变异的串联 C2H2 锌指(串联 ZF)转录因子蛋白。在哺乳动物中,大多数串联 ZF 基因还编码 KRAB 结构域(KZNF 蛋白)。对于是什么力量驱动了串联 ZF 基因的数量和多样性,人们知之甚少。最近的研究表明,KZNF 蛋白的一个作用是结合并抑制外源性逆转录病毒及其内源性对应物 LTR 反转录元件的转录。我们报告了在脊椎动物基因组中,LTR 反转录元件的数量与宿主串联 ZF 基因的数量之间存在显著相关性。这种相关性是特定于 LTR 反转录元件和 ZF 基因的,不能用其他基因组特征的共变来解释。我们进一步表明,最近活跃的 LTR 反转录元件与脊椎动物中最近的串联 ZF 基因重复相关。在灵长类动物进化枝的分支上,我们发现新的内源性逆转录病毒家族的出现强烈预示着新的 KZNF 基因重复的出现。我们假设逆转录病毒和 LTR 反转录元件负担驱动宿主串联 ZF 基因的进化。这一假设与以前在整个脊椎动物中描述的重复 ZF 基因的分子进化模式一致。为了进一步探讨这些模式,我们研究了 34 对重复的人类 KZNF 基因对,它们的 ZF 结构域的主要核苷酸接触残基都经历了早期的分歧爆发,随后在两个重复中都经历了纯化选择。我们的结果支持了串联 ZF 基因进化的宿主-病原体模型,其中新的 LTR 反转录元件的挑战推动了宿主串联 ZF 基因的复制和分歧。