Zuo Zheng
Shenzhen University, Shenzhen, China.
NAR Genom Bioinform. 2023 Sep 6;5(3):lqad078. doi: 10.1093/nargab/lqad078. eCollection 2023 Sep.
To defend against the invasion of transposons, hundreds of KRAB-zinc finger genes (ZNFs) evolved to recognize and silence various repeat families specifically. However, most repeat elements reside in the human genome with high copy numbers, making the ChIP-seq reads of ZNFs targeting these repeats predominantly multi-mapping reads. This complicates downstream data analysis and signal quantification. To better visualize and quantify the arms race between transposons and ZNFs, the R package TECookbook has been developed to lift ChIP-seq data into reference repeat coordinates with proper normalization and extract all putative ZNF binding sites from defined loci of reference repeats for downstream analysis. In conjunction with specificity profiles derived from Spec-seq data, human ZNF10 has been found to bind to a conserved ORF2 locus of selected LINE-1 subfamilies. This provides insight into how LINE-1 evaded capture at least twice and was subsequently recaptured by ZNF10 during evolutionary history. Through similar analyses, ZNF382 and ZNF248 were shown to be broad-spectrum LINE-1 binders. Overall, this work establishes a general analysis workflow to decipher the arms race between ZNFs and transposons through nucleotide substitutions rather than structural variations, particularly in the protein-coding region of transposons.
为抵御转座子的入侵,数百个KRAB-锌指基因(ZNFs)进化而来,以特异性识别并沉默各种重复序列家族。然而,大多数重复元件以高拷贝数存在于人类基因组中,使得靶向这些重复序列的ZNFs的ChIP-seq reads主要是多映射reads。这使得下游数据分析和信号定量变得复杂。为了更好地可视化和量化转座子与ZNFs之间的军备竞赛,已开发出R包TECookbook,将ChIP-seq数据提升到具有适当归一化的参考重复序列坐标中,并从参考重复序列的定义位点提取所有推定的ZNF结合位点用于下游分析。结合从Spec-seq数据得出的特异性概况,已发现人类ZNF10与选定LINE-1亚家族的保守ORF2位点结合。这为LINE-1在进化历史中如何至少两次逃避捕获并随后被ZNF10重新捕获提供了见解。通过类似分析,ZNF382和ZNF248被证明是广谱LINE-1结合蛋白。总体而言,这项工作建立了一个通用的分析工作流程,以通过核苷酸取代而非结构变异来解读ZNFs与转座子之间的军备竞赛,特别是在转座子的蛋白质编码区域。