Key Laboratory of RNA Biology, CAS Center for Excellence in Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China.
University of Chinese Academy of Sciences, Beijing 100049, China.
Cell Res. 2017 Nov;27(11):1365-1377. doi: 10.1038/cr.2017.131. Epub 2017 Oct 27.
CTCF, a conserved 3D genome architecture protein, determines proper genome-wide chromatin looping interactions through directional binding to specific sequence elements of four modules within numerous CTCF-binding sites (CBSs) by its 11 zinc fingers (ZFs). Here, we report four crystal structures of human CTCF in complex with CBSs of the protocadherin (Pcdh) clusters. We show that directional CTCF binding to cognate CBSs of the Pcdh enhancers and promoters is achieved through inserting its ZF3, ZFs 4-7, and ZFs 9-11 into the major groove along CBSs, resulting in a sequence-specific recognition of module 4, modules 3 and 2, and module 1, respectively; and ZF8 serves as a spacer element for variable distances between modules 1 and 2. In addition, the base contact with the asymmetric "A" in the central position of modules 2-3, is essential for directional recognition of the CBSs with symmetric core sequences but lacking module 1. Furthermore, CTCF tolerates base changes at specific positions within the degenerated CBS sequences, permitting genome-wide CTCF binding to a diverse range of CBSs. Together, these complex structures provide important insights into the molecular mechanisms for the directionality, diversity, flexibility, dynamics, and conservation of multivalent CTCF binding to its cognate sites across the entire human genome.
CTCF 是一种保守的三维基因组结构蛋白,通过其 11 个锌指(ZF)与大量 CTCF 结合位点(CBS)内的四个模块的特定序列元件定向结合,决定了全基因组染色质环相互作用的适当性。在这里,我们报告了人类 CTCF 与原钙粘蛋白(Pcdh)簇的 CBS 复合物的四个晶体结构。我们表明,通过将其 ZF3、ZF4-7 和 ZF9-11 插入 CBS 的大沟中,CTCF 对 Pcdh 增强子和启动子的同源 CBS 的定向结合实现了,从而分别实现了对模块 4、模块 3 和模块 2 以及模块 1 的序列特异性识别;ZF8 作为模块 1 和 2 之间可变距离的间隔元件。此外,与模块 2-3 中心位置的不对称“A”碱基的接触对于具有对称核心序列但缺乏模块 1 的 CBS 的定向识别是必不可少的。此外,CTCF 可以容忍 CBS 序列中特定位置的碱基变化,从而允许 CTCF 在全基因组范围内结合到广泛的 CBS 上。总之,这些复杂的结构为 CTCF 与其同源结合位点的方向性、多样性、灵活性、动力学和保守性的分子机制提供了重要的见解。