†Department of Organic Chemistry I, University of the Basque Country UPV/EHU, Avenida de Tolosa 72, 20018 San Sebastian, Guipúzcoa, Spain.
‡IKERBASQUE, Basque Foundation for Science, 48013 Bilbao, Bizkaia, Spain.
J Chem Inf Model. 2015 Mar 23;55(3):636-44. doi: 10.1021/ci500657d. Epub 2015 Feb 18.
The cell is able to regulate which genes to express via chemical marks on the DNA and on the histone proteins. In all vertebrates, the modification on the DNA is methylation at position 5 of the two cytosines present in the dinucleotide sequence CpG. The information encoded by these chemical marks on the DNA is processed by a family of protein factors containing a conserved methyl-CpG binding domain (MBD). Essential to their function, the MBD proteins are able to bind DNA containing dimethylated CpG sites, whereas binding to unmethylated sites is not observed. In this paper, we perform molecular dynamics simulations to investigate the mechanism by which the mCpG binding domain of MBD1 is able to bind specifically dimethylated CpG sites. We find that the binding affinity of MBD1 to a DNA containing dimethylated CpG site is stronger by 26.4 kJ/mol relative to binding the same DNA but with an unmethylated CpG site. The contribution of each of the methyl groups to the change in free energy is very similar and additive. Therefore, this binding affinity (to a dimethylated DNA) is halved when considered relative to binding a hemimethylated DNA, a result that is also supported by experimental observations. Despite their equal contributions, the two methyl groups are recognized differently by MBD1. In one case, demethylation induces conformational changes in which the hydrophobic patch formed by the conserved residues Val20, Arg22, and Tyr34 moves away from the (methyl)cytosine, weakening the DNA-protein interactions. This is accompanied by an intrusion of a bulk water into the binding site at the protein-DNA interface. As a consequence, there is a reduction and rearrangements of the protein-DNA hydrogen bonds including a loss of a crucial hydrogen bond between Tyr34 and the (methyl)cytosine. The methylcytosine on the opposite strand is recognized by conformational changes of the surrounding conserved hydrophobic residues, Arg44 and Ser45, in which Arg44 participate in the 5mC-Arg-G triad. More specifically, the hydrogens of the methyl group form weak hydrogen bonds with the guanidino group and backbone carbonyl of the conserved Arg44, interactions that are absent when the cytosine is unmethylated. The results presented in this paper contribute to our knowledge of the different ways the chemical mark on the DNA is recognized by the epigenetic machinery.
细胞能够通过 DNA 和组蛋白上的化学标记来调节哪些基因表达。在所有脊椎动物中,DNA 上的修饰是在二核苷酸序列 CpG 中两个胞嘧啶存在的位置 5 上的甲基化。这些 DNA 上化学标记所编码的信息由一组含有保守甲基-CpG 结合域(MBD)的蛋白质因子处理。对于它们的功能至关重要的是,MBD 蛋白能够结合含有二甲基化 CpG 位点的 DNA,而未观察到与未甲基化位点的结合。在本文中,我们进行了分子动力学模拟,以研究 MBD1 的 mCpG 结合域能够特异性结合二甲基化 CpG 位点的机制。我们发现,MBD1 与含有二甲基化 CpG 位点的 DNA 的结合亲和力比与相同 DNA 但含有未甲基化 CpG 位点的 DNA 的结合亲和力强 26.4 kJ/mol。每个甲基基团对自由能变化的贡献非常相似且可加。因此,当相对于结合半甲基化 DNA 时,这种结合亲和力(对二甲基化 DNA)减半,这一结果也得到了实验观察的支持。尽管它们的贡献相等,但两个甲基基团被 MBD1 以不同的方式识别。在一种情况下,去甲基化会诱导构象变化,其中由保守残基 Val20、Arg22 和 Tyr34 形成的疏水区从(甲基)胞嘧啶移动,从而削弱 DNA-蛋白质相互作用。这伴随着大量水分子侵入蛋白质-DNA 界面的结合位点。结果,蛋白质-DNA 氢键发生减少和重排,包括 Tyr34 与(甲基)胞嘧啶之间关键氢键的丧失。相反链上的甲基胞嘧啶被周围保守疏水性残基 Arg44 和 Ser45 的构象变化识别,其中 Arg44 参与 5mC-Arg-G 三联体。更具体地说,甲基的氢与保守 Arg44 的胍基和骨架羰基形成弱氢键,当胞嘧啶未甲基化时,这些相互作用不存在。本文的研究结果有助于我们了解 DNA 上的化学标记被表观遗传机制识别的不同方式。