Laboratory of Cell Biology, National Cancer Institute, NIH, Bethesda, MD 20892, USA.
J Biomol Struct Dyn. 2010 Jun;27(6):821-41. doi: 10.1080/073911010010524947.
Recent studies of genome-wide nucleosomal organization suggest that the DNA sequence is one of the major determinants of nucleosome positioning. Although the search for underlying patterns encoded in nucleosomal DNA has been going on for about 30 years, our knowledge of these patterns still remains limited. Based on our evaluations of DNA deformation energy, we developed new scoring functions to predict nucleosome positioning. There are three principal differences between our approach and earlier studies: (i) we assume that the length of nucleosomal DNA varies from 146 to 147 bp; (ii) we consider the anisotropic flexibility of pyrimidine-purine (YR) dimeric steps in the context of their neighbors (e.g., YYRR versus RYRY); (iii) we postulate that alternating AT-rich and GC-rich motifs reflect sequence-dependent interactions between histone arginines and DNA in the minor groove. Using these functions, we analyzed 20 nucleosome positions mapped in vitro at single nucleotide resolution (including clones 601, 603, 605, the pGUB plasmid, chicken beta-globin and three 5S rDNA genes). We predicted 15 of the 20 positions with 1-bp precision, and two positions with 2-bp precision. The predicted position of the '601' nucleosome (i.e., the optimum of the computed score) deviates from the experimentally determined unique position by no more than 1 bp - an accuracy exceeding that of earlier predictions. Our analysis reveals a clear heterogeneity of the nucleosomal sequences which can be divided into two groups based on the positioning 'rules' they follow. The sequences of one group are enriched by highly deformable YR/YYRR motifs at the minor-groove bending sites SHL+/- 3.5 and +/- 5.5, which is similar to the alpha-satellite sequence used in most crystallized nucleosomes. Apparently, the positioning of these nucleosomes is determined by the interactions between histones H2A/H2B and the terminal parts of nucleosomal DNA. In the other group (that includes the '601' clone) the same YR/YYRR motifs occur predominantly at the sites SHL +/- 1.5. The interaction between the H3/H4 tetramer and the central part of the nucleosomal DNA is likely to be responsible for the positioning of nucleosomes of this group, and the DNA trajectory in these nucleosomes may differ in detail from the published structures. Thus, from the stereochemical perspective, the in vitro nucleosomes studied here follow either an X-ray-like pattern (with strong deformations in the terminal parts of nucleosomal DNA), or an alternative pattern (with the deformations occurring predominantly in the central part of the nucleosomal DNA). The results presented here may be useful for genome-wide classification of nucleosomes, linking together structural and thermodynamic characteristics of nucleosomes with the underlying DNA sequence patterns guiding their positions.
最近对全基因组核小体组织的研究表明,DNA 序列是核小体定位的主要决定因素之一。尽管人们一直在寻找核小体 DNA 中编码的潜在模式,但我们对这些模式的了解仍然有限。基于我们对 DNA 变形能的评估,我们开发了新的评分函数来预测核小体定位。与早期研究相比,我们的方法有三个主要区别:(i)我们假设核小体 DNA 的长度从 146 到 147bp 不等;(ii)我们考虑嘧啶-嘌呤(YR)二聚体在其相邻位置的各向异性柔性(例如,YYRR 与 RYRY);(iii)我们假设交替的富含 AT 和富含 GC 的基序反映了组蛋白精氨酸与 DNA 之间在小沟中的序列依赖性相互作用。使用这些函数,我们分析了在体外以单核苷酸分辨率映射的 20 个核小体位置(包括克隆 601、603、605、pGUB 质粒、鸡β-球蛋白和三个 5S rDNA 基因)。我们以 1bp 的精度预测了 20 个位置中的 15 个,以 2bp 的精度预测了 2 个。“601”核小体的预测位置(即计算得分的最优位置)与实验确定的唯一位置相差不超过 1bp-这一精度超过了早期的预测。我们的分析揭示了核小体序列的明显异质性,这些序列可以根据它们所遵循的定位“规则”分为两组。一组序列在小沟弯曲位点 SHL+/-3.5 和 +/-5.5 处富含高度可变形的 YR/YYRR 基序,这与大多数结晶核小体中使用的α卫星序列相似。显然,这些核小体的定位是由组蛋白 H2A/H2B 与核小体 DNA 末端部分之间的相互作用决定的。在另一组(包括“601”克隆)中,相同的 YR/YYRR 基序主要出现在 SHL +/-1.5 位点。H3/H4 四聚体与核小体 DNA 中心部分之间的相互作用可能是导致该组核小体定位的原因,并且这些核小体中的 DNA 轨迹可能在细节上与已发表的结构不同。因此,从立体化学的角度来看,这里研究的体外核小体遵循 X 射线样模式(核小体 DNA 末端部分有强烈变形)或替代模式(核小体 DNA 中心部分主要发生变形)。这里呈现的结果可能有助于对核小体进行全基因组分类,将核小体的结构和热力学特征与指导其位置的潜在 DNA 序列模式联系起来。