Peckham Heather E, Thurman Robert E, Fu Yutao, Stamatoyannopoulos John A, Noble William Stafford, Struhl Kevin, Weng Zhiping
Bioinformatics Program, Boston University, Boston, MA 02215, USA.
Genome Res. 2007 Aug;17(8):1170-7. doi: 10.1101/gr.6101007. Epub 2007 Jul 9.
Although histones can form nucleosomes on virtually any genomic sequence, DNA sequences show considerable variability in their binding affinity. We have used DNA sequences of Saccharomyces cerevisiae whose nucleosome binding affinities have been experimentally determined (Yuan et al. 2005) to train a support vector machine to identify the nucleosome formation potential of any given sequence of DNA. The DNA sequences whose nucleosome formation potential is most accurately predicted are those that contain strong nucleosome-forming or nucleosome-inhibiting signals and are found within nucleosome-length stretches of genomic DNA with continuous nucleosome formation or inhibition signals. We have accurately predicted the experimentally determined nucleosome positions across a well-characterized promoter region of S. cerevisiae and identified strong periodicity within 199 center-aligned mononucleosomes studied recently (Segal et al. 2006), even though no periodicity information was used to train the support vector machine. Our analysis suggests that only a subset of nucleosomes are likely to be positioned by intrinsic sequence signals. This observation is consistent with the available experimental data and is inconsistent with the proposal of a nucleosome positioning code. Finally, we show that intrinsic nucleosome positioning signals are both more inhibitory and more variable in promoter regions than in open reading frames in S. cerevisiae.
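The abstract describes training a support vector machine on sequences of known nucleosome affinity and then scoring arbitrary DNA, but it does not state the feature representation, kernel, or window size. The following is a minimal sketch of that general idea under loudly stated assumptions: features are strand-symmetric dinucleotide frequencies, the classifier is a linear SVM without a bias term fit by Pegasos-style subgradient descent (a stand-in, not necessarily the authors' method), and the training sequences and their labels are made-up toy placeholders, not data from Yuan et al. (2005). All function names (`kmer_features`, `train_linear_svm`, `formation_score`, `scan`) and all parameter values are hypothetical.

```python
from collections import Counter

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def canonical(kmer):
    """Return the lexicographically smaller of a k-mer and its reverse complement."""
    return min(kmer, kmer.translate(COMPLEMENT)[::-1])


def kmer_features(seq, k=2):
    """Map a DNA sequence to strand-symmetric k-mer frequencies (assumed features)."""
    seq = seq.upper()
    n = len(seq) - k + 1
    if n <= 0:
        raise ValueError("sequence is shorter than k")
    counts = Counter(canonical(seq[i:i + k]) for i in range(n))
    return {kmer: c / n for kmer, c in counts.items()}


def dot(w, x):
    """Sparse dot product between a weight dict and a feature dict."""
    return sum(w.get(f, 0.0) * v for f, v in x.items())


def train_linear_svm(examples, lam=0.01, epochs=50):
    """Pegasos-style subgradient descent on the L2-regularized hinge loss.

    examples: list of (feature_dict, label) pairs with label +1 or -1.
    Examples are visited in a fixed order so the run is reproducible.
    """
    w = {}
    t = 0
    for _ in range(epochs):
        for x, y in examples:
            t += 1
            eta = 1.0 / (lam * t)               # decaying step size
            violated = y * dot(w, x) < 1.0      # margin check uses the current w
            shrink = 1.0 - 1.0 / t              # equals 1 - eta * lam
            for f in w:
                w[f] *= shrink                  # regularization shrinkage
            if violated:
                for f, v in x.items():
                    w[f] = w.get(f, 0.0) + eta * y * v
    return w


def formation_score(seq, w, k=2):
    """Signed score: positive leans 'forming', negative leans 'inhibiting'."""
    return dot(w, kmer_features(seq, k))


def scan(seq, w, window=20, step=5, k=2):
    """Score overlapping windows along a longer sequence; returns (start, score) pairs."""
    return [(i, formation_score(seq[i:i + window], w, k))
            for i in range(0, len(seq) - window + 1, step)]


# Toy placeholder data: labels are invented for illustration only.
forming = ["GCGGCCGCGGCGCCGGCGCC", "CCGCGGCGGCCGCGCCGGCG"]
inhibiting = ["AAAAAAATTTTTTTAAAAAA", "TTTTTAAAAATATATAAAAA"]

examples = []
for pos, neg in zip(forming, inhibiting):
    examples.append((kmer_features(pos), +1))
    examples.append((kmer_features(neg), -1))

w = train_linear_svm(examples)
profile = scan("GCGC" * 10 + "AATT" * 10, w)
```

Here `profile` holds one signed score per window, so a run of positive values followed by negative values would mark a transition from a region the toy model calls forming to one it calls inhibiting. With real data, the window length, the k-mer size, and the regularization strength `lam` would all need to be chosen by cross-validation rather than fixed as above.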