Li Wentian, Sosa Daniela, Jose Marco V
The Robert S. Boas Center for Genomics and Human Genetics, The Feinstein Institute for Medical Research, North Shore LIJ Health System, 350 Community Drive, Manhasset, NY 11030, USA.
Facultad de Ciencias, Universidad Nacional Autónoma de México, México 04510 DF, Mexico; Centro de Ciencias de la Complejidad, Universidad Nacional Autónoma de México, México 04510 DF, Mexico.
Genomics. 2013 Feb;101(2):125-33. doi: 10.1016/j.ygeno.2012.10.005. Epub 2012 Nov 5.
We examined statistical correlations between the frequencies of seven proposed nucleosome positioning motifs and the densities of repetitive sequences in the human genome. Under both parametric and non-parametric measures of correlation, repetitive sequence density tends to be negatively correlated with the density of R/Y-based nucleosome positioning motifs and positively correlated with that of W/S-based motifs. These results largely hold even when motifs are examined only within repeat-filtered sequences. The RRRRRYYYYY motif and its 5-base shift YYYYYRRRRR, in particular, are over-represented in the human genome, and their negative correlation with repeat density is consistently present across different regions and length scales. For some other nucleosome positioning motifs, the relationship with repeats can be region dependent or length-scale dependent. Considering the importance of nucleosome formation in epigenetic regulation, these results may provide new insight into the evolution of repetitive sequences.
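The kind of comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline. It assumes that the sequence is soft-masked (repeats in lowercase, a common convention in genome FASTA files), that motif density is the number of overlapping matches per base in non-overlapping windows, and that Pearson and Spearman coefficients stand in for the "parametric and non-parametric" measures. All function names are hypothetical.

```python
import re
from statistics import mean

# IUPAC ambiguity codes used by the motifs: R = purine, Y = pyrimidine,
# W = weak (A/T), S = strong (C/G).
IUPAC = {"R": "[AG]", "Y": "[CT]", "W": "[AT]", "S": "[CG]"}

def motif_regex(motif):
    """Compile an IUPAC motif into a regex that finds overlapping matches."""
    body = "".join(IUPAC.get(ch, ch) for ch in motif)
    # The lookahead makes matches zero-width, so overlapping hits are counted.
    return re.compile(f"(?=({body}))", re.IGNORECASE)

def motif_density(window, pattern):
    """Overlapping motif matches per base in one window."""
    return len(pattern.findall(window)) / len(window)

def repeat_density(window):
    """Fraction of lowercase bases (assumption: soft-masked repeats)."""
    return sum(ch.islower() for ch in window) / len(window)

def windows(seq, size):
    """Yield non-overlapping windows; a short trailing piece is dropped."""
    for start in range(0, len(seq) - size + 1, size):
        yield seq[start:start + size]

def pearson(x, y):
    """Pearson correlation; undefined (division by zero) if x or y is constant."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5

def ranks(values):
    """1-based ranks, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=values.__getitem__)
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average
        i = j + 1
    return result

def spearman(x, y):
    """Spearman correlation, computed as Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))

def correlate(seq, motif, size):
    """Return (Pearson, Spearman) between motif and repeat density per window."""
    pattern = motif_regex(motif)
    wins = list(windows(seq, size))
    motif_d = [motif_density(w, pattern) for w in wins]
    repeat_d = [repeat_density(w) for w in wins]
    return pearson(motif_d, repeat_d), spearman(motif_d, repeat_d)
```

Calling `correlate(seq, "RRRRRYYYYY", size)` with several window sizes would give a rough analogue of the length-scale comparison mentioned above. Matching is case-insensitive so that motifs inside masked repeats are still counted; restricting the count to uppercase bases would approximate the repeat-filtered variant.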