基于化学图的核小体定位预测,使用 Bioconductor 包 nuCpos。
Chemical map-based prediction of nucleosome positioning using the Bioconductor package nuCpos.
机构信息
Department of Biochemistry, Shimane University School of Medicine, 89-1 Enya-cho, Izumo, Shimane, 693-8501, Japan.
Department of Chemistry, Graduate School of Science and Engineering, Program in Chemistry and Life Science, School of Science and Engineering, Meisei University, 2-1-1 Hodokubo, Hino, Tokyo, 191-8506, Japan.
出版信息
BMC Bioinformatics. 2021 Jun 13;22(1):322. doi: 10.1186/s12859-021-04240-2.
BACKGROUND
Assessing the nucleosome-forming potential of specific DNA sequences is important for understanding complex chromatin organization. Methods for predicting nucleosome positioning include bioinformatics and biophysical approaches. An advantage of bioinformatics methods, which are based on in vivo nucleosome maps, is the use of natural sequences that may contain previously unknown elements involved in nucleosome positioning in vivo. The accuracy of such prediction attempts reflects the genomic coordinate resolution of the nucleosome maps applied. Nucleosome maps are constructed using micrococcal nuclease digestion followed by high-throughput sequencing (MNase-seq). However, as MNase has a strong preference for A/T-rich sequences, MNase-seq may not be appropriate for this purpose. In addition to MNase-seq-based maps, base pair-resolution chemical maps of in vivo nucleosomes from three different species (budding and fission yeasts, and mice) are currently available. However, these chemical maps have yet to be integrated into publicly available computational methods.
RESULTS
We developed a Bioconductor package (named nuCpos) to demonstrate the superiority of chemical maps in predicting nucleosome positioning. The accuracy of chemical map-based prediction in rotational settings was higher than that of the previously developed MNase-seq-based approach. With our method, predicted nucleosome occupancy reasonably matched in vivo observations and was not affected by A/T nucleotide frequency. Effects of genetic alterations on nucleosome positioning that had been observed in living yeast cells could also be predicted. nuCpos calculates individual histone binding affinity (HBA) scores for given 147-bp sequences to examine their suitability for nucleosome formation. We also established local HBA as a new parameter to predict nucleosome formation, which was calculated for 13 overlapping nucleosomal DNA subsequences. HBA and local HBA scores for various sequences agreed well with previous in vitro and in vivo studies. Furthermore, our results suggest that nucleosomal subsegments that are disfavored in different rotational settings contribute to the defined positioning of nucleosomes.
CONCLUSIONS
Our results demonstrate that chemical map-based statistical models are beneficial for studying nucleosomal DNA features. Studies employing nuCpos software can enhance understanding of chromatin regulation and the interpretation of genetic alterations and facilitate the design of artificial sequences.
背景
评估特定 DNA 序列的核小体形成潜力对于理解复杂的染色质结构至关重要。预测核小体定位的方法包括生物信息学和生物物理方法。基于体内核小体图谱的生物信息学方法的一个优势是使用可能包含体内核小体定位中先前未知的元素的天然序列。这种预测尝试的准确性反映了所应用的核小体图谱的基因组坐标分辨率。核小体图谱是通过微球菌核酸酶消化后进行高通量测序(MNase-seq)构建的。然而,由于 MNase 对 A/T 丰富的序列有很强的偏好,MNase-seq 可能不适合用于此目的。除了基于 MNase-seq 的图谱外,目前还可获得来自三种不同物种(芽殖酵母和裂殖酵母以及小鼠)的体内核小体的碱基对分辨率的化学图谱。然而,这些化学图谱尚未整合到可用的计算方法中。
结果
我们开发了一个 Bioconductor 软件包(命名为 nuCpos),以证明化学图谱在预测核小体定位方面的优越性。在旋转设置下,基于化学图谱的预测的准确性高于先前开发的基于 MNase-seq 的方法。使用我们的方法,预测的核小体占有率与体内观察结果相当吻合,并且不受 A/T 核苷酸频率的影响。在活酵母细胞中观察到的遗传改变对核小体定位的影响也可以预测。nuCpos 为给定的 147 个碱基对序列计算单个组蛋白结合亲和力(HBA)得分,以检查它们适合核小体形成的程度。我们还将局部 HBA 作为预测核小体形成的新参数,该参数是针对 13 个重叠核小体 DNA 子序列计算的。各种序列的 HBA 和局部 HBA 得分与先前的体外和体内研究吻合良好。此外,我们的结果表明,在不同旋转设置下不被青睐的核小体亚片段有助于核小体的定义定位。
结论
我们的结果表明,基于化学图谱的统计模型有助于研究核小体 DNA 特征。使用 nuCpos 软件进行的研究可以增强对染色质调控的理解以及对遗传改变的解释,并有助于人工序列的设计。