Mondo Stephen J, He Guifen, Sharma Aditi, Ciobanu Doina, Riley Robert, Andreopoulos William B, Lipzen Anna, Kuo Alan, LaButti Kurt, Pangilinan Jasmyn, Salamov Asaf, Salamon Hugh, Shu Lili, Gladden John, Magnuson Jon, Aime M Catherine, O'Malley Ronan, Grigoriev Igor V
US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA.
Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA.
iScience. 2025 Apr 18;28(5):112472. doi: 10.1016/j.isci.2025.112472. eCollection 2025 May 16.
Nucleosomes are the basic repeating unit, each spanning ≈150bp, that structures DNA in the nucleus and their positions have major consequences on gene activity. Here, through analyzing DNA signatures across 1117 microeukaryote genomes, we discovered ≈150bp shifts in A/T content associated with nucleosome organization. Often consecutively arrayed across the genome, A/T peaks were enriched surrounding transcriptional start sites in specific clades. Most nucleosomes (both and ) across eukaryotes aligned with A/T peaks, even in the presence of DNA modifications. Using artificial intelligence-based approaches, we describe DNA features associated with nucleosomes and construct a deep learning (DL) model for improved nucleosome occupancy prediction. Using this model, we found that ≈70% of "random" transfer DNA inserts from an fungal RB-TDNAseq library avoided DL predicted nucleosome-bound regions. This study reveals a eukaryote-wide strategy for generating cassettes of nucleosome-favorable DNAs that has a profound impact on nucleosome organization.
核小体是基本的重复单元,每个核小体跨度约为150个碱基对,它构成细胞核中的DNA结构,其位置对基因活性有重大影响。在这里,通过分析1117个微真核生物基因组的DNA特征,我们发现了与核小体组织相关的A/T含量约150个碱基对的偏移。A/T峰通常在基因组中连续排列,在特定进化枝的转录起始位点周围富集。即使存在DNA修饰,真核生物中的大多数核小体(包括 和 )都与A/T峰对齐。使用基于人工智能的方法,我们描述了与核小体相关的DNA特征,并构建了一个深度学习(DL)模型来改进核小体占据预测。使用这个模型,我们发现来自一个真菌RB-TDNAseq文库的约70%的“随机”转移DNA插入片段避开了DL预测的核小体结合区域。这项研究揭示了一种全真核生物范围的策略,用于生成对核小体有利的DNA盒,这对核小体组织有深远影响。