Neonatal/Congenital Heart Laboratory, Cardiovascular Research Laboratories, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA.
Department of Pediatrics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA.
Cells. 2023 Dec 8;12(24):2805. doi: 10.3390/cells12242805.
Long non-coding RNA (lncRNA) mediated transcriptional regulation is increasingly recognized as an important gene regulatory mechanism during development and disease. LncRNAs are emerging as critical regulators of chromatin state; yet the nature and the extent of their interactions with chromatin remain to be fully revealed. We have previously identified as an essential epigenetic regulator of myogenic differentiation in cardiac and skeletal myocytes in mice and humans. We further demonstrated that function is mediated by the interaction with the chromatin-modifying complex polycomb repressive complex 2 (PRC2) at the promoter of myogenic differentiation transcription factors, and . Herein, we employed unbiased chromatin isolation by RNA purification (ChIRP) and high throughput sequencing to map the repertoire of chromatin occupancy genome-wide in the mouse muscle myoblast cell line. We uncovered a total of 99732 true peaks corresponding to binding sites at high confidence (-value < 1E-5) and enrichment score ≥ 10). The -binding sites averaged 558 bp in length and were distributed widely within the coding and non-coding regions of the genome. Approximately 46% of these true peaks were mapped to gene elements, of which 1180 were mapped to experimentally validated promoter sequences. Importantly, the promoter-mapped binding sites were enriched in myogenic transcription factors and heart development while exhibiting focal interactions with known motifs of proximal promoters and transcription initiation by RNA Pol-II, including TATA-box, transcription initiator motif, CCAAT-box, and GC-box, supporting role in transcription initiation of myogenic regulators. Remarkably, nearly 40% of -binding sites mapped to gene introns were enriched with the Homeobox family of transcription factors and exhibited TA-rich motif sequences, suggesting potential motif-specific -bound introns. Lastly, more than 136521 enhancer sequences were detected in -occupancy sites at high confidence. Among these enhancers, 3390 (12%) exhibited cell type/tissue-specific enrichment in fetal heart and muscles. Together, our findings provide further insights into the genome-wide Chromatin interactome that may dictate its function in myogenic differentiation and potentially other cellular and biological processes.
长非编码 RNA(lncRNA)介导的转录调控越来越被认为是发育和疾病过程中重要的基因调控机制。lncRNA 正在成为染色质状态的关键调节因子;然而,它们与染色质相互作用的性质和程度仍有待充分揭示。我们之前已经确定 在小鼠和人类的心脏和骨骼肌成肌细胞的肌生成分化中是一种必不可少的表观遗传调节因子。我们进一步证明 的功能是通过与染色质修饰复合物多梳抑制复合物 2(PRC2)在肌生成分化转录因子的启动子上的相互作用来介导的, 和. 在此,我们采用无偏倚的 RNA 纯化染色质分离(ChIRP)和高通量测序技术,在小鼠肌肉成肌细胞系中对 全基因组的染色质占有率进行了图谱绘制。我们总共发现了 99732 个真正的峰,对应于高可信度(-值<1E-5)和富集分数≥10)的 结合位点。 结合位点的平均长度为 558bp,分布在基因组的编码和非编码区域内。这些真正的峰中约有 46%映射到基因元件上,其中 1180 个映射到实验验证的启动子序列上。重要的是,映射到启动子的结合位点富含肌生成转录因子和心脏发育,同时与已知的近端启动子和 RNA Pol-II 转录起始的 motif 表现出焦点相互作用,包括 TATA 盒、转录起始子 motif、CCAAT 盒和 GC 盒,支持 在肌生成调节剂的转录起始中的作用。值得注意的是,近 40%的 结合位点映射到基因内含子上,富含同源盒家族转录因子,并表现出 TA 丰富的 motif 序列,表明潜在的 motif 特异性 结合内含子。最后,在 染色质占有率的高可信度检测到超过 136521 个增强子序列。在这些增强子中,3390 个(12%)在胎心和肌肉中表现出细胞类型/组织特异性富集。总之,我们的研究结果提供了对 染色质互作组的更深入了解,这可能决定了它在肌生成分化中的作用,以及潜在的其他细胞和生物学过程中的作用。