Matsuno Yoko, Yamashita Takefumi, Wagatsuma Michiru, Yamakage Hajime
1Division of Clinical Preventive Medicine, Niigata University, Niigata, Japan.
2Laboratory for Systems Biology and Medicine, Research Center for Advanced Science and Technology, The University of Tokyo, Tokyo, Japan.
Mob DNA. 2019 Jul 30;10:33. doi: 10.1186/s13100-019-0173-4. eCollection 2019.
Associations between X-inactive transcript (Xist)-long noncoding RNA (lncRNA) and chromatin are critical intermolecular interactions in the X-chromosome inactivation (XCI) process. Despite high-resolution analyses of the Xist RNA-binding sites, specific interaction sequences are yet to be identified. Based on elusive features of the association between Xist RNA and chromatin and the possible existence of multiple low-affinity binding sites in Xist RNA, we defined short motifs (≥5 nucleotides), termed as redundant UC/TC (r-UC/TC) or AG (r-AG) motifs, which may help in the mediation of triplex formation between the lncRNAs and duplex DNA.
The study showed that r-UC motifs are densely dispersed throughout mouse and human Xist/XIST RNAs, whereas r-AG motifs are even more densely dispersed along opossum RNA-on-the-silent X (Rsx) RNA, and also along both full-length and truncated long interspersed nuclear elements (LINE-1s, L1s) of the three species. Predicted secondary structures of the lncRNAs showed that the length range of these sequence motifs available for forming triplexes was even shorter, mainly 5- to 9-nucleotides long. Quartz crystal microbalance (QCM) measurements and Monte Carlo (MC) simulations indicated that minimum-length motifs can reinforce the binding state by increasing the copy number of the motifs in the same RNA or DNA molecule. Further, r-AG motifs in L1s had a similar length-distribution pattern, regardless of the similarities in the length or sequence of L1s across the three species; this also applies to high-frequency mutations in r-AG motifs, which suggests convergence in L1 sequence variations.
Multiple short motifs in both RNA and duplex DNA molecules could be brought together to form triplexes with either Hoogsteen or reverse Hoogsteen hydrogen bonding, by which their associations are cooperatively enhanced. This novel triplex interaction could be involved in associations between lncRNA and chromatin in XCI, particularly at the sites of L1s. Potential binding of Xist/XIST/Rsx RNAs specifically at L1s is most likely preserved through the r-AG motifs conserved in mammalian L1s through convergence in L1 nucleotide variations and by maintaining a particular r-UC/r-AG motif ratio in each of these lncRNAs, irrespective of their poorly conserved sequences.
X染色体失活转录本(Xist)-长链非编码RNA(lncRNA)与染色质之间的关联是X染色体失活(XCI)过程中关键的分子间相互作用。尽管对Xist RNA结合位点进行了高分辨率分析,但尚未确定特定的相互作用序列。基于Xist RNA与染色质之间关联的难以捉摸的特征以及Xist RNA中可能存在多个低亲和力结合位点,我们定义了短基序(≥5个核苷酸),称为冗余UC/TC(r-UC/TC)或AG(r-AG)基序,它们可能有助于介导lncRNAs与双链DNA之间形成三链体。
研究表明,r-UC基序密集分布于小鼠和人类的Xist/XIST RNA中,而r-AG基序在负鼠沉默X染色体上的RNA(Rsx)RNA中分布更为密集,在这三个物种的全长和截短的长散在核元件(LINE-1s,L1s)中也是如此。lncRNAs的预测二级结构表明,这些可用于形成三链体的序列基序的长度范围甚至更短,主要为5至9个核苷酸长。石英晶体微天平(QCM)测量和蒙特卡罗(MC)模拟表明,最短长度的基序可通过增加同一RNA或DNA分子中基序的拷贝数来加强结合状态。此外,L1s中的r-AG基序具有相似的长度分布模式,无论这三个物种的L1s在长度或序列上是否相似;这也适用于r-AG基序中的高频突变,这表明L1序列变异存在趋同现象。
RNA和双链DNA分子中的多个短基序可以聚集在一起,通过Hoogsteen或反向Hoogsteen氢键形成三链体,从而协同增强它们之间的关联。这种新型的三链体相互作用可能参与XCI中lncRNA与染色质之间的关联,特别是在L1s位点。Xist/XIST/Rsx RNAs在L1s处的潜在结合很可能通过哺乳动物L1s中保守的r-AG基序得以保留,这种保留是通过L1核苷酸变异的趋同以及在每个lncRNA中维持特定的r-UC/r-AG基序比例实现的,而不考虑它们保守性较差的序列。