Wu Xudong, Liu Tingting
School of Engineering, Guangzhou College of Technology and Business, Guangzhou, China.
Front Genet. 2025 Jun 23;16:1544802. doi: 10.3389/fgene.2025.1544802. eCollection 2025.
The DNA replication of eukaryotes proceeds in a defined temporal sequence known as the replication timing (RT) program. A recent study revealed that the early- and late-replication temporal domains have different DNA mutation patterns and that the late-replicating sequences have a substitution pattern biased towards A and T. It raises the interesting question of how the miRNAs in the late-replication domain cope with the mutation bias caused by RT.
In this study, we characterized the genomic distribution of pre-miRNAs in relation to DNA replication timing, and identified 362 pre-miRNAs within late-replicating domains (late-miRNAs) and 631 pre-miRNAs within early-replicating domains (early-miRNAs). We comprehensively examined the multiple molecular features including the secondary structural properties, the genomic sequences surrounding the pre-miRNA loci, the Dicer processing motifs, and CAGE tag-based promoters and miRNAs expression profiles. Furthermore, we performed the simulation of miRNA-target regulatory networks to elucidate the co-regulation patterns among late-miRNAs. To advance predictive capabilities, we developed a a support vector machine (SVM) classifier based on RNA-FM embedding, enabling prediction of miRNAs' replication timing domains.
Our study indicated that the late pre-miRNAs maintained their ability to fold into hairpin structures through extending their lengths at both ends under the premise of maintaining a certain GC content of the precursors. The simulation demonstrated that the late-miRNAs tend to synergistically regulate the same genes and are involved in small molecule metabolism, immune responses and so on. The comparative analysis of early- and late- miRNAs confirmed that the information of replication timing domains is inherently encoded in miRNAs' sequence-structure signatures, and suggested that late-replication specific mutation patterns leave direct imprints on miRNA architecture. This study provides insights into the impact of DNA replication timing on miRNA-mediated posttranscriptional regulation and helps us understand the evolutionary mechanism of miRNAs.
真核生物的DNA复制按照一种特定的时间序列进行,即复制时间(RT)程序。最近的一项研究表明,早期和晚期复制的时间域具有不同的DNA突变模式,并且晚期复制序列具有偏向A和T的替换模式。这就引出了一个有趣的问题,即晚期复制域中的微小RNA(miRNA)如何应对由RT引起的突变偏差。
在本研究中,我们表征了前体miRNA相对于DNA复制时间的基因组分布,在晚期复制域中鉴定出362个前体miRNA(晚期miRNA),在早期复制域中鉴定出631个前体miRNA(早期miRNA)。我们全面研究了多种分子特征,包括二级结构特性、前体miRNA位点周围的基因组序列、Dicer加工基序、基于CAGE标签的启动子和miRNA表达谱。此外,我们进行了miRNA-靶标调控网络的模拟,以阐明晚期miRNA之间的协同调控模式。为了提高预测能力,我们基于RNA-FM嵌入开发了一种支持向量机(SVM)分类器,能够预测miRNA的复制时间域。
我们的研究表明,晚期前体miRNA在保持前体一定GC含量的前提下,通过两端延长长度来维持其折叠成发夹结构的能力。模拟结果表明,晚期miRNA倾向于协同调控相同的基因,并参与小分子代谢、免疫反应等。早期和晚期miRNA的比较分析证实了复制时间域的信息内在地编码在miRNA的序列-结构特征中,并表明晚期复制特异性突变模式在miRNA结构上留下了直接印记。本研究为DNA复制时间对miRNA介导的转录后调控的影响提供了见解,并有助于我们理解miRNA的进化机制。