Wei Meng-Meng, Wang Lei, Zhao Bo-Wei, Su Xiao-Rui, You Zhu-Hong, Huang De-Shuang
IEEE J Biomed Health Inform. 2025 Apr 15;PP. doi: 10.1109/JBHI.2025.3561197.
CircRNA-miRNA interaction (CMI) plays a crucial role in the gene regulatory network of the cell. Numerous experiments have shown that abnormalities in CMI can impact molecular functions and physiological processes, leading to the occurrence of specific diseases. Current computational models for predicting CMI typically focus on local molecular entity relationships, thereby neglecting inherent molecular attributes and global structural information. To address these limitations, we propose a multi-feature fusion prediction model based on the transformer and graph attention network, named EGATCMI. Specifically, EGATCMI combines the transformer architecture with Word2vec to pre-train the sequence of circRNA and miRNA, capturing their sequence feature representation and sequence similarity. By leveraging the self-attention mechanism, EGATCMI extracts global structural feature from the CMI network. EGATCMI effectively integrates the obtained multi-feature for prediction, achieving AUC values of 0.9106 and 0.9470 on the CMI-9905 and CircBank datasets, respectively, outperforming existing methods. In case studies that the prediction of interactions between three miRNAs that are closely related to diseases and circRNAs, 8 out of 10 pairs were accurately predicted and validated. Extensive experimental results demonstrate the potential of EGATCMI as a reliable tool for candidate screening in biological investigations.
环状RNA-微RNA相互作用(CMI)在细胞的基因调控网络中起着至关重要的作用。大量实验表明,CMI异常会影响分子功能和生理过程,导致特定疾病的发生。当前用于预测CMI的计算模型通常侧重于局部分子实体关系,从而忽略了固有的分子属性和全局结构信息。为了解决这些局限性,我们提出了一种基于Transformer和图注意力网络的多特征融合预测模型,名为EGATCMI。具体而言,EGATCMI将Transformer架构与Word2vec相结合,对环状RNA和微RNA的序列进行预训练,捕捉它们的序列特征表示和序列相似性。通过利用自注意力机制,EGATCMI从CMI网络中提取全局结构特征。EGATCMI有效地整合所获得的多特征进行预测,在CMI-9905和CircBank数据集上分别实现了0.9106和0.9470的AUC值,优于现有方法。在与疾病密切相关的三种微RNA和环状RNA之间相互作用预测的案例研究中,10对中有8对被准确预测并得到验证。大量实验结果证明了EGATCMI作为生物研究中候选筛选可靠工具的潜力。