Teng Zhixia, Tian Zhaowen, Zhou Murong, Wang Guohua, Tian Zhen, Zhao Yuming
College of Computer and Control Engineering, Northeast Forestry University, 150040, Harbin, China.
School of Computer and Artificial Intelligence, Zhengzhou University, 450001, Zhengzhou, China.
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf281.
Interactions between long non-coding RNAs (lncRNAs) and microRNAs (miRNAs) play an important role in the development of complex human diseases by collaboratively regulating gene transcription and expression. Therefore, identifying lncRNA-miRNA interactions (LMIs) is essential for diagnosing and treating complex human diseases. Because identifying LMIs with wet experiments is time-consuming and labor-intensive, some computational methods have been developed to infer LMIs. However, these approaches excel at utilizing single-modal information but struggle to integrate multimodal data from lncRNAs and miRNAs, which is essential for uncovering complex patterns in LMIs, ultimately limiting their performance. Therefore, this article proposes a novel multimodal contrastive representation learning model (MCRLMI) for LMI predictions. The model fully integrates multi-source similarity information and sequence encodings of lncRNAs and miRNAs. It leverages a graph convolutional network (GCN) and a Transformer to capture local neighborhood structural features and long-distance dependencies, respectively, enabling the collaborative modeling of structural and semantic information. Subsequently, to effectively integrate multimodal characteristics with encoded information, a multichannel attention mechanism and contrastive learning are introduced to fuse the extracted features. Finally, a Kolmogorov-Arnold Network (KAN) is trained with the optimized embeddings to predict LMIs. Extensive experiments show that the proposed MCRLMI consistently outperforms existing methods. Moreover, case studies further validate the potential of MCRLMI to identify novel LMIs in practical applications.
长链非编码RNA(lncRNA)与微小RNA(miRNA)之间的相互作用通过协同调控基因转录和表达,在复杂人类疾病的发生发展中发挥着重要作用。因此,识别lncRNA-miRNA相互作用(LMI)对于复杂人类疾病的诊断和治疗至关重要。由于通过湿实验识别LMI既耗时又费力,因此已经开发了一些计算方法来推断LMI。然而,这些方法擅长利用单模态信息,但难以整合来自lncRNA和miRNA的多模态数据,而这对于揭示LMI中的复杂模式至关重要,最终限制了它们的性能。因此,本文提出了一种用于LMI预测的新型多模态对比表示学习模型(MCRLMI)。该模型充分整合了lncRNA和miRNA的多源相似性信息和序列编码。它利用图卷积网络(GCN)和Transformer分别捕获局部邻域结构特征和长距离依赖性,从而实现结构和语义信息的协同建模。随后,为了有效地将多模态特征与编码信息整合在一起,引入了多通道注意力机制和对比学习来融合提取的特征。最后,使用优化后的嵌入训练柯尔莫哥洛夫 - 阿诺德网络(KAN)来预测LMI。大量实验表明,所提出的MCRLMI始终优于现有方法。此外,案例研究进一步验证了MCRLMI在实际应用中识别新型LMI的潜力。