Guo Yajing, Lei Xiujuan, Li Shuyu
School of Computer Science, Shaanxi Normal University, Xi'an, 710119, China.
Interdiscip Sci. 2025 Mar;17(1):86-100. doi: 10.1007/s12539-024-00660-9. Epub 2024 Nov 6.
Circular RNA (circRNA) has the capacity to bind with RNA binding protein (RBP), thereby exerting a substantial impact on diseases. Predicting binding sites aids in comprehending the interaction mechanism, thereby offering insights for disease treatment strategies. Here, we propose a novel approach based on temporal convolutional network (TCN) and cross multi-head attention mechanism to predict circRNA-RBP binding sites (circTCA). First, we employ two distinct encoding methodologies to obtain two raw matrices of circRNA sequences. Then, two parallel TCN blocks extract shallow and abstract features of the two matrices separately. The fusion of the two is achieved through cross multi-head attention mechanism and after this, global expectation pooling assigns weights to the concatenated feature. Finally, the task of classifying the input sequence is entrusted to a fully connected (FC) layer. We compare circTCA with other five methods and conduct ablation experiments to demonstrate its effectiveness. We also conduct feature visualization and assess the motifs extracted by circTCA with existing motifs. All in all, circTCA is effective for binding sites prediction of circRNA and RBP.
环状RNA(circRNA)能够与RNA结合蛋白(RBP)结合,从而对疾病产生重大影响。预测结合位点有助于理解相互作用机制,从而为疾病治疗策略提供见解。在此,我们提出了一种基于时间卷积网络(TCN)和交叉多头注意力机制的新方法来预测circRNA-RBP结合位点(circTCA)。首先,我们采用两种不同的编码方法来获得circRNA序列的两个原始矩阵。然后,两个并行的TCN模块分别提取这两个矩阵的浅层和抽象特征。通过交叉多头注意力机制实现两者的融合,在此之后,全局期望池化为连接后的特征分配权重。最后,将对输入序列进行分类的任务交给全连接(FC)层。我们将circTCA与其他五种方法进行比较,并进行消融实验以证明其有效性。我们还进行了特征可视化,并将circTCA提取的基序与现有基序进行评估。总而言之,circTCA对于circRNA和RBP的结合位点预测是有效的。