Wang Yifei, Ding Pengju, Wang Congjing, He Shiyue, Gao Xin, Yu Bin
IEEE Trans Neural Netw Learn Syst. 2025 Apr;36(4):7681-7695. doi: 10.1109/TNNLS.2024.3390935. Epub 2025 Apr 4.
RNA-protein interactions (RPIs) play an important role in several fundamental cellular physiological processes, including cell motility, chromosome replication, transcription and translation, and signaling. Predicting RPI can guide the exploration of cellular biological functions, intervening in diseases, and designing drugs. Given this, this study proposes the RPI-gated graph convolutional network (RPI-GGCN) method for predicting RPI based on the gated graph convolutional neural network (GGCN) and co-regularized variational autoencoder (Co-VAE). First, different types of feature information were extracted from RNA and protein sequences by nine feature extraction methods. Second, Co-VAEs are used to eliminate the redundancy of fused features and generate optimal features. Finally, this study introduces gated cyclic units into graph convolutional networks (GCNs) to construct a model for RPI prediction, which efficiently extracts topological information and improves the model's interpretable feature learning and expression capabilities. In the fivefold cross-validation test, the RPI-GGCN method achieved prediction accuracies of 97.27%, 97.32%, 96.54%, 95.76%, and 94.98% on the RPI369, RPI488, RPI1446, RPI1807, and RPI2241 datasets. To test the generalization performance of the model, we used the model trained on RPI369 to predict the independent NPInter v3.0 dataset and achieved excellent performance in all six independent validation sets. By visualizing the RPI network graph based on the prediction results, we aim to provide a new perspective and reference for studying RPI mechanisms and exploring new RPIs. Extensive experimental results demonstrate that RPI-GGCN can provide an efficient, accurate, and stable RPI prediction method.
RNA-蛋白质相互作用(RPIs)在多个基本细胞生理过程中发挥着重要作用,包括细胞运动、染色体复制、转录和翻译以及信号传导。预测RPIs可以指导细胞生物学功能的探索、疾病干预和药物设计。鉴于此,本研究基于门控图卷积神经网络(GGCN)和协同正则化变分自编码器(Co-VAE)提出了用于预测RPIs的RPI门控图卷积网络(RPI-GGCN)方法。首先,通过九种特征提取方法从RNA和蛋白质序列中提取不同类型的特征信息。其次,使用Co-VAEs消除融合特征的冗余并生成最优特征。最后,本研究将门控循环单元引入图卷积网络(GCNs)以构建用于RPI预测的模型,该模型有效地提取拓扑信息并提高了模型的可解释特征学习和表达能力。在五折交叉验证测试中,RPI-GGCN方法在RPI369、RPI488、RPI1446、RPI1807和RPI2241数据集上的预测准确率分别达到了97.27%、97.32%、96.54%、95.76%和94.98%。为了测试模型的泛化性能,我们使用在RPI369上训练的模型来预测独立的NPInter v3.0数据集,并在所有六个独立验证集中都取得了优异的性能。通过基于预测结果可视化RPI网络图,我们旨在为研究RPI机制和探索新的RPIs提供新的视角和参考。大量实验结果表明,RPI-GGCN可以提供一种高效、准确且稳定的RPI预测方法。