Xie Minzhu, Xie Ruijie, Wang Hao
College of Information Science and Engineering, Hunan Normal University, China.
Methods. 2023 Dec;220:98-105. doi: 10.1016/j.ymeth.2023.11.007. Epub 2023 Nov 14.
Many studies have shown that long-chain noncoding RNAs (lncRNAs) are involved in a variety of biological processes such as post-transcriptional gene regulation, splicing, and translation by combining with corresponding proteins. Predicting lncRNA-protein interactions is an effective approach to infer the functions of lncRNAs. The paper proposes a new computational model named LPI-IBWA. At first, LPI-IBWA uses similarity kernel fusion (SKF) to integrate various types of biological information to construct lncRNA and protein similarity networks. Then, a bounded matrix completion model and a weighted k-nearest known neighbors algorithm are utilized to update the initial sparse lncRNA-protein interaction matrix. Based on the updated lncRNA-protein interaction matrix, the lncRNA similarity network and the protein similarity network are integrated into a heterogeneous network. Finally, an improved Bi-Random walk algorithm is used to predict novel latent lncRNA-protein interactions. 5-fold cross-validation experiments on a benchmark dataset showed that the AUC and AUPR of LPI-IBWA reach 0.920 and 0.736, respectively, which are higher than those of other state-of-the-art methods. Furthermore, the experimental results of case studies on a novel dataset also illustrated that LPI-IBWA could efficiently predict potential lncRNA-protein interactions.
许多研究表明,长链非编码RNA(lncRNA)通过与相应蛋白质结合,参与多种生物学过程,如转录后基因调控、剪接和翻译。预测lncRNA-蛋白质相互作用是推断lncRNA功能的有效方法。本文提出了一种名为LPI-IBWA的新计算模型。首先,LPI-IBWA使用相似性核融合(SKF)整合各种类型的生物信息,构建lncRNA和蛋白质相似性网络。然后,利用有界矩阵补全模型和加权k近邻已知邻居算法更新初始稀疏的lncRNA-蛋白质相互作用矩阵。基于更新后的lncRNA-蛋白质相互作用矩阵,将lncRNA相似性网络和蛋白质相似性网络整合为一个异构网络。最后,使用改进的双向随机游走算法预测新的潜在lncRNA-蛋白质相互作用。在一个基准数据集上进行的5折交叉验证实验表明,LPI-IBWA的AUC和AUPR分别达到0.920和0.736,高于其他现有最先进方法。此外,在一个新数据集上的案例研究实验结果也表明,LPI-IBWA可以有效地预测潜在的lncRNA-蛋白质相互作用。