Zhou Yuan-Ke, Hu Jie, Shen Zi-Ang, Zhang Wen-Ya, Du Pu-Feng
College of Intelligence and Computing, Tianjin University, Tianjin, China.
Front Genet. 2020 Dec 9;11:615144. doi: 10.3389/fgene.2020.615144. eCollection 2020.
Long non-coding RNAs (lncRNAs) play an important role in serval biological activities, including transcription, splicing, translation, and some other cellular regulation processes. lncRNAs perform their biological functions by interacting with various proteins. The studies on lncRNA-protein interactions are of great value to the understanding of lncRNA functional mechanisms. In this paper, we proposed a novel model to predict potential lncRNA-protein interactions using the SKF (similarity kernel fusion) and LapRLS (Laplacian regularized least squares) algorithms. We named this method the LPI-SKF. Various similarities of both lncRNAs and proteins were integrated into the LPI-SKF. LPI-SKF can be applied in predicting potential interactions involving novel proteins or lncRNAs. We obtained an AUROC (area under receiver operating curve) of 0.909 in a 5-fold cross-validation, which outperforms other state-of-the-art methods. A total of 19 out of the top 20 ranked interaction predictions were verified by existing data, which implied that the LPI-SKF had great potential in discovering unknown lncRNA-protein interactions accurately. All data and codes of this work can be downloaded from a GitHub repository (https://github.com/zyk2118216069/LPI-SKF).
长链非编码RNA(lncRNAs)在包括转录、剪接、翻译以及其他一些细胞调控过程在内的多种生物学活动中发挥着重要作用。lncRNAs通过与各种蛋白质相互作用来执行其生物学功能。对lncRNA-蛋白质相互作用的研究对于理解lncRNA的功能机制具有重要价值。在本文中,我们提出了一种使用SKF(相似性核融合)和LapRLS(拉普拉斯正则化最小二乘法)算法来预测潜在lncRNA-蛋白质相互作用的新模型。我们将此方法命名为LPI-SKF。lncRNAs和蛋白质的各种相似性都被整合到了LPI-SKF中。LPI-SKF可用于预测涉及新蛋白质或lncRNAs的潜在相互作用。在五折交叉验证中,我们获得了0.909的ROC曲线下面积(AUROC),这优于其他现有最先进的方法。排名前20的相互作用预测中有19个被现有数据验证,这表明LPI-SKF在准确发现未知lncRNA-蛋白质相互作用方面具有巨大潜力。这项工作的所有数据和代码都可以从GitHub仓库(https://github.com/zyk2118216069/LPI-SKF)下载。