School of Mathematics, Liaoning University, Shenyang 110036, China.
School of Mathematics, Jilin University, Changchun 130000, China.
Genomics. 2020 Mar;112(2):1335-1342. doi: 10.1016/j.ygeno.2019.08.001. Epub 2019 Aug 5.
Circular RNAs (circRNAs) are a new kind of endogenous non-coding RNAs, which have been discovered continuously. More and more studies have shown that circRNAs are related to the occurrence and development of human diseases. Identification of circRNAs associated with diseases can contribute to understand the pathogenesis, diagnosis and treatment of diseases. However, experimental methods of circRNA prediction remain expensive and time-consuming. Therefore, it is urgent to propose novel computational methods for the prediction of circRNA-disease associations. In this study, we develop a computational method called LLCDC that integrates the known circRNA-disease associations, circRNA semantic similarity network, disease semantic similarity network, reconstructed circRNA similarity network, and reconstructed disease similarity network to predict circRNAs related to human diseases. Specifically, the reconstructed similarity networks are obtained by using Locality-Constrained Linear Coding (LLC) on the known association matrix, cosine similarities of circRNAs and diseases. Then, the label propagation method is applied to the similarity networks, and four relevant score matrices are respectively obtained. Finally, we use 5-fold cross validation (5-fold CV) to evaluate the performance of LLCDC, and the AUC value of the method is 0.9177, indicating that our method performs better than the other three methods. In addition, case studies on gastric cancer, breast cancer and papillary thyroid carcinoma further verify the reliability of our method in predicting disease-associated circRNAs.
环状 RNA(circRNAs)是一种新的内源性非编码 RNA,不断被发现。越来越多的研究表明,circRNAs 与人类疾病的发生和发展有关。鉴定与疾病相关的 circRNAs 有助于了解疾病的发病机制、诊断和治疗。然而,circRNA 预测的实验方法仍然昂贵且耗时。因此,迫切需要提出新的计算方法来预测 circRNA-疾病关联。在这项研究中,我们开发了一种名为 LLCDC 的计算方法,该方法整合了已知的 circRNA-疾病关联、circRNA 语义相似性网络、疾病语义相似性网络、重建的 circRNA 相似性网络和重建的疾病相似性网络,以预测与人类疾病相关的 circRNAs。具体来说,通过在已知关联矩阵上使用局部约束线性编码(LLC)和 circRNAs 和疾病的余弦相似度,获得重建的相似性网络。然后,将标签传播方法应用于相似性网络,分别获得四个相关的得分矩阵。最后,我们使用 5 折交叉验证(5-fold CV)来评估 LLCDC 的性能,该方法的 AUC 值为 0.9177,表明我们的方法比其他三种方法表现更好。此外,对胃癌、乳腺癌和甲状腺乳头状癌的案例研究进一步验证了我们的方法在预测疾病相关 circRNAs 中的可靠性。