Liu Jin-Xing, Gao Ming-Ming, Cui Zhen, Gao Ying-Lian, Li Feng
School of Computer Science, Qufu Normal University, Rizhao, China.
Qufu Normal University Library, Qufu Normal University, Rizhao, China.
BMC Bioinformatics. 2021 May 12;22(Suppl 3):241. doi: 10.1186/s12859-020-03868-w.
In the development of science and technology, there are increasing evidences that there are some associations between lncRNAs and human diseases. Therefore, finding these associations between them will have a huge impact on our treatment and prevention of some diseases. However, the process of finding the associations between them is very difficult and requires a lot of time and effort. Therefore, it is particularly important to find some good methods for predicting lncRNA-disease associations (LDAs).
In this paper, we propose a method based on dual sparse collaborative matrix factorization (DSCMF) to predict LDAs. The DSCMF method is improved on the traditional collaborative matrix factorization method. To increase the sparsity, the L-norm is added in our method. At the same time, Gaussian interaction profile kernel is added to our method, which increase the network similarity between lncRNA and disease. Finally, the AUC value obtained by the experiment is used to evaluate the quality of our method, and the AUC value is obtained by the ten-fold cross-validation method.
The AUC value obtained by the DSCMF method is 0.8523. At the end of the paper, simulation experiment is carried out, and the experimental results of prostate cancer, breast cancer, ovarian cancer and colorectal cancer are analyzed in detail. The DSCMF method is expected to bring some help to lncRNA-disease associations research. The code can access the https://github.com/Ming-0113/DSCMF website.
在科学技术的发展过程中,越来越多的证据表明长链非编码核糖核酸(lncRNAs)与人类疾病之间存在一些关联。因此,发现它们之间的这些关联将对我们治疗和预防某些疾病产生巨大影响。然而,寻找它们之间关联的过程非常困难,需要大量的时间和精力。因此,找到一些预测长链非编码核糖核酸-疾病关联(LDA)的好方法尤为重要。
在本文中,我们提出了一种基于双稀疏协同矩阵分解(DSCMF)的方法来预测LDA。DSCMF方法是在传统协同矩阵分解方法的基础上改进而来。为了增加稀疏性,我们的方法中添加了L-范数。同时,我们的方法中添加了高斯交互轮廓核,这增加了长链非编码核糖核酸与疾病之间的网络相似性。最后,通过实验获得的AUC值用于评估我们方法的质量,AUC值是通过十折交叉验证法获得的。
DSCMF方法获得的AUC值为0.8523。在论文结尾进行了模拟实验,并详细分析了前列腺癌、乳腺癌、卵巢癌和结直肠癌的实验结果。DSCMF方法有望为长链非编码核糖核酸-疾病关联研究带来一些帮助。代码可访问https://github.com/Ming-0113/DSCMF网站。