Suppr超能文献

通过基于图回归的统一框架预测二元、离散和连续的长链非编码RNA-疾病关联。

Predicting binary, discrete and continued lncRNA-disease associations via a unified framework based on graph regression.

作者信息

Shi Jian-Yu, Huang Hua, Zhang Yan-Ning, Long Yu-Xi, Yiu Siu-Ming

机构信息

School of Life Sciences, Northwestern Polytechnical University, Xi'an, 710072, China.

School of Software and Microelectronics, Northwestern Polytechnical University, Xi'an, 710072, China.

出版信息

BMC Med Genomics. 2017 Dec 21;10(Suppl 4):65. doi: 10.1186/s12920-017-0305-y.

Abstract

BACKGROUND

In human genomes, long non-coding RNAs (lncRNAs) have attracted more and more attention because their dysfunctions are involved in many diseases. However, the associations between lncRNAs and diseases (LDA) still remain unknown in most cases. While identifying disease-related lncRNAs in vivo is costly, computational approaches are promising to not only accelerate the possible identification of associations but also provide clues on the underlying mechanism of various lncRNA-caused diseases. Former computational approaches usually only focus on predicting new associations between lncRNAs having known associations with diseases and other lncRNA-associated diseases. They also only work on binary lncRNA-disease associations (whether the pair has an association or not), which cannot reflect and reveal other biological facts, such as the number of proteins involved in LDA or how strong the association is (i.e., the intensity of LDA).

RESULTS

To address abovementioned issues, we propose a graph regression-based unified framework (GRUF). In particular, our method can work on lncRNAs, which have no previously known disease association and diseases that have no known association with any lncRNAs. Also, instead of only a binary answer for the association, our method tries to uncover more biological relationship between a pair of lncRNA and disease, which may provide better clues for researchers. We compared GRUF with three state-of-the-art approaches and demonstrated the superiority of GRUF, which achieves 5%~16% improvement in terms of the area under the receiver operating characteristic curve (AUC). GRUF also provides a predicted confidence score for the predicted LDA, which reveals the significant correlation between the score and the number of RNA-Binding Proteins involved in LDAs. Lastly, three out of top-5 LDA candidates generated by GRUF in novel prediction are verified indirectly by medical literature and known biological facts.

CONCLUSIONS

The proposed GRUF has two advantages over existing approaches. Firstly, it can be used to work on lncRNAs that have no known disease association and diseases that have no known association with any lncRNAs. Secondly, instead of providing a binary answer (with or without association), GRUF works for both discrete and continued LDA, which help revealing the pathological implications between lncRNAs and diseases.

摘要

背景

在人类基因组中,长链非编码RNA(lncRNA)因其功能失调与多种疾病相关而受到越来越多的关注。然而,在大多数情况下,lncRNA与疾病之间的关联(LDA)仍不为人所知。虽然在体内鉴定与疾病相关的lncRNA成本高昂,但计算方法有望不仅加速可能的关联识别,还能为各种lncRNA引发疾病的潜在机制提供线索。以前的计算方法通常只专注于预测与疾病有已知关联的lncRNA和其他lncRNA相关疾病之间的新关联。它们也只适用于二元lncRNA - 疾病关联(该对是否存在关联),这无法反映和揭示其他生物学事实,例如参与LDA的蛋白质数量或关联的强度(即LDA的强度)。

结果

为了解决上述问题,我们提出了一种基于图回归的统一框架(GRUF)。特别是,我们的方法可以处理以前没有已知疾病关联的lncRNA以及与任何lncRNA都没有已知关联的疾病。此外,我们的方法不是只给出关联的二元答案,而是试图揭示一对lncRNA与疾病之间更多的生物学关系,这可能为研究人员提供更好的线索。我们将GRUF与三种最先进的方法进行了比较,并证明了GRUF的优越性,其在受试者工作特征曲线下面积(AUC)方面提高了5% - 16%。GRUF还为预测的LDA提供了预测置信度分数,这揭示了该分数与参与LDA的RNA结合蛋白数量之间的显著相关性。最后,GRUF在新预测中生成的前5个LDA候选中有3个通过医学文献和已知生物学事实得到了间接验证。

结论

所提出的GRUF相对于现有方法有两个优点。首先,它可用于处理没有已知疾病关联的lncRNA以及与任何lncRNA都没有已知关联的疾病。其次,GRUF不是提供二元答案(有或无关联),而是适用于离散和连续的LDA,这有助于揭示lncRNA与疾病之间的病理意义。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba28/5763297/44ce91e88a1b/12920_2017_305_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验