Suppr超能文献

利用多类型层次聚类技术预测 ncRNAs 与疾病之间的新关联。

Prediction of new associations between ncRNAs and diseases exploiting multi-type hierarchical clustering.

机构信息

University of Bari Aldo Moro - Department of Computer Science, Via Orabona, 4, Bari, 70125, Italy.

CNR, Institute for Biomedical Technologies, Bari, 70126, Italy.

出版信息

BMC Bioinformatics. 2020 Feb 24;21(1):70. doi: 10.1186/s12859-020-3392-2.

Abstract

BACKGROUND

The study of functional associations between ncRNAs and human diseases is a pivotal task of modern research to develop new and more effective therapeutic approaches. Nevertheless, it is not a trivial task since it involves entities of different types, such as microRNAs, lncRNAs or target genes whose expression also depends on endogenous or exogenous factors. Such a complexity can be faced by representing the involved biological entities and their relationships as a network and by exploiting network-based computational approaches able to identify new associations. However, existing methods are limited to homogeneous networks (i.e., consisting of only one type of objects and relationships) or can exploit only a small subset of the features of biological entities, such as the presence of a particular binding domain, enzymatic properties or their involvement in specific diseases.

RESULTS

To overcome the limitations of existing approaches, we propose the system LP-HCLUS, which exploits a multi-type hierarchical clustering method to predict possibly unknown ncRNA-disease relationships. In particular, LP-HCLUS analyzes heterogeneous networks consisting of several types of objects and relationships, each possibly described by a set of features, and extracts multi-type clusters that are subsequently exploited to predict new ncRNA-disease associations. The extracted clusters are overlapping, hierarchically organized, involve entities of different types, and allow LP-HCLUS to catch multiple roles of ncRNAs in diseases at different levels of granularity. Our experimental evaluation, performed on heterogeneous attributed networks consisting of microRNAs, lncRNAs, diseases, genes and their known relationships, shows that LP-HCLUS is able to obtain better results with respect to existing approaches. The biological relevance of the obtained results was evaluated according to both quantitative (i.e., TPR@k, Areas Under the TPR@k, ROC and Precision-Recall curves) and qualitative (i.e., according to the consultation of the existing literature) criteria.

CONCLUSIONS

The obtained results prove the utility of LP-HCLUS to conduct robust predictive studies on the biological role of ncRNAs in human diseases. The produced predictions can therefore be reliably considered as new, previously unknown, relationships among ncRNAs and diseases.

摘要

背景

研究 ncRNA 与人类疾病之间的功能关联是开发新的、更有效的治疗方法的现代研究的关键任务。然而,这并不是一项简单的任务,因为它涉及到不同类型的实体,如 microRNAs、lncRNAs 或靶基因,它们的表达也取决于内源性或外源性因素。这种复杂性可以通过将涉及的生物实体及其关系表示为网络,并利用能够识别新关联的基于网络的计算方法来解决。然而,现有的方法仅限于同构网络(即,仅由一种类型的对象和关系组成),或者只能利用生物实体的一小部分特征,例如特定结合域的存在、酶特性或它们在特定疾病中的参与。

结果

为了克服现有方法的局限性,我们提出了系统 LP-HCLUS,该系统利用多类型层次聚类方法来预测可能未知的 ncRNA-疾病关系。具体来说,LP-HCLUS 分析由几种类型的对象和关系组成的异构网络,每种类型都可能由一组特征描述,并提取多类型聚类,然后利用这些聚类来预测新的 ncRNA-疾病关联。提取的聚类是重叠的、层次化组织的,涉及不同类型的实体,并允许 LP-HCLUS 在不同的粒度级别上捕获 ncRNA 在疾病中的多种作用。我们在由 microRNAs、lncRNAs、疾病、基因及其已知关系组成的异构属性网络上进行的实验评估表明,LP-HCLUS 能够获得比现有方法更好的结果。根据定量(即 TPR@k、TPR@k 下的面积、ROC 和精度-召回曲线)和定性(即根据现有文献的咨询)标准,评估了获得的结果的生物学相关性。

结论

获得的结果证明了 LP-HCLUS 在对 ncRNA 在人类疾病中的生物学作用进行稳健预测研究方面的效用。因此,可以可靠地认为所产生的预测是 ncRNA 与疾病之间的新的、以前未知的关系。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9553/7041288/01e8224f8aad/12859_2020_3392_Fig1_HTML.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验