Suppr超能文献

CircWalk:一种基于异质网络表示学习预测环状 RNA 与疾病关联的新方法。

CircWalk: a novel approach to predict CircRNA-disease association based on heterogeneous network representation learning.

机构信息

BCB Lab, Department of Computer Engineering, Sharif University of Technology, Tehran, Iran.

Department of Computer Science and Information Technology, Institute for Advanced Studies in Basic Sciences (IASBS), Zanjan, Iran.

出版信息

BMC Bioinformatics. 2022 Aug 11;23(1):331. doi: 10.1186/s12859-022-04883-9.

Abstract

BACKGROUND

Several types of RNA in the cell are usually involved in biological processes with multiple functions. Coding RNAs code for proteins while non-coding RNAs regulate gene expression. Some single-strand RNAs can create a circular shape via the back splicing process and convert into a new type called circular RNA (circRNA). circRNAs are among the essential non-coding RNAs in the cell that involve multiple disorders. One of the critical functions of circRNAs is to regulate the expression of other genes through sponging micro RNAs (miRNAs) in diseases. This mechanism, known as the competing endogenous RNA (ceRNA) hypothesis, and additional information obtained from biological datasets can be used by computational approaches to predict novel associations between disease and circRNAs.

RESULTS

We applied multiple classifiers to validate the extracted features from the heterogeneous network and selected the most appropriate one based on some evaluation criteria. Then, the XGBoost is utilized in our pipeline to generate a novel approach, called CircWalk, to predict CircRNA-Disease associations. Our results demonstrate that CircWalk has reasonable accuracy and AUC compared with other state-of-the-art algorithms. We also use CircWalk to predict novel circRNAs associated with lung, gastric, and colorectal cancers as a case study. The results show that our approach can accurately detect novel circRNAs related to these diseases.

CONCLUSIONS

Considering the ceRNA hypothesis, we integrate multiple resources to construct a heterogeneous network from circRNAs, mRNAs, miRNAs, and diseases. Next, the DeepWalk algorithm is applied to the network to extract feature vectors for circRNAs and diseases. The extracted features are used to learn a classifier and generate a model to predict novel CircRNA-Disease associations. Our approach uses the concept of the ceRNA hypothesis and the miRNA sponge effect of circRNAs to predict their associations with diseases. Our results show that this outlook could help identify CircRNA-Disease associations more accurately.

摘要

背景

细胞中的几种类型的 RNA 通常参与具有多种功能的生物过程。编码 RNA 为蛋白质编码,而非编码 RNA 调节基因表达。一些单链 RNA 可以通过反向剪接过程形成环状形状,并转化为一种称为环状 RNA(circRNA)的新型 RNA。circRNA 是细胞中重要的非编码 RNA 之一,涉及多种疾病。circRNA 的一个关键功能是通过在疾病中海绵吸附 microRNA(miRNA)来调节其他基因的表达。这种机制被称为竞争性内源 RNA(ceRNA)假说,并且可以通过计算方法从生物数据集获得额外信息来预测疾病与 circRNA 之间的新关联。

结果

我们应用了多种分类器来验证从异构网络中提取的特征,并根据一些评估标准选择了最合适的分类器。然后,我们在管道中使用 XGBoost 生成一种新方法,称为 CircWalk,用于预测 circRNA-疾病关联。我们的结果表明,CircWalk 与其他最先进的算法相比具有合理的准确性和 AUC。我们还使用 CircWalk 来预测与肺癌、胃癌和结直肠癌相关的新型 circRNA 作为案例研究。结果表明,我们的方法可以准确检测与这些疾病相关的新型 circRNA。

结论

考虑到 ceRNA 假说,我们整合了多个资源,从 circRNA、mRNA、miRNA 和疾病构建了一个异构网络。然后,应用 DeepWalk 算法对网络进行分析,提取 circRNA 和疾病的特征向量。提取的特征用于学习分类器并生成模型,以预测新型 circRNA-疾病关联。我们的方法使用 ceRNA 假说和 circRNA 的 miRNA 海绵效应的概念来预测它们与疾病的关联。我们的结果表明,这种方法可以帮助更准确地识别 circRNA-疾病关联。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验