Hunan Normal University and Hunan Xiangjiang Artificial Intelligence Academy, Changsha, China.
College of Computer Science and Electronic Engineering, Hunan University, Changsha, China.
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab444.
Circular RNAs (circRNAs) are a category of novelty discovered competing endogenous non-coding RNAs that have been proved to implicate many human complex diseases. A large number of circRNAs have been confirmed to be involved in cancer progression and are expected to become promising biomarkers for tumor diagnosis and targeted therapy. Deciphering the underlying relationships between circRNAs and diseases may provide new insights for us to understand the pathogenesis of complex diseases and further characterize the biological functions of circRNAs. As traditional experimental methods are usually time-consuming and laborious, computational models have made significant progress in systematically exploring potential circRNA-disease associations, which not only creates new opportunities for investigating pathogenic mechanisms at the level of circRNAs, but also helps to significantly improve the efficiency of clinical trials. In this review, we first summarize the functions and characteristics of circRNAs and introduce some representative circRNAs related to tumorigenesis. Then, we mainly investigate the available databases and tools dedicated to circRNA and disease studies. Next, we present a comprehensive review of computational methods for predicting circRNA-disease associations and classify them into five categories, including network propagating-based, path-based, matrix factorization-based, deep learning-based and other machine learning methods. Finally, we further discuss the challenges and future researches in this field.
环状 RNA(circRNAs)是一类新兴的竞争性内源非编码 RNA,已被证明与许多人类复杂疾病有关。大量 circRNAs 已被证实参与癌症的进展,并有望成为肿瘤诊断和靶向治疗的有前途的生物标志物。解析 circRNAs 与疾病之间的潜在关系,可能为我们深入了解复杂疾病的发病机制提供新的思路,并进一步阐明 circRNAs 的生物学功能。由于传统的实验方法通常耗时费力,计算模型在系统探索潜在的 circRNA-疾病关联方面取得了显著进展,这不仅为在 circRNAs 水平上研究发病机制创造了新的机会,而且有助于显著提高临床试验的效率。在这篇综述中,我们首先总结了 circRNAs 的功能和特征,并介绍了一些与肿瘤发生相关的代表性 circRNAs。然后,我们主要研究了专门用于 circRNA 和疾病研究的可用数据库和工具。接下来,我们全面回顾了预测 circRNA-疾病关联的计算方法,并将其分为五类,包括基于网络传播的方法、基于路径的方法、基于矩阵分解的方法、基于深度学习的方法和其他机器学习方法。最后,我们进一步讨论了该领域的挑战和未来研究方向。