Hunan Provincial Key Laboratory of Intelligent Computing and Language Information Processing, College of Information Science and Engineering, Hunan Normal University, Changsha, 410081, China.
Hunan Xiangjiang Artificial Intelligence Academy, Changsha, 410000, China.
Mol Genet Genomics. 2021 Jan;296(1):223-233. doi: 10.1007/s00438-020-01741-2. Epub 2020 Nov 6.
Circular RNAs (circRNAs) are a special class of non-coding RNAs with covalently closed-loop structures. Studies prove that circRNAs perform critical roles in various biological processes, and the aberrant expression of circRNAs is closely related to tumorigenesis. Therefore, identifying potential circRNA-disease associations is beneficial to understand the pathogenesis of complex diseases at the circRNA level and helps biomedical researchers and practitioners to discover diagnostic biomarkers accurately. However, it is tremendously laborious and time-consuming to discover disease-related circRNAs with conventional biological experiments. In this study, we develop an integrative framework, called iCDA-CMG, to predict potential associations between circRNAs and diseases. By incorporating multi-source prior knowledge, including known circRNA-disease associations, disease similarities and circRNA similarities, we adopt a collective matrix completion-based graph learning model to prioritize the most promising disease-related circRNAs for guiding laborious clinical trials. The results show that iCDA-CMG outperforms other state-of-the-art models in terms of cross-validation and independent prediction. Moreover, the case studies for several representative cancers suggest the effectiveness of iCDA-CMG in screening circRNA candidates for human diseases, which will contribute to elucidating the pathogenesis mechanisms and unveiling new opportunities for disease diagnosis and targeted therapy.
环状 RNA(circRNAs)是一类具有共价闭环结构的特殊非编码 RNA。研究证明,circRNAs 在各种生物过程中发挥着关键作用,circRNAs 的异常表达与肿瘤发生密切相关。因此,鉴定潜在的 circRNA-疾病关联有助于从 circRNA 水平理解复杂疾病的发病机制,有助于生物医学研究人员和从业者准确发现诊断生物标志物。然而,利用传统的生物学实验来发现与疾病相关的 circRNAs 是极其费力和耗时的。在本研究中,我们开发了一种名为 iCDA-CMG 的综合框架,用于预测 circRNAs 与疾病之间的潜在关联。通过整合多种来源的先验知识,包括已知的 circRNA-疾病关联、疾病相似性和 circRNA 相似性,我们采用基于集体矩阵补全的图学习模型对最有希望与疾病相关的 circRNAs 进行优先级排序,以指导费力的临床试验。结果表明,iCDA-CMG 在交叉验证和独立预测方面均优于其他最先进的模型。此外,对几种代表性癌症的案例研究表明,iCDA-CMG 在筛选人类疾病的 circRNA 候选物方面是有效的,这将有助于阐明发病机制,并为疾病诊断和靶向治疗开辟新的机会。