Suppr超能文献

基于集成学习和核岭回归的潜在miRNA-疾病关联推断的计算研究

A Computational Study of Potential miRNA-Disease Association Inference Based on Ensemble Learning and Kernel Ridge Regression.

作者信息

Peng Li-Hong, Zhou Li-Qian, Chen Xing, Piao Xue

机构信息

School of Computer Science, Hunan University of Technology, Zhuzhou, China.

School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China.

出版信息

Front Bioeng Biotechnol. 2020 Feb 6;8:40. doi: 10.3389/fbioe.2020.00040. eCollection 2020.

Abstract

As increasing experimental studies have shown that microRNAs (miRNAs) are closely related to multiple biological processes and the prevention, diagnosis and treatment of human diseases, a growing number of researchers are focusing on the identification of associations between miRNAs and diseases. Identifying such associations purely via experiments is costly and demanding, which prompts researchers to develop computational methods to complement the experiments. In this paper, a novel prediction model named Ensemble of Kernel Ridge Regression based MiRNA-Disease Association prediction (EKRRMDA) was developed. EKRRMDA obtained features of miRNAs and diseases by integrating the disease semantic similarity, the miRNA functional similarity and the Gaussian interaction profile kernel similarity for diseases and miRNAs. Under the computational framework that utilized ensemble learning and feature dimensionality reduction, multiple base classifiers that combined two Kernel Ridge Regression classifiers from the miRNA side and disease side, respectively, were obtained based on random selection of features. Then average strategy for these base classifiers was adopted to obtain final association scores of miRNA-disease pairs. In the global and local leave-one-out cross validation, EKRRMDA attained the AUCs of 0.9314 and 0.8618, respectively. Moreover, the model's average AUC with standard deviation in 5-fold cross validation was 0.9275 ± 0.0008. In addition, we implemented three different types of case studies on predicting miRNAs associated with five important diseases. As a result, there were 90% (Esophageal Neoplasms), 86% (Kidney Neoplasms), 86% (Lymphoma), 98% (Lung Neoplasms), and 96% (Breast Neoplasms) of the top 50 predicted miRNAs verified to have associations with these diseases.

摘要

越来越多的实验研究表明,微小RNA(miRNA)与多种生物学过程以及人类疾病的预防、诊断和治疗密切相关,因此越来越多的研究人员致力于识别miRNA与疾病之间的关联。单纯通过实验来识别此类关联成本高昂且要求苛刻,这促使研究人员开发计算方法来辅助实验。本文开发了一种名为基于核岭回归集成的miRNA-疾病关联预测(EKRRMDA)的新型预测模型。EKRRMDA通过整合疾病语义相似性、miRNA功能相似性以及疾病和miRNA的高斯相互作用轮廓核相似性来获取miRNA和疾病的特征。在利用集成学习和特征降维的计算框架下,基于特征的随机选择,分别从miRNA和疾病两个方面组合两个核岭回归分类器得到多个基分类器。然后采用这些基分类器的平均策略来获得miRNA-疾病对的最终关联分数。在全局和局部留一法交叉验证中,EKRRMDA的AUC分别达到了0.9314和0.8618。此外,该模型在5折交叉验证中的平均AUC及其标准差为0.9275±0.0008。另外,我们针对预测与五种重要疾病相关的miRNA进行了三种不同类型的案例研究。结果显示,在前50个预测的miRNA中,分别有90%(食管癌)、86%(肾肿瘤)、86%(淋巴瘤)、98%(肺癌)和96%(乳腺癌)被证实与这些疾病有关联。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/23a3/7015868/562b3a0dc800/fbioe-08-00040-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验