• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于利用正未标记数据预测潜在miRNA与疾病关联的集成学习框架。

An ensemble learning framework for potential miRNA-disease association prediction with positive-unlabeled data.

作者信息

Wu Yao, Zhu Donghua, Wang Xuefeng, Zhang Shuo

机构信息

School of Management and Economics, Beijing Institute of Technology, Beijing 100081, China.

School of Management and Economics, Beijing Institute of Technology, Beijing 100081, China.

出版信息

Comput Biol Chem. 2021 Dec;95:107566. doi: 10.1016/j.compbiolchem.2021.107566. Epub 2021 Aug 24.

DOI:10.1016/j.compbiolchem.2021.107566
PMID:34534906
Abstract

To explore the pathogenic mechanisms of MicroRNA (miRNA) on diverse diseases, many researchers have concentrated on discovering the potential associations between miRNA and disease using machine learning methods. However, the prediction accuracy of supervised machine learning methods is limited by lacking of experimentally-validated uncorrelated miRNA-disease pairs. Without these negative samples, training a highly accurate model is much more difficult. Different from traditional miRNA-disease prediction models using randomly selected unknown samples as negative training samples, we propose an ensemble learning framework to solve this positive-unlabeled (PU) learning problem. The framework incorporates two steps, i.e., a novel semi-supervised Kmeans (SS-Kmeans) to extract reliable negative samples from unknown miRNA-disease pairs and subagging method to generate diverse training sample sets to make full use of those reliable negative samples for ensemble learning. Combined with effective random vector functional link (RVFL) network as prediction model, the proposed framework showed superior prediction accuracy comparing with other popular approaches. A case study on lung and gastric neoplasms further confirms the framework's efficacy at identifying miRNA disease associations.

摘要

为了探究微小RNA(miRNA)在多种疾病中的致病机制,许多研究人员致力于利用机器学习方法发现miRNA与疾病之间的潜在关联。然而,监督式机器学习方法的预测准确性受到缺乏经实验验证的不相关miRNA-疾病对的限制。没有这些阴性样本,训练一个高度准确的模型要困难得多。与使用随机选择的未知样本作为阴性训练样本的传统miRNA-疾病预测模型不同,我们提出了一个集成学习框架来解决这个正例未标注(PU)学习问题。该框架包含两个步骤,即一种新颖的半监督Kmeans(SS-Kmeans)方法,用于从未知的miRNA-疾病对中提取可靠的阴性样本,以及子agging方法,用于生成多样化的训练样本集,以便充分利用这些可靠的阴性样本进行集成学习。结合有效的随机向量函数链接(RVFL)网络作为预测模型,所提出的框架与其他流行方法相比,显示出更高的预测准确性。一项关于肺癌和胃癌的案例研究进一步证实了该框架在识别miRNA-疾病关联方面的有效性。

相似文献

1
An ensemble learning framework for potential miRNA-disease association prediction with positive-unlabeled data.一种用于利用正未标记数据预测潜在miRNA与疾病关联的集成学习框架。
Comput Biol Chem. 2021 Dec;95:107566. doi: 10.1016/j.compbiolchem.2021.107566. Epub 2021 Aug 24.
2
Predicting potential miRNA-disease associations based on more reliable negative sample selection.基于更可靠的负样本选择来预测潜在的 miRNA-疾病关联。
BMC Bioinformatics. 2022 Oct 17;23(1):432. doi: 10.1186/s12859-022-04978-3.
3
A semi-supervised machine learning framework for microRNA classification.一种用于 microRNA 分类的半监督机器学习框架。
Hum Genomics. 2019 Oct 22;13(Suppl 1):43. doi: 10.1186/s40246-019-0221-7.
4
Prediction of potential miRNA-disease associations based on stacked autoencoder.基于堆叠自编码器的 miRNA 疾病关联预测。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbac021.
5
An integrated framework for the identification of potential miRNA-disease association based on novel negative samples extraction strategy.基于新型负样本提取策略的潜在 miRNA-疾病关联识别的综合框架。
RNA Biol. 2019 Mar;16(3):257-269. doi: 10.1080/15476286.2019.1568820. Epub 2019 Jan 28.
6
Predicting miRNA-disease associations using an ensemble learning framework with resampling method.基于重采样方法的集成学习框架预测 miRNA-疾病关联。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab543.
7
A Supervised Ensemble Approach for Sensitive microRNA Target Prediction.基于监督集成方法的 miRNA 靶基因预测
IEEE/ACM Trans Comput Biol Bioinform. 2020 Jan-Feb;17(1):37-46. doi: 10.1109/TCBB.2018.2858252. Epub 2018 Jul 20.
8
Improved low-rank matrix recovery method for predicting miRNA-disease association.改进的低秩矩阵恢复方法用于预测 miRNA-疾病关联。
Sci Rep. 2017 Jul 20;7(1):6007. doi: 10.1038/s41598-017-06201-3.
9
Combining Supervised and Unsupervised Learning for Improved miRNA Target Prediction.结合监督学习和无监督学习提高 miRNA 靶标预测。
IEEE/ACM Trans Comput Biol Bioinform. 2018 Sep-Oct;15(5):1594-1604. doi: 10.1109/TCBB.2017.2727042. Epub 2017 Jul 13.
10
A semi-supervised deep learning method based on stacked sparse auto-encoder for cancer prediction using RNA-seq data.基于堆叠稀疏自动编码器的半监督深度学习方法在 RNA-seq 数据癌症预测中的应用。
Comput Methods Programs Biomed. 2018 Nov;166:99-105. doi: 10.1016/j.cmpb.2018.10.004. Epub 2018 Oct 5.

引用本文的文献

1
TriFusion enables accurate prediction of miRNA-disease association by a tri-channel fusion neural network.TriFusion 通过三通道融合神经网络实现 miRNA-疾病关联的准确预测。
Commun Biol. 2024 Aug 30;7(1):1067. doi: 10.1038/s42003-024-06734-0.
2
A Novel Framework for Data Assessment That Uses Edge Technology to Improve the Detection of Communicable Diseases.一种利用边缘技术改进传染病检测的数据评估新框架。
Diagnostics (Basel). 2024 May 30;14(11):1148. doi: 10.3390/diagnostics14111148.
3
DAE-CFR: detecting microRNA-disease associations using deep autoencoder and combined feature representation.
DAE-CFR:使用深度自动编码器和组合特征表示来检测 microRNA-疾病关联。
BMC Bioinformatics. 2024 Mar 29;25(1):139. doi: 10.1186/s12859-024-05757-y.
4
Predicting potential miRNA-disease associations based on more reliable negative sample selection.基于更可靠的负样本选择来预测潜在的 miRNA-疾病关联。
BMC Bioinformatics. 2022 Oct 17;23(1):432. doi: 10.1186/s12859-022-04978-3.
5
A Highly Discriminative Hybrid Feature Selection Algorithm for Cancer Diagnosis.一种用于癌症诊断的高判别混合特征选择算法。
ScientificWorldJournal. 2022 Aug 9;2022:1056490. doi: 10.1155/2022/1056490. eCollection 2022.