Predicting potential miRNA-disease associations by combining gradient boosting decision tree with logistic regression.

Affiliations

College of Computer Science and Electronic Engineering, Hunan University, Changsha 410082, China.

Publication information

Comput Biol Chem. 2020 Apr;85:107200. doi: 10.1016/j.compbiolchem.2020.107200. Epub 2020 Jan 28.

DOI: 10.1016/j.compbiolchem.2020.107200
PMID: 32058946
Abstract

MicroRNAs (miRNAs) have been shown to play an indispensable role in many fundamental biological processes, and their dysregulation is closely correlated with complex human diseases. Many studies have focused on predicting potential miRNA-disease associations. Given the limited number of known miRNA-disease associations and the poor performance of many existing prediction methods, a novel model combining a gradient boosting decision tree with logistic regression (GBDT-LR) is proposed to prioritize miRNA candidates for diseases. To balance positive and negative samples, GBDT-LR first uses k-means clustering to screen negative samples from unknown miRNA-disease associations. Then, a gradient boosting decision tree (GBDT) model, which has an intrinsic advantage in finding distinguishing features and feature combinations, is applied to extract features. Finally, the new features extracted by the GBDT model are input into a logistic regression (LR) model to predict the final miRNA-disease association score. In 5-fold cross-validation (CV), GBDT-LR achieves an average AUC of 0.9274. In the case studies, 90%, 94%, and 88% of the top 50 miRNAs potentially associated with colon cancer, gastric cancer, and pancreatic cancer, respectively, were confirmed by databases. Compared with three other state-of-the-art methods, GBDT-LR achieves the best prediction performance. The source code and dataset of GBDT-LR are freely available at https://github.com/Pualalala/GBDT-LR.
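
The three steps in the abstract (k-means screening of negatives, GBDT feature extraction, LR scoring) can be sketched roughly as follows. This is an illustrative sketch using scikit-learn on synthetic data, not the authors' implementation (that is in the repository linked above). The function names, the hyperparameters, the 8-dimensional random features, and the choice to sample negatives evenly from each cluster are all assumptions; the abstract does not say how samples are drawn from the clusters or which features describe a miRNA-disease pair.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import StratifiedKFold
from sklearn.preprocessing import OneHotEncoder


def screen_negatives(unlabeled, n_needed, n_clusters=4, seed=0):
    """Pick up to n_needed unlabeled pairs, spread evenly over k-means clusters.

    Even per-cluster sampling is an assumption made for this sketch.
    """
    rng = np.random.default_rng(seed)
    labels = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed).fit_predict(unlabeled)
    per_cluster = n_needed // n_clusters
    picked = []
    for c in range(n_clusters):
        members = np.flatnonzero(labels == c)
        take = min(per_cluster, members.size)
        picked.append(rng.choice(members, size=take, replace=False))
    return unlabeled[np.concatenate(picked)]


def fit_gbdt_lr(X, y, seed=0):
    """Fit GBDT, one-hot encode the leaf each sample reaches in each tree, fit LR on that."""
    gbdt = GradientBoostingClassifier(n_estimators=30, max_depth=3, random_state=seed)
    gbdt.fit(X, y)
    leaves = gbdt.apply(X)[:, :, 0]  # shape (n_samples, n_estimators) for binary labels
    encoder = OneHotEncoder(handle_unknown="ignore").fit(leaves)
    lr = LogisticRegression(max_iter=1000).fit(encoder.transform(leaves), y)
    return gbdt, encoder, lr


def predict_scores(model, X):
    """Return the LR probability of the positive class as the association score."""
    gbdt, encoder, lr = model
    leaves = gbdt.apply(X)[:, :, 0]
    return lr.predict_proba(encoder.transform(leaves))[:, 1]


def cross_validated_auc(X, y, n_splits=5, seed=0):
    """Mean AUC over stratified folds; the whole GBDT+LR stack is refit per fold."""
    folds = StratifiedKFold(n_splits=n_splits, shuffle=True, random_state=seed)
    aucs = []
    for train, test in folds.split(X, y):
        model = fit_gbdt_lr(X[train], y[train], seed)
        aucs.append(roc_auc_score(y[test], predict_scores(model, X[test])))
    return float(np.mean(aucs))


# Synthetic stand-in data (assumption): rows are feature vectors of miRNA-disease pairs.
rng = np.random.default_rng(0)
positives = rng.normal(loc=1.0, size=(200, 8))    # known associations
unlabeled = rng.normal(loc=-1.0, size=(2000, 8))  # unknown associations
negatives = screen_negatives(unlabeled, n_needed=len(positives))

X = np.vstack([positives, negatives])
y = np.concatenate([np.ones(len(positives), dtype=int), np.zeros(len(negatives), dtype=int)])
mean_auc = cross_validated_auc(X, y)
print(f"mean 5-fold AUC on synthetic data: {mean_auc:.3f}")
```

The AUC printed here reflects only the synthetic data and says nothing about the 0.9274 reported in the abstract. Refitting the GBDT inside each fold keeps the leaf features from seeing the held-out pairs, which is one reasonable way to evaluate a stacked model of this kind.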

