• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估随机森林和基于迭代随机森林的方法应用于基因表达数据时的性能。

Evaluating the performance of random forest and iterative random forest based methods when applied to gene expression data.

作者信息

Walker Angelica M, Cliff Ashley, Romero Jonathon, Shah Manesh B, Jones Piet, Felipe Machado Gazolla Joao Gabriel, Jacobson Daniel A, Kainer David

机构信息

The Bredesen Center for Interdisciplinary Research and Graduate Education, University of Tennessee Knoxville, 821 Volunteer Blvd, Knoxville 37996, TN, USA.

Computational and Predictive Biology, Oak Ridge National Laboratory, 1 Bethel Valley Rd, Oak Ridge 37830, TN, USA.

出版信息

Comput Struct Biotechnol J. 2022 Jun 22;20:3372-3386. doi: 10.1016/j.csbj.2022.06.037. eCollection 2022.

DOI:10.1016/j.csbj.2022.06.037
PMID:35832622
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9260260/
Abstract

Gene-to-gene networks, such as Gene Regulatory Networks (GRN) and Predictive Expression Networks (PEN) capture relationships between genes and are beneficial for use in downstream biological analyses. There exists multiple network inference tools to produce these gene-to-gene networks from matrices of gene expression data. Random Forest-Leave One Out Prediction (RF-LOOP) is a method that has been shown to be efficient at producing these gene-to-gene networks, frequently known as GEne Network Inference with Ensemble of trees (GENIE3). Random Forest can be replaced in this process by iterative Random Forest (iRF), which performs variable selection and boosting. Here we validate that iterative Random Forest-Leave One Out Prediction (iRF-LOOP) produces higher quality networks than GENIE3 (RF-LOOP). We use both synthetic and empirical networks from the Dialogue for Reverse Engineering Assessment and Methods (DREAM) Challenges by Sage Bionetworks, as well as two additional empirical networks created from and expression data.

摘要

基因到基因网络,如基因调控网络(GRN)和预测性表达网络(PEN),捕捉基因之间的关系,有利于用于下游生物学分析。存在多种网络推理工具可从基因表达数据矩阵生成这些基因到基因网络。随机森林留一法预测(RF-LOOP)是一种已被证明在生成这些基因到基因网络方面效率很高的方法,通常称为基于树集成的基因网络推理(GENIE3)。在此过程中,随机森林可以被迭代随机森林(iRF)取代,iRF可进行变量选择和增强。在这里,我们验证了迭代随机森林留一法预测(iRF-LOOP)生成的网络质量高于GENIE3(RF-LOOP)。我们使用了来自Sage生物网络公司逆向工程评估与方法对话(DREAM)挑战的合成网络和实证网络,以及从[具体内容1]和[具体内容2]表达数据创建的另外两个实证网络。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/9260260/1ca61333da26/gr9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/9260260/217bb62365b9/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/9260260/108aba54e8aa/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/9260260/943564bc4a62/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/9260260/cc9f9a042be9/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/9260260/d3d3f13c0447/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/9260260/66ccce04cd7f/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/9260260/6d73a6eb9ff5/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/9260260/c9f6c888324a/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/9260260/ac8ff383eb4c/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/9260260/1ca61333da26/gr9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/9260260/217bb62365b9/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/9260260/108aba54e8aa/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/9260260/943564bc4a62/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/9260260/cc9f9a042be9/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/9260260/d3d3f13c0447/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/9260260/66ccce04cd7f/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/9260260/6d73a6eb9ff5/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/9260260/c9f6c888324a/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/9260260/ac8ff383eb4c/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/9260260/1ca61333da26/gr9.jpg

相似文献

1
Evaluating the performance of random forest and iterative random forest based methods when applied to gene expression data.评估随机森林和基于迭代随机森林的方法应用于基因表达数据时的性能。
Comput Struct Biotechnol J. 2022 Jun 22;20:3372-3386. doi: 10.1016/j.csbj.2022.06.037. eCollection 2022.
2
A High-Performance Computing Implementation of Iterative Random Forest for the Creation of Predictive Expression Networks.基于迭代随机森林的高性能计算实现,用于创建预测表达网络。
Genes (Basel). 2019 Dec 2;10(12):996. doi: 10.3390/genes10120996.
3
Inferring regulatory networks from expression data using tree-based methods.基于树的方法从表达数据推断调控网络。
PLoS One. 2010 Sep 28;5(9):e12776. doi: 10.1371/journal.pone.0012776.
4
NIMEFI: gene regulatory network inference using multiple ensemble feature importance algorithms.NIMEFI:使用多种集成特征重要性算法进行基因调控网络推断
PLoS One. 2014 Mar 25;9(3):e92709. doi: 10.1371/journal.pone.0092709. eCollection 2014.
5
Integrative random forest for gene regulatory network inference.用于基因调控网络推断的集成随机森林
Bioinformatics. 2015 Jun 15;31(12):i197-205. doi: 10.1093/bioinformatics/btv268.
6
Unsupervised Gene Network Inference with Decision Trees and Random Forests.使用决策树和随机森林进行无监督基因网络推断
Methods Mol Biol. 2019;1883:195-215. doi: 10.1007/978-1-4939-8882-2_8.
7
Iterative random forests to discover predictive and stable high-order interactions.迭代随机森林发现预测和稳定的高阶交互。
Proc Natl Acad Sci U S A. 2018 Feb 20;115(8):1943-1948. doi: 10.1073/pnas.1711236115. Epub 2018 Jan 19.
8
dynGENIE3: dynamical GENIE3 for the inference of gene networks from time series expression data.dynGENIE3:用于从时间序列表达数据中推断基因网络的动态 GENIE3。
Sci Rep. 2018 Feb 21;8(1):3384. doi: 10.1038/s41598-018-21715-0.
9
A multicenter random forest model for effective prognosis prediction in collaborative clinical research network.多中心随机森林模型在协作临床研究网络中的有效预后预测。
Artif Intell Med. 2020 Mar;103:101814. doi: 10.1016/j.artmed.2020.101814. Epub 2020 Feb 5.
10
Network inference with ensembles of bi-clustering trees.基于二部聚类树集成的网络推断。
BMC Bioinformatics. 2019 Oct 28;20(1):525. doi: 10.1186/s12859-019-3104-y.

引用本文的文献

1
Integrated single-cell and transcriptomic analysis of bone marrow-derived metastatic neuroblastoma reveals molecular mechanisms of metabolic reprogramming.骨髓源性转移性神经母细胞瘤的单细胞与转录组学整合分析揭示代谢重编程的分子机制。
Sci Rep. 2025 Aug 5;15(1):28519. doi: 10.1038/s41598-025-13626-8.
2
Identification of biomarkers for the diagnosis of chronic kidney disease (CKD) with dilated cardiomyopathy (DCM) by bioinformatics analysis and machine learning.通过生物信息学分析和机器学习识别用于诊断伴有扩张型心肌病(DCM)的慢性肾脏病(CKD)的生物标志物。
Front Genet. 2025 May 30;16:1562891. doi: 10.3389/fgene.2025.1562891. eCollection 2025.
3

本文引用的文献

1
The Ancient Salicoid Genome Duplication Event: A Platform for Reconstruction of De Novo Gene Evolution in Populus trichocarpa.古萨利科基因组复制事件:杨属新基因进化重建的平台。
Genome Biol Evol. 2021 Sep 1;13(9). doi: 10.1093/gbe/evab198.
2
Plant Metabolic Network 15: A resource of genome-wide metabolism databases for 126 plants and algae.植物代谢网络 15:126 种植物和藻类的全基因组代谢数据库资源。
J Integr Plant Biol. 2021 Nov;63(11):1888-1905. doi: 10.1111/jipb.13163. Epub 2021 Oct 27.
3
The gibberellin signaling negative regulator RGA-LIKE3 promotes seed storage protein accumulation.
STLBRF: an improved random forest algorithm based on standardized-threshold for feature screening of gene expression data.
STLBRF:一种基于标准化阈值的改进随机森林算法,用于基因表达数据的特征筛选。
Brief Funct Genomics. 2025 Jan 15;24. doi: 10.1093/bfgp/elae048.
4
Machine learning-based prediction of the risk of moderate-to-severe catheter-related bladder discomfort in general anaesthesia patients: a prospective cohort study.基于机器学习的全麻患者中中重度导管相关膀胱不适风险预测:一项前瞻性队列研究。
BMC Anesthesiol. 2024 Sep 19;24(1):334. doi: 10.1186/s12871-024-02720-5.
5
Machine Learning Prediction of Quantum Yields and Wavelengths of Aggregation-Induced Emission Molecules.聚集诱导发光分子量子产率和波长的机器学习预测
Materials (Basel). 2024 Apr 4;17(7):1664. doi: 10.3390/ma17071664.
6
Screening for immune-related biomarkers associated with myasthenia gravis and dilated cardiomyopathy based on bioinformatics analysis and machine learning.基于生物信息学分析和机器学习筛选与重症肌无力和扩张型心肌病相关的免疫相关生物标志物
Heliyon. 2024 Mar 20;10(7):e28446. doi: 10.1016/j.heliyon.2024.e28446. eCollection 2024 Apr 15.
7
Machine Learning Algorithms Applied to Predict Autism Spectrum Disorder Based on Gut Microbiome Composition.基于肠道微生物群组成应用机器学习算法预测自闭症谱系障碍
Biomedicines. 2023 Sep 26;11(10):2633. doi: 10.3390/biomedicines11102633.
8
Gene regulatory network inference using mixed-norms regularized multivariate model with covariance selection.基于协方差选择的混合范数正则化多元模型的基因调控网络推断
PLoS Comput Biol. 2023 Jul 31;19(7):e1010832. doi: 10.1371/journal.pcbi.1010832. eCollection 2023 Jul.
9
A novel intelligent displacement prediction model of karst tunnels.一种新型的岩溶隧道智能位移预测模型。
Sci Rep. 2022 Oct 10;12(1):16983. doi: 10.1038/s41598-022-21333-x.
赤霉素信号负调控因子RGA-LIKE3促进种子贮藏蛋白积累。
Plant Physiol. 2021 Apr 23;185(4):1697-1707. doi: 10.1093/plphys/kiaa114.
4
New function of Hypoxia-responsive unknown protein in enhanced resistance to biotic stress.缺氧应答未知蛋白的新功能可增强生物胁迫抗性。
Plant Signal Behav. 2021 Mar 4;16(3):1868131. doi: 10.1080/15592324.2020.1868131. Epub 2020 Dec 28.
5
A High-Performance Computing Implementation of Iterative Random Forest for the Creation of Predictive Expression Networks.基于迭代随机森林的高性能计算实现,用于创建预测表达网络。
Genes (Basel). 2019 Dec 2;10(12):996. doi: 10.3390/genes10120996.
6
Unified feature association networks through integration of transcriptomic and proteomic data.通过整合转录组学和蛋白质组学数据的统一特征关联网络。
PLoS Comput Biol. 2019 Sep 17;15(9):e1007241. doi: 10.1371/journal.pcbi.1007241. eCollection 2019 Sep.
7
Gene length corrected trimmed mean of M-values (GeTMM) processing of RNA-seq data performs similarly in intersample analyses while improving intrasample comparisons.基因长度校正后的均方根值(GeTMM)处理 RNA-seq 数据在样本间分析中表现相似,同时改善了样本内比较。
BMC Bioinformatics. 2018 Jun 22;19(1):236. doi: 10.1186/s12859-018-2246-7.
8
Iterative random forests to discover predictive and stable high-order interactions.迭代随机森林发现预测和稳定的高阶交互。
Proc Natl Acad Sci U S A. 2018 Feb 20;115(8):1943-1948. doi: 10.1073/pnas.1711236115. Epub 2018 Jan 19.
9
SCENIC: single-cell regulatory network inference and clustering.SCENIC:单细胞调控网络推断与聚类
Nat Methods. 2017 Nov;14(11):1083-1086. doi: 10.1038/nmeth.4463. Epub 2017 Oct 9.
10
Recursive random forest algorithm for constructing multilayered hierarchical gene regulatory networks that govern biological pathways.用于构建调控生物途径的多层层次基因调控网络的递归随机森林算法。
PLoS One. 2017 Feb 3;12(2):e0171532. doi: 10.1371/journal.pone.0171532. eCollection 2017.