• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

致编辑的信:关于随机森林文献中“交互作用”一词及相关表述

Letter to the Editor: On the term 'interaction' and related phrases in the literature on Random Forests.

作者信息

Boulesteix Anne-Laure, Janitza Silke, Hapfelmeier Alexander, Van Steen Kristel, Strobl Carolin

出版信息

Brief Bioinform. 2015 Mar;16(2):338-45. doi: 10.1093/bib/bbu012. Epub 2014 Apr 9.

DOI:10.1093/bib/bbu012
PMID:24723569
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4364067/
Abstract

In an interesting and quite exhaustive review on Random Forests (RF) methodology in bioinformatics Touw et al. address--among other topics--the problem of the detection of interactions between variables based on RF methodology. We feel that some important statistical concepts, such as 'interaction', 'conditional dependence' or 'correlation', are sometimes employed inconsistently in the bioinformatics literature in general and in the literature on RF in particular. In this letter to the Editor, we aim to clarify some of the central statistical concepts and point out some confusing interpretations concerning RF given by Touw et al. and other authors.

摘要

在一篇关于生物信息学中随机森林(RF)方法的有趣且相当详尽的综述中,陶乌等人探讨了——除其他主题外——基于RF方法检测变量间相互作用的问题。我们认为,一些重要的统计概念,如“相互作用”“条件依赖性”或“相关性”,在一般的生物信息学文献中,尤其是在关于RF的文献中,有时使用并不一致。在这封给编辑的信中,我们旨在阐明一些核心统计概念,并指出陶乌等人及其他作者对RF给出的一些令人困惑的解释。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16fe/4364067/6dc59133655a/bbu012f2p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16fe/4364067/304e0377e3d1/bbu012f1p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16fe/4364067/6dc59133655a/bbu012f2p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16fe/4364067/304e0377e3d1/bbu012f1p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16fe/4364067/6dc59133655a/bbu012f2p.jpg

相似文献

1
Letter to the Editor: On the term 'interaction' and related phrases in the literature on Random Forests.致编辑的信:关于随机森林文献中“交互作用”一词及相关表述
Brief Bioinform. 2015 Mar;16(2):338-45. doi: 10.1093/bib/bbu012. Epub 2014 Apr 9.
2
Data mining in the Life Sciences with Random Forest: a walk in the park or lost in the jungle?生命科学中的随机森林数据挖掘:是漫步公园还是迷失丛林?
Brief Bioinform. 2013 May;14(3):315-26. doi: 10.1093/bib/bbs034. Epub 2012 Jul 10.
3
A comparative study of forest methods for time-to-event data: variable selection and predictive performance.森林方法在生存时间数据中的比较研究:变量选择和预测性能。
BMC Med Res Methodol. 2021 Sep 25;21(1):193. doi: 10.1186/s12874-021-01386-8.
4
A comparison of the conditional inference survival forest model to random survival forests based on a simulation study as well as on two applications with time-to-event data.基于模拟研究以及对两个事件发生时间数据应用的情况,对条件推断生存森林模型与随机生存森林进行比较。
BMC Med Res Methodol. 2017 Jul 28;17(1):115. doi: 10.1186/s12874-017-0383-8.
5
Unbiased split variable selection for random survival forests using maximally selected rank statistics.使用最大选择秩统计量对随机生存森林进行无偏分裂变量选择。
Stat Med. 2017 Apr 15;36(8):1272-1284. doi: 10.1002/sim.7212. Epub 2017 Jan 15.
6
A multicenter random forest model for effective prognosis prediction in collaborative clinical research network.多中心随机森林模型在协作临床研究网络中的有效预后预测。
Artif Intell Med. 2020 Mar;103:101814. doi: 10.1016/j.artmed.2020.101814. Epub 2020 Feb 5.
7
Intervention in prediction measure: a new approach to assessing variable importance for random forests.预测度量中的干预:一种评估随机森林变量重要性的新方法。
BMC Bioinformatics. 2017 May 2;18(1):230. doi: 10.1186/s12859-017-1650-8.
8
SNP interaction detection with Random Forests in high-dimensional genetic data.利用随机森林在高维遗传数据中检测 SNP 相互作用。
BMC Bioinformatics. 2012 Jul 15;13:164. doi: 10.1186/1471-2105-13-164.
9
Practical aspects of gene regulatory inference via conditional inference forests from expression data.通过条件推断森林从表达数据进行基因调控推断的实践方面。
Genet Epidemiol. 2016 Dec;40(8):767-778. doi: 10.1002/gepi.22017. Epub 2016 Nov 11.
10
Sequential feature selection and inference using multi-variate random forests.基于多变量随机森林的序列特征选择和推断。
Bioinformatics. 2018 Apr 15;34(8):1336-1344. doi: 10.1093/bioinformatics/btx784.

引用本文的文献

1
Quantifying soil wind erosion attribution in Inner Mongolia's desert grassland.内蒙古荒漠草原土壤风蚀归因量化研究
Sci Rep. 2025 Apr 24;15(1):14319. doi: 10.1038/s41598-025-98339-8.
2
Insights into heart failure metabolite markers through explainable machine learning.通过可解释机器学习深入了解心力衰竭代谢物标志物
Comput Struct Biotechnol J. 2025 Mar 4;27:1012-1022. doi: 10.1016/j.csbj.2025.02.041. eCollection 2025.
3
Associations between NIH Toolbox Emotion Battery measures and previous suicide attempt in bipolar I disorder.

本文引用的文献

1
Genome-wide analysis of A-to-I RNA editing by single-molecule sequencing in Drosophila.在果蝇中通过单分子测序进行全基因组范围内的 A-to-I RNA 编辑分析。
Nat Struct Mol Biol. 2013 Nov;20(11):1333-9. doi: 10.1038/nsmb.2675. Epub 2013 Sep 29.
2
Pathway hunting by random survival forests.随机生存森林的通路搜索。
Bioinformatics. 2013 Jan 1;29(1):99-105. doi: 10.1093/bioinformatics/bts643. Epub 2012 Nov 4.
3
SNP interaction detection with Random Forests in high-dimensional genetic data.利用随机森林在高维遗传数据中检测 SNP 相互作用。
美国国立卫生研究院工具箱情绪量表与双相I型障碍既往自杀未遂之间的关联。
J Affect Disord. 2025 Mar 1;372:470-480. doi: 10.1016/j.jad.2024.12.040. Epub 2024 Dec 11.
4
Transcriptome Study of 2 Black Cohorts Reveals Long Noncoding RNAs Associated With Hypertension-Related mRNAs.2 个黑人队列的转录组研究揭示了与高血压相关的 mRNAs 相关的长非编码 RNA。
J Am Heart Assoc. 2024 Jun 4;13(11):e034417. doi: 10.1161/JAHA.124.034417. Epub 2024 May 31.
5
Consequences of domestication in eastern oyster: Insights from whole genomic analyses.东部牡蛎驯化的后果:来自全基因组分析的见解
Evol Appl. 2024 May 29;17(6):e13710. doi: 10.1111/eva.13710. eCollection 2024 Jun.
6
Directives in Covid-19 government guidance: an international comparison.新冠疫情期间政府指导方针中的指令:一项国际比较。
Appl Corpus Linguistics. 2023 Jun 22:100063. doi: 10.1016/j.acorp.2023.100063.
7
Development and external validation of a machine learning-based prediction model for the cancer-related fatigue diagnostic screening in adult cancer patients: a cross-sectional study in China.基于机器学习的癌症相关疲劳诊断筛选在成年癌症患者中的预测模型的开发和外部验证:中国的一项横断面研究。
Support Care Cancer. 2023 Jan 10;31(2):106. doi: 10.1007/s00520-022-07570-w.
8
Humans and Hoofed Livestock Are the Main Sources of Fecal Contamination of Rivers Used for Crop Irrigation: A Microbial Source Tracking Approach.人类和有蹄类家畜是用于作物灌溉的河流粪便污染的主要来源:一种微生物源追踪方法。
Front Microbiol. 2022 Jun 30;13:768527. doi: 10.3389/fmicb.2022.768527. eCollection 2022.
9
Overview of Metabolomic Analysis and the Integration with Multi-Omics for Economic Traits in Cattle.牛经济性状的代谢组学分析及其与多组学整合概述
Metabolites. 2021 Oct 30;11(11):753. doi: 10.3390/metabo11110753.
10
Predictive Factors of Recovery after an Acute Lateral Ankle Sprain: A Longitudinal Study.急性外侧踝关节扭伤后恢复的预测因素:一项纵向研究
Sports (Basel). 2021 Mar 18;9(3):41. doi: 10.3390/sports9030041.
BMC Bioinformatics. 2012 Jul 15;13:164. doi: 10.1186/1471-2105-13-164.
4
Data mining in the Life Sciences with Random Forest: a walk in the park or lost in the jungle?生命科学中的随机森林数据挖掘:是漫步公园还是迷失丛林?
Brief Bioinform. 2013 May;14(3):315-26. doi: 10.1093/bib/bbs034. Epub 2012 Jul 10.
5
Binding profiles of chromatin-modifying proteins are predictive for transcriptional activity and promoter-proximal pausing.染色质修饰蛋白的结合图谱可预测转录活性和启动子近端暂停。
J Comput Biol. 2012 Feb;19(2):126-38. doi: 10.1089/cmb.2011.0258.
6
Random forest Gini importance favours SNPs with large minor allele frequency: impact, sources and recommendations.随机森林基尼重要性有利于具有较大次要等位基因频率的 SNPs:影响、来源和建议。
Brief Bioinform. 2012 May;13(3):292-304. doi: 10.1093/bib/bbr053. Epub 2011 Sep 10.
7
Letter to the editor: on the stability and ranking of predictors from random forest variable importance measures.致编辑的信:关于随机森林变量重要性度量的预测因子的稳定性和排名。
Brief Bioinform. 2011 Jul;12(4):369-73. doi: 10.1093/bib/bbr016. Epub 2011 Apr 15.
8
Evaluation of random forests performance for genome-wide association studies in the presence of interaction effects.存在交互作用时全基因组关联研究中随机森林性能的评估。
BMC Proc. 2009 Dec 15;3 Suppl 7(Suppl 7):S64. doi: 10.1186/1753-6561-3-s7-s64.
9
An introduction to recursive partitioning: rationale, application, and characteristics of classification and regression trees, bagging, and random forests.递归分区介绍:分类和回归树、装袋和随机森林的原理、应用和特点。
Psychol Methods. 2009 Dec;14(4):323-48. doi: 10.1037/a0016973.
10
Capturing the spectrum of interaction effects in genetic association studies by simulated evaporative cooling network analysis.通过模拟蒸发冷却网络分析在基因关联研究中捕捉相互作用效应谱。
PLoS Genet. 2009 Mar;5(3):e1000432. doi: 10.1371/journal.pgen.1000432. Epub 2009 Mar 20.