Suppr超能文献

致编辑的信:关于随机森林变量重要性度量的预测因子的稳定性和排名。

Letter to the editor: on the stability and ranking of predictors from random forest variable importance measures.

出版信息

Brief Bioinform. 2011 Jul;12(4):369-73. doi: 10.1093/bib/bbr016. Epub 2011 Apr 15.

Abstract

A recent study examined the stability of rankings from random forests using two variable importance measures (mean decrease accuracy (MDA) and mean decrease Gini (MDG)) and concluded that rankings based on the MDG were more robust than MDA. However, studies examining data-specific characteristics on ranking stability have been few. Rankings based on the MDG measure showed sensitivity to within-predictor correlation and differences in category frequencies, even when the number of categories was held constant, and thus may produce spurious results. The MDA measure was robust to these data characteristics. Further, under strong within-predictor correlation, MDG rankings were less stable than those using MDA.

摘要

最近的一项研究使用两种变量重要性度量(平均减少精度(MDA)和平均减少基尼(MDG))来检验随机森林的排名稳定性,并得出结论,基于 MDG 的排名比 MDA 更稳健。然而,关于排名稳定性的特定数据特征的研究很少。即使类别数量保持不变,基于 MDG 度量的排名也对预测器内相关性和类别频率差异敏感,因此可能产生虚假结果。MDA 度量对这些数据特征具有鲁棒性。此外,在强预测器内相关性下,MDG 排名比 MDA 排名稳定性差。

相似文献

引用本文的文献

7
Biogeography and environmental preferences of (Mart.) Becc.(马尔特)贝奇的生物地理学与环境偏好
Ecol Evol. 2023 Nov 27;13(11):e10749. doi: 10.1002/ece3.10749. eCollection 2023 Nov.

本文引用的文献

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验